https://github.com/bsc-wdc/pycompss_bda
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file (found)
- ✓ .zenodo.json file (found)
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (6.2%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: bsc-wdc
- License: apache-2.0
- Language: Python
- Default Branch: master
- Size: 225 KB
Statistics
- Stars: 0
- Watchers: 7
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
PyCOMPSs Big Data Analytics
Available applications in the repository:
- Cascade Support Vector Machines
- K-means
The applications are implemented using both PyCOMPSs and MPI for comparison purposes.
To know more check:
Running the Apps
The folder ./scripts contains sample scripts to run the four implementations.
The scripts named run_APP_IMPLEMENTATION.sh are used to run the applications locally.
The scripts named enqueue_APP_IMPLEMENTATION.sh are used to run the applications in a supercomputer.
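The naming convention above can be parsed mechanically, for example to list which application and implementation each script targets. The following is a minimal sketch, assuming each script name splits into a mode prefix (`run` or `enqueue`), an application name, and an implementation name that contains no underscore. The function `parse_script_name` and the file names in the usage note are hypothetical and are not taken from the repository.

```python
import re
from typing import NamedTuple, Optional

# Pattern for the naming scheme described above: <mode>_<APP>_<IMPLEMENTATION>.sh
# Assumption: the implementation part contains no underscore.
_SCRIPT_RE = re.compile(r"^(run|enqueue)_(.+)_([^_]+)\.sh$")


class ScriptInfo(NamedTuple):
    mode: str            # "run" (local) or "enqueue" (supercomputer)
    app: str             # application part of the file name
    implementation: str  # implementation part of the file name


def parse_script_name(filename: str) -> Optional[ScriptInfo]:
    """Return the parts of a script name, or None if it does not match."""
    match = _SCRIPT_RE.match(filename)
    if match is None:
        return None
    return ScriptInfo(*match.groups())
```

With a hypothetical file name such as `run_kmeans_pycompss.sh`, the function would return mode `run`, application `kmeans`, and implementation `pycompss`. A name that does not follow the convention, such as `get_complexities.sh`, yields `None`.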
The PyCOMPSs version of the script can be used out of the box on many supercomputers. The MPI version is tied to the SLURM queueing system because, unlike PyCOMPSs, MPI is not platform-agnostic.
Computing Complexity
To compute the complexity metrics of the K-means and C-SVM applications for both the MPI and PyCOMPSs implementations, run the script:
./scripts/get_complexities.sh
This script reports 3 complexity metrics:
- Source Lines of Code (SLOC)
- Cyclomatic complexity
- NPath complexity
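To give an intuition for the first two metrics, here is a minimal sketch that approximates SLOC and cyclomatic complexity for Python source using only the standard `ast` module. It is not what `get_complexities.sh` does internally, and its counting rules are simplifying assumptions, so its numbers may differ from those of the actual tools. NPath complexity is not covered here.

```python
import ast

# Node types counted as decision points in this simplified metric (an assumption;
# real tools may count a different set of constructs).
_BRANCH_NODES = (ast.If, ast.For, ast.While, ast.ExceptHandler,
                 ast.IfExp, ast.comprehension)


def sloc(source: str) -> int:
    """Count non-blank lines that are not pure '#' comment lines."""
    return sum(1 for line in source.splitlines()
               if line.strip() and not line.strip().startswith("#"))


def cyclomatic_complexity(source: str) -> int:
    """Approximate McCabe complexity: 1 + number of decision points."""
    complexity = 1
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, _BRANCH_NODES):
            complexity += 1
        elif isinstance(node, ast.BoolOp):
            # 'a and b and c' has three operands and adds two extra paths.
            complexity += len(node.values) - 1
    return complexity


example = '''
def sign(x):
    # classify a number
    if x > 0 and x < 10:
        return 1
    elif x < 0:
        return -1
    return 0
'''

print(sloc(example))                   # 6
print(cyclomatic_complexity(example))  # 4
```

In the example, the comment and blank lines are excluded from SLOC, and the complexity of 4 comes from the base path plus the `if`, the `elif`, and the `and` operator.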
Requirements:
This script uses Babelfish tools to compute the Cyclomatic and NPath complexities, and cloc for the SLOC.
Owner
- Name: Workflows and Distributed Computing
- Login: bsc-wdc
- Kind: organization
- Email: distributed_computing@bsc.es
- Location: Barcelona
- Website: https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing
- Repositories: 13
- Profile: https://github.com/bsc-wdc
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0