https://github.com/becksteinlab/parallel-analysis-in-the-mdanalysis-library
Benchmarking MDAnalysis with Dask (and MPI). Supplementary Information for SciPy 2017 paper.
https://github.com/becksteinlab/parallel-analysis-in-the-mdanalysis-library
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 9 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.0%) to scientific vocabulary
Keywords
Repository
Benchmarking MDAnalysis with Dask (and MPI). Supplementary Information for SciPy 2017 paper.
Basic Info
- Host: GitHub
- Owner: Becksteinlab
- License: mit
- Language: Python
- Default Branch: master
- Homepage: http://conference.scipy.org/proceedings/scipy2017/mahzad_khoslessan.html
- Size: 78.1 KB
Statistics
- Stars: 3
- Watchers: 3
- Forks: 4
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Parallel analysis in the MDAnalysis Library
We present a benchmark suite that can be used to evaluate performance for parallel map-reduce type analysis and use it to investigate the performance of MDAnalysis with the Dask library for task-graph based computing (Khoslessan et al, 2017).
A range of commonly used MD file formats (CHARMM/NAMD DCD, Gromacs XTC, Amber NetCDF) and different trajectory sizes are tested on different high-performance computing (HPC) resources. Benchmarks are performed both on a single node and across multiple nodes.
For space reasons, not all data could be shown in the SciPy 2017 conference proceedings paper. For a full analysis see the Technical Report (Khoshlessan and Beckstein, 2017). The report is available on figshare at DOI 10.6084/m9.figshare.4695742.
Supplementary information for SciPy 2017 paper
This repository should be considered part of the Supplementary information to the SciPy 2017 Proceedings paper (Khoslessan et al, 2017).
Benchmarking code
The repository contain the code to benchmark parallelization of MDAnalysis: * RMSD calculation with MDAnalysis with Dask for XTC, DCD, NCDF * RMSD calculation with MDAnalysis with MPI
Data files
The data files consist of a topology file adk4AKE.psf (in CHARMM PSF format; N = 3341 atoms)
and a trajectory 1ake_007-nowater-core-dt240ps.dcd (DCD format) of length 1.004 μs with
4187 frames; both are freely available under the CC-BY license from figshare at DOI 10.6084/m9.figshare.5108170
Files in XTC and NetCDF formats are generated from the DCD.
Tested libraries
- MDAnalysis 0.15.0
- Dask 0.12.0 (also 0.13.0)
- Distributed 1.14.3 (also 1.15.1)
- NumPy 1.11.2 (also 1.12.0)
Comments and Questions
Please raise issues in the issue tracker or ask on the MDAnalysis developer mailing list.
References
M. Khoshlessan, I. Paraskevakos, S. Jha, and O. Beckstein (2017). Parallel analysis in MDAnalysis using the Dask parallel computing library. In S. Benthall and S. Rostrup, editors, Proceedings of the 16th Python in Science Conference, Austin, TX, 2017. SciPy.
Khoshlessan, Mahzad; Beckstein, Oliver (2017): Parallel analysis in the MDAnalysis Library: Benchmark of Trajectory File Formats. Technical report, Arizona State University, Tempe, AZ, 2017. figshare. doi:10.6084/m9.figshare.4695742
Owner
- Name: Becksteinlab
- Login: Becksteinlab
- Kind: organization
- Email: obeckste@asu.edu
- Location: Tempe, AZ
- Website: https://becksteinlab.physics.asu.edu
- Repositories: 56
- Profile: https://github.com/Becksteinlab
Computational Biophysics at Arizona State University
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 2
- Total pull requests: 0
- Average time to close issues: about 1 hour
- Average time to close pull requests: N/A
- Total issue authors: 2
- Total pull request authors: 0
- Average comments per issue: 1.5
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- iparask (1)
- orbeckst (1)