Molearn

Molearn: a Python package streamlining the design of generative models of biomolecular dynamics - Published in JOSS (2023)

https://github.com/degiacomi-lab/molearn

Science Score: 95.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: aps.org, joss.theoj.org, zenodo.org
  • Committers with academic emails
    5 of 9 committers (55.6%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

machine-learning molecular-dynamics molecular-modeling protein-structure
Last synced: 6 months ago · JSON representation

Repository

protein conformational spaces meet machine learning

Basic Info
Statistics
  • Stars: 47
  • Watchers: 8
  • Forks: 15
  • Open Issues: 3
  • Releases: 8
Topics
machine-learning molecular-dynamics molecular-modeling protein-structure
Created over 7 years ago · Last pushed 8 months ago
Metadata Files
Readme Contributing License

README.md

molearn

status Documentation Status TEST DOI

protein conformational spaces meet machine learning

molearn is a Python package streamlining the implementation of machine learning models dedicated to the generation of protein conformations from example data obtained via experiment or molecular simulation.

Included in this repository are the following: * Source code in the molearn folder * Software documentation (API and FAQ) in the docs folder, also accessible at molearn.readthedocs.io. * Example training and analysis scripts, along with example data, in the examples folder

Dependencies

The current version of molearn only supports Linux, and has verified to support Python >=3.9.

Required Packages

Optional Packages

To prepare a raw trajectory for training: * mdtraj

To run energy evaluations with OpenMM: * OpenMM * openmmtorchplugin

To evaluate Sinkhorn distances during training: * geomloss

To calculate DOPE and Ramachandran scores during analysis: * Modeller (requires academic license) * cctbx

To run the GUI: * plotly * NGLView

Installation

Anaconda installation from conda-forge

The most recent release can be obtained through Anaconda:

conda install molearn -c conda-forge or the much faster mamba install -c conda-forge molearn

We advise the installation is carried out in a new environment.

Clone the repo and manually install

Manual installation requires the following three steps: * Clone the repository git clone https://github.com/Degiacomi-Lab/molearn.git * Install all required packages (see section Dependencies > Required Packages, above). The easiest way is by calling mamba install -c conda-forge --only-deps molearn, where the option --only-deps will install the molearn required dependencies but not molearn itself. Optionally, packages enabling additional molearn functionalities can also be installed. This has to be done manually (see links in Dependencies > Optional Packages). * Use pip to install molearn from within the molearn directory python -m pip install .

Using molearn without installation

Molearn can used without installation by making the sure the requirements above are met, and adding the src directory to your path at the beginning of every script. For instance, to install all requirements in a new environment molearn_env: conda env create --file environment.yml -n molearn_env Then, within this environment, run scripts starting with:

import sys sys.path.insert(0, 'path/to/molearn/src') import molearn

Note in case of any installation issue, please consult our FAQ

Usage

  • See example scripts in the examples folder.
  • Jupyter notebook tutorials describing the usage of a trained neural network are available here.
  • software API and a FAQ page are available at molearn.readthedocs.io.

References

If you use molearn in your work, please cite: S.C. Musson and M.T. Degiacomi (2023). Molearn: a Python package streamlining the design of generative models of biomolecular dynamics. Journal of Open Source Software, 8(89), 5523

Theory and benchmarks of a neural network training against protein conformational spaces are presented here: V.K. Ramaswamy, S.C. Musson, C.G. Willcocks, M.T. Degiacomi (2021). Learning protein conformational space with convolutions and latent interpolations, Physical Review X 11

Contributing

For information on how to report bugs, request new features, or contribute to the code, please see CONTRIBUTING.md. For any other question please contact matteo.degiacomi@ed.ac.uk.

Owner

  • Name: Degiacomi Research Group
  • Login: Degiacomi-Lab
  • Kind: organization
  • Email: matteo.t.degiacomi@durham.ac.uk
  • Location: United Kingdom

JOSS Publication

Molearn: a Python package streamlining the design of generative models of biomolecular dynamics
Published
September 05, 2023
Volume 8, Issue 89, Page 5523
Authors
Samuel C. Musson ORCID
Department of Physics, Durham University, United Kingdom
Matteo T. Degiacomi ORCID
Department of Physics, Durham University, United Kingdom
Editor
Richard Gowers ORCID
Tags
machine learning molecular dynamics proteins

GitHub Events

Total
  • Watch event: 5
  • Delete event: 1
  • Member event: 1
  • Issue comment event: 2
  • Push event: 27
  • Pull request review event: 1
  • Pull request event: 16
  • Fork event: 4
  • Create event: 2
Last Year
  • Watch event: 5
  • Delete event: 1
  • Member event: 1
  • Issue comment event: 2
  • Push event: 27
  • Pull request review event: 1
  • Pull request event: 16
  • Fork event: 4
  • Create event: 2

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 330
  • Total Committers: 9
  • Avg Commits per committer: 36.667
  • Development Distribution Score (DDS): 0.612
Past Year
  • Commits: 33
  • Committers: 5
  • Avg Commits per committer: 6.6
  • Development Distribution Score (DDS): 0.576
Top Committers
Name Email Commits
Matteo Degiacomi d****m 128
SCM s****n@d****k 85
Samuel Musson w****1@n****k 66
gwirn 7****n 27
Yanchen Zhu r****u@g****m 14
rzhu r****u@n****k 5
DEGIACOMI x****5@d****k 3
Asal Azar y****l@g****m 1
Samuel Musson w****1@g****k 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 12
  • Total pull requests: 28
  • Average time to close issues: 26 days
  • Average time to close pull requests: 19 days
  • Total issue authors: 6
  • Total pull request authors: 5
  • Average comments per issue: 3.17
  • Average comments per pull request: 0.14
  • Merged pull requests: 17
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 18
  • Average time to close issues: N/A
  • Average time to close pull requests: 5 days
  • Issue authors: 0
  • Pull request authors: 4
  • Average comments per issue: 0
  • Average comments per pull request: 0.17
  • Merged pull requests: 9
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • RMeli (4)
  • anandojha (2)
  • JoaoRodrigues (2)
  • ryankzhu (2)
  • ABChh26 (1)
Pull Request Authors
  • gwirn (18)
  • EngAsal (12)
  • SCMusson (5)
  • ryankzhu (5)
  • degiacom (2)
Top Labels
Issue Labels
question (1) help wanted (1) bug (1)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads: unknown
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 3
conda-forge.org: molearn

This software trains a generative neural network on an ensemble of molecular conformations (typically obtained by molecular dynamics). The trained model can be used to generate new, plausible conformations repesentative of poorly sampled transition states.

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 34.0%
Average: 46.7%
Stargazers count: 47.3%
Dependent packages count: 51.2%
Forks count: 54.2%
Last synced: 6 months ago