biopymlff

Machine Learning Force Fields for biochemical systems

https://github.com/zhengcheng233/biopymlff

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 6 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.6%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Machine Learning Force Fields for biochemical systems

Basic Info
  • Host: GitHub
  • Owner: zhengcheng233
  • License: mit
  • Default Branch: master
  • Homepage:
  • Size: 14.6 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 4 years ago · Last pushed about 4 years ago
Metadata Files
Readme Contributing License Code of conduct Citation

README.md

biopymlff(WIP)

About biopymlff

A library for developing near constant scaling quantum mechanical quality Machine Learning Force Fields for large biochemical systems.

For more information checkout the documentation.

Dependencies

External * lsqc * amber20 * gaussian * mopac * psi4 * openbabel * stoml

Internal * See setup.py

Installation

bash git clone https://github.com/saandre15/biopymlff cd biopymlff ./setup.py

ML Model Snapshot Downloads

New biomolecule structures are downloaded from the PDB databank. Then the structures are fragmented and fed into the biopymlff program on Frontera at UTAustin and Titan at UTDallas. Everyday the models are snapshotted and publically updated on Github for public use.

Proteins

| Name | Version | Status | Computational Dataset Size | Download | Experimental Dataset Size | Download | |------| ------- | ---------------- | -------- | --------- | ------- | --------- | | GEBFGAP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBFDP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBF_HDNN | v0.1 | WIP | 0 | Download | 0 | Download |

Lipids

| Name | Version | Status | Computational Dataset Size | Download | Experimental Dataset Size | Download | |------| ------- | ---------------- | -------- | --------- | ------- | --------- | | GEBFGAP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBFDP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBF_HDNN | v0.1 | WIP | 0 | Download | 0 | Download |

DNA/RNA

| Name | Version | Status | Computational Dataset Size | Download | Experimental Dataset Size | Download | |------| ------- | ---------------- | -------- | --------- | ------- | --------- | | GEBFGAP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBFDP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBF_HDNN | v0.1 | WIP | 0 | Download | 0 | Download |

Water Model

| Name | Version | Status | Computational Dataset Size | Download | Experimental Dataset Size | Download | |------| ------- | ---------------- | -------- | --------- | ------- | --------- | | GEBFGAP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBFDP | v0.1 | WIP | 0 | Download | 0 | Download | | GEBF_HDNN | v0.1 | WIP | 0 | Download | 0 | Download |

Contact

  • Torabifard Group(UT Dallas)

Citation

See CITATION.cff

References

  • [1] Cheng, Zheng, et al. “Building Quantum Mechanics Quality Force Fields of Proteins with the Generalized Energy-Based Fragmentation Approach and Machine Learning.” Physical Chemistry Chemical Physics, 2021, https://doi.org/10.1039/d1cp03934b.
  • [2] Ko, Tsz Wai, et al. “A Fourth-Generation High-Dimensional Neural Network Potential with Accurate Electrostatics Including Non-Local Charge Transfer.” Nature Communications, vol. 12, no. 1, 2021, https://doi.org/10.1038/s41467-020-20427-2.
  • [3] Aric A. Hagberg, Daniel A. Schult and Pieter J. Swart, “Exploring network structure, dynamics, and function using NetworkX”, in Proceedings of the 7th Python in Science Conference (SciPy2008), Gäel Varoquaux, Travis Vaught, and Jarrod Millman (Eds), (Pasadena, CA USA), pp. 11–15, Aug 2008
  • [4] Bartók, Albert P., and Gábor Csányi. “Gaussian Approximation Potentials: A Brief Tutorial Introduction.” International Journal of Quantum Chemistry, vol. 115, no. 16, 2015, pp. 1051–1057., https://doi.org/10.1002/qua.24927.
  • [5] Han, Jiequn, Linfeng Zhang, and Roberto Car. "Deep potential: A general representation of a many-body potential energy surface." arXiv preprint arXiv:1707.01478 (2017).
  • [6] Kocer, Emir, Tsz Wai Ko, and Jörg Behler. "Neural Network Potentials: A Concise Overview of Methods." arXiv preprint arXiv:2107.03727 (2021).
  • [7] Ko, Tsz Wai, et al. "A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer." Nature communications 12.1 (2021): 1-11.
  • [8] Salakhutdinov, Russ. “Lecture 2.” STA 4273H: Statistical Machine Learning. 20 Nov. 2021.
  • [9] Rugar, Daniel, and Paul Hansma. "Atomic force microscopy." Physics today 43.10 (1990): 23-30.
  • [10] Mahoney, Michael W., and Petros Drineas. "CUR matrix decompositions for improved data analysis." Proceedings of the National Academy of Sciences 106.3 (2009): 697-702.

Owner

  • Name: Zheng Cheng
  • Login: zhengcheng233
  • Kind: user
  • Location: BeiJing
  • Company: aisi

Citation (CITATION.cff)

@misc{
  biopymlff,
  author = { Andre S. },
  title = { biopymlff },
  year = { 2021 },
  publisher = { GitHub },
  journal = { GitHub repository },
  howpublished = {\url{https://github.com/saandre15/biopymlff}}
}

GitHub Events

Total
Last Year

Dependencies

Pipfile pypi
  • pytest * develop
  • ase *
  • deepmd-kit *
  • dscribe *
  • mendeleev *
  • multipledispatch *
  • networkx *
  • pysmiles *
  • quippy-ase *
  • tensorflow *
  • tk *
  • toml *
Pipfile.lock pypi
  • attrs ==21.2.0 develop
  • iniconfig ==1.1.1 develop
  • packaging ==21.3 develop
  • pluggy ==1.0.0 develop
  • py ==1.11.0 develop
  • pyparsing ==3.0.6 develop
  • pytest ==6.2.5 develop
  • toml ==0.10.2 develop
  • absl-py ==1.0.0
  • ase ==3.22.1
  • astunparse ==1.6.3
  • bracex ==2.2.1
  • cachetools ==4.2.4
  • certifi ==2021.10.8
  • charset-normalizer ==2.0.8
  • colorama ==0.4.4
  • cycler ==0.11.0
  • dargs ==0.2.6
  • deepmd-kit ==2.0.3
  • dscribe ==1.2.0
  • f90wrap ==0.2.7
  • flatbuffers ==2.0
  • fonttools ==4.28.2
  • gast ==0.4.0
  • google-auth ==2.3.3
  • google-auth-oauthlib ==0.4.6
  • google-pasta ==0.2.0
  • greenlet ==1.1.2
  • grpcio ==1.42.0
  • h5py ==3.6.0
  • idna ==3.3
  • importlib-metadata ==4.8.2
  • joblib ==1.1.0
  • keras ==2.7.0
  • keras-preprocessing ==1.1.2
  • kiwisolver ==1.3.2
  • libclang ==12.0.0
  • llvmlite ==0.37.0
  • markdown ==3.3.6
  • matplotlib ==3.5.0
  • mendeleev ==0.9.0
  • multipledispatch ==0.6.0
  • networkx ==2.6.3
  • numba ==0.54.1
  • numpy ==1.20.3
  • oauthlib ==3.1.1
  • opt-einsum ==3.3.0
  • packaging ==21.3
  • pandas ==1.3.4
  • pbr ==5.8.0
  • pillow ==8.4.0
  • protobuf ==3.19.1
  • pyasn1 ==0.4.8
  • pyasn1-modules ==0.2.8
  • pybind11 ==2.8.1
  • pyfiglet ==0.8.post1
  • pygments ==2.10.0
  • pyparsing ==3.0.6
  • pysmiles ==1.0.1
  • python-dateutil ==2.8.2
  • python-hostlist ==1.21
  • pytz ==2021.3
  • pyyaml ==6.0
  • quippy-ase ==0.9.9
  • requests ==2.26.0
  • requests-oauthlib ==1.3.0
  • rsa ==4.8
  • scikit-learn ==1.0.1
  • scipy ==1.7.3
  • setuptools ==59.4.0
  • setuptools-scm ==6.3.2
  • six ==1.16.0
  • sparse ==0.13.0
  • sqlalchemy ==1.4.27
  • tensorboard ==2.7.0
  • tensorboard-data-server ==0.6.1
  • tensorboard-plugin-wit ==1.8.0
  • tensorflow ==2.7.0
  • tensorflow-estimator ==2.7.0
  • tensorflow-io-gcs-filesystem ==0.22.0
  • termcolor ==1.1.0
  • threadpoolctl ==3.0.0
  • tk ==0.1.0
  • toml ==0.10.2
  • tomli ==1.2.2
  • typing-extensions ==4.0.1
  • urllib3 ==1.26.7
  • wcmatch ==8.3
  • werkzeug ==2.0.2
  • wheel ==0.37.0
  • wrapt ==1.13.3
  • zipp ==3.6.0