profinity
A program for predicting unknown proton affinities of small molecules and metabolites
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.8%) to scientific vocabulary
Repository
A program for predicting unknown proton affinities of small molecules and metabolites
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
ProFinity: proton affinity prediction
Introduction
ProFinity is a machine learning program for predicting proton affinities (PA) of various small molecules and metabolites. Since PA is a gas phase property, the amount of diverse experimental PA measurements available is limited, thus, ProFinity can be practical for accurately predicting PA values for unknown chemicals within an interpolation limit.
Overall, ProFinity uses two neural network models: 1) a model for predicting PA values and 2) a model for error correction. Ultimately, both models synergistically deliver error attenuated results.
Performance
Functionality
The program supports single PA query or batch PA queries. For single query, only a canonical SMILE is required as input string. For batch queries, mirror the below input csv data layout for applicability:
| SMILES|
|-------------|
|Cc1cccc(C)c1|
|... |
Upon completion of a task a tabulated result like the table below is saved in a csv file.
|SMILES| PA (kcal/mol)| |-----|----| |Cc1cccc(C)c1 |189.36111| |...|...|
Requirement
Google account needed to access Google Colab notebook.
Support
To create a small batch query csv input file ad hoc: ```twig import pandas as pd
try: !touch small_batch.csv except: pass
columnnames=["SMILES"] smallbatch=pd.readcsv("smallbatch.csv", names=columnnames) complist = #example: ["C(=O)=O", "O"] smallbatch['SMILES'] = complist smallbatch.tocsv("small_batch.csv", index=False) ```
Limitations
ProFinity currently only supports chemicals containing the following atom types: H, He, B, C, N, O, F, P, S, Cl, Fe, As, Br, I, and Xe. The models have been trained on small molecules and metabolites, therefore, it may significantly underperform when applied to sizeable biomolecules. Also, training did not explicitly account for temperature or electric field effect.
Accessibility
to access the ProFinity platform.
Owner
- Name: Mithony Keng
- Login: mitkeng
- Kind: user
- Company: Michigan State University
- Repositories: 1
- Profile: https://github.com/mitkeng
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: ProFinity
message: >-
If you use this software, please cite it using the
metadata from this file.
type: software
authors:
- given-names: Mithony
family-names: Keng
email: kengmith@msu.edu
affiliation: Michigan State Unviersity
orcid: 'https://orcid.org/0000-0002-9850-0120'
repository-code: >-
https://github.com/mitkeng/ProFinity/blob/main/CITATION.cff
url: 'https://github.com/mitkeng/ProFinity'
abstract: 'Program for predicting proton affinity. '
keywords:
- Gas phase
- Proton affinity
- Protonation
license: MIT
version: '1.0'
date-released: '2024-07-15'