Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: ncbi.nlm.nih.gov -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.4%) to scientific vocabulary
Keywords
cheminformatics
chemistry
python
rdkit
standardization
validation
Last synced: 6 months ago
·
JSON representation
Repository
Molecule Validation and Standardization
Basic Info
- Host: GitHub
- Owner: mcs07
- License: mit
- Language: Python
- Default Branch: master
- Homepage: https://molvs.readthedocs.io/
- Size: 266 KB
Statistics
- Stars: 174
- Watchers: 18
- Forks: 56
- Open Issues: 23
- Releases: 0
Topics
cheminformatics
chemistry
python
rdkit
standardization
validation
Created over 11 years ago
· Last pushed almost 6 years ago
Metadata Files
Readme
Changelog
Contributing
License
README.rst
MolVS: Molecule Validation and Standardization
==============================================
.. image:: https://img.shields.io/pypi/v/MolVS.svg?style=flat-square
:alt: PyPI package
:target: https://pypi.python.org/pypi/MolVS
.. image:: https://img.shields.io/conda/vn/conda-forge/molvs.svg?style=flat-square
:alt: Conda package
:target: https://anaconda.org/conda-forge/molvs
.. image:: https://img.shields.io/github/license/mcs07/MolVS.svg?style=flat-square
:alt: MIT license
:target: https://github.com/mcs07/MolVS/blob/master/LICENSE
.. image:: https://img.shields.io/azure-devops/build/mcs07/MolVS/1.svg?style=flat-square
:alt: Azure DevOps tests
:target: https://dev.azure.com/mcs07/MolVS/_build?definitionId=1
**MolVS** is a molecule validation and standardization tool, written in Python using the `RDKit chemistry framework`_.
Building a collection of chemical structures from different sources can be difficult due to differing representations,
drawing conventions and mistakes. MolVS can standardize chemical structures to improve data quality, help with
de-duplication and identify relationships between molecules.
There are sensible defaults that make it easy to get started::
>>> from molvs import standardize_smiles
>>> standardize_smiles('[Na]OC(=O)c1ccc(C[S+2]([O-])([O-]))cc1')
'[Na+].O=C([O-])c1ccc(CS(=O)=O)cc1'
Installation
------------
To install MolVS with Anaconda Python, simply run::
conda install -c conda-forge molvs
Alternatively, try one of the other `installation options`_.
Documentation
-------------
Full documentation is available at https://molvs.readthedocs.io.
Contribute
----------
- Feature ideas and bug reports are welcome on the `Issue Tracker`_.
- Fork the `source code`_ on GitHub, make changes and send a pull request.
License
-------
MolVS is licensed under the `MIT license`_.
Similar projects
----------------
There are a number of projects with similar goals that take differing approaches:
- `Francis Atkinson's Standardiser`_
- `RSC Chemistry Validation and Standardization Platform (CVSP)`_
- `PubChem Standardization Service`_
- `Tripod Structure standardizer`_
- `FDA Substance Registration System Standard Operating Procedure`_
- `ChemAxon Structure Standardizer`_
.. _`RDKit chemistry framework`: http://www.rdkit.org
.. _`installation options`: http://molvs.readthedocs.io/en/latest/guide/install.html
.. _`source code`: https://github.com/mcs07/MolVS
.. _`Issue Tracker`: https://github.com/mcs07/MolVS/issues
.. _`MIT license`: https://github.com/mcs07/MolVS/blob/master/LICENSE
.. _`Francis Atkinson's Standardiser`: https://wwwdev.ebi.ac.uk/chembl/extra/francis/standardiser/
.. _`RSC Chemistry Validation and Standardization Platform (CVSP)`: http://cvsp.chemspider.com
.. _`PubChem Standardization Service`: https://pubchem.ncbi.nlm.nih.gov/standardize/standardize.cgi
.. _`Tripod Structure standardizer`: https://tripod.nih.gov/?p=61
.. _`FDA Substance Registration System Standard Operating Procedure`: http://www.fda.gov/downloads/ForIndustry/DataStandards/SubstanceRegistrationSystem-UniqueIngredientIdentifierUNII/ucm127743.pdf
.. _`ChemAxon Structure Standardizer`: http://www.chemaxon.com/products/standardizer/
Owner
- Name: Matt Swain
- Login: mcs07
- Kind: user
- Location: New York, NY
- Company: @DEShawResearch
- Website: https://matt-swain.com
- Twitter: mattswain123
- Repositories: 76
- Profile: https://github.com/mcs07
Developing software for drug discovery
GitHub Events
Total
- Watch event: 18
- Issue comment event: 1
- Fork event: 4
Last Year
- Watch event: 18
- Issue comment event: 1
- Fork event: 4
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Matt Swain | m****n@m****m | 142 |
| Brian Cole | c****b@d****m | 2 |
| JoshuaMeyers | j****s@r****m | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 29
- Total pull requests: 12
- Average time to close issues: about 1 month
- Average time to close pull requests: about 16 hours
- Total issue authors: 23
- Total pull request authors: 7
- Average comments per issue: 1.79
- Average comments per pull request: 0.42
- Merged pull requests: 7
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- UnixJunkie (3)
- jose-manuel (3)
- benbowen (2)
- jw-feng (2)
- mainguyenanhvu (1)
- chupvl (1)
- alexisparenty (1)
- thesketh (1)
- khoivan88 (1)
- aparamon (1)
- jvansan (1)
- adlvdl (1)
- ghost (1)
- mmagithub (1)
- sergsb (1)
Pull Request Authors
- mcs07 (5)
- gjgetzinger (2)
- gitter-badger (1)
- UnixJunkie (1)
- coleb (1)
- fossabot (1)
- JoshuaMeyers (1)
Top Labels
Issue Labels
bug (7)
enhancement (2)
question (1)
wontfix (1)
documentation (1)
Pull Request Labels
bug (5)
enhancement (2)
Packages
- Total packages: 4
-
Total downloads:
- pypi 13,443 last-month
-
Total dependent packages: 1
(may contain duplicates) -
Total dependent repositories: 13
(may contain duplicates) - Total versions: 32
- Total maintainers: 1
pypi.org: molvs
Molecule Validation and Standardization
- Homepage: https://github.com/mcs07/MolVS
- Documentation: https://molvs.readthedocs.io/
- License: MIT
-
Latest release: 0.1.1
published almost 8 years ago
Rankings
Downloads: 4.7%
Dependent packages count: 4.8%
Dependent repos count: 5.2%
Average: 5.3%
Forks count: 5.9%
Stargazers count: 6.0%
Maintainers (1)
Last synced:
6 months ago
proxy.golang.org: github.com/mcs07/MolVS
- Documentation: https://pkg.go.dev/github.com/mcs07/MolVS#section-documentation
- License: mit
-
Latest release: v0.1.1
published almost 8 years ago
Rankings
Dependent packages count: 9.0%
Average: 9.6%
Dependent repos count: 10.2%
Last synced:
6 months ago
proxy.golang.org: github.com/mcs07/molvs
- Documentation: https://pkg.go.dev/github.com/mcs07/molvs#section-documentation
- License: mit
-
Latest release: v0.1.1
published almost 8 years ago
Rankings
Dependent packages count: 9.0%
Average: 9.6%
Dependent repos count: 10.2%
Last synced:
6 months ago
conda-forge.org: molvs
- Homepage: https://github.com/mcs07/MolVS
- License: MIT
-
Latest release: 0.1.1
published almost 8 years ago
Rankings
Dependent repos count: 14.8%
Forks count: 25.9%
Average: 30.9%
Stargazers count: 31.4%
Dependent packages count: 51.6%
Last synced:
6 months ago
Dependencies
environment.yml
conda
- pip 19.0.3.*
- pytest 4.3.1.*
- rdkit 2018.09.1.*
- six 1.12.0.*
- twine 1.13.0.*
setup.py
pypi
- six *