https://github.com/brsynth/biorgroup

Systematic expansion of R-group for ChEBI molecules from the RHEA database

https://github.com/brsynth/biorgroup

Science Score: 39.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.4%) to scientific vocabulary

Keywords

chebi r-group rhea
Last synced: 6 months ago · JSON representation

Repository

Systematic expansion of R-group for ChEBI molecules from the RHEA database

Basic Info
  • Host: GitHub
  • Owner: brsynth
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 426 KB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Topics
chebi r-group rhea
Created 9 months ago · Last pushed 6 months ago
Metadata Files
Readme License

README.md

BioRGroup dataset

BioRGroup Logo
https://doi.org/10.57745/V3URYA

Installation

sh conda env create --file recipes/worklow.yaml --name biorgroup pip install --no-deps -e .

Build dataset

1 - Download PubChem

sh python -m biorgroup.pubchem.download \ --output-pubchem-dir <output dir> \ --output-pubchem-db <output sql database>

2 - Download Rhea

sh python -m biorgroup.rhea.download \ --output-rhea-dir <output dir> \ --parameter-release-int <int>

3 - R-group search

sh snakemake \ -p \ -j 48 \ -c 48 \ --workflow-profile template/biorgroup \ -s ./src/biorgroup/rgroup/Snakefile \ --use-conda \ --latency-wait 5 \ --rerun-incomplete \ --config input_depot_str=./src/biorgroup/rgroup input_chebi_csv=rhea-chebi-smiles.csv input_pubchem_db=pubchem.db output_dir_str=chebi parameter_search_timeout_int=10

Dataset overview

The Snakemake workflow produces a csv.gz file containing:

| column name | type | | --- | --- | | smilesrhea | str| | chebi | List[str] | | numheavyatoms | int | | exactmolwt | float | | coresuperstructuresmiles | List[str] | | coresuperstructurepubchemcid | List[List[str]] | | rgroupextendedsmiles | List[str] | | rgroupextendedpubchem_cid | List[List[str]] |

Owner

  • Name: BioRetroSynth
  • Login: brsynth
  • Kind: organization

Our group is interested in synthetic biology and systems metabolic engineering in whole-cell and cell-free systems.

GitHub Events

Total
  • Delete event: 1
  • Push event: 1
  • Pull request event: 1
  • Create event: 3
Last Year
  • Delete event: 1
  • Push event: 1
  • Pull request event: 1
  • Create event: 3

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 0
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • guillaume-gricourt (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

pyproject.toml pypi
setup.py pypi