xbcf

Predicting BCF values with explanations

https://github.com/bayer-group/xbcf

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 5 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.3%) to scientific vocabulary

Keywords

bayer-not-classified bayer-reg-none bcf beat-not-applicable chemoinformatics deep-learning explainability machine-learning
Last synced: 6 months ago · JSON representation ·

Repository

Predicting BCF values with explanations

Basic Info
  • Host: GitHub
  • Owner: Bayer-Group
  • License: bsd-3-clause
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 1.69 MB
Statistics
  • Stars: 5
  • Watchers: 3
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
bayer-not-classified bayer-reg-none bcf beat-not-applicable chemoinformatics deep-learning explainability machine-learning
Created over 3 years ago · Last pushed about 3 years ago
Metadata Files
Readme Contributing License Citation Codeowners

README.md

xBCF - Predicting bioconcentration factor with explainable deep learning

Open source project for predicting logBCF with explanations on SMILES level. Find more detailed descriptions of the methodology in our research article: https://doi.org/10.1016/j.ailsci.2022.100047

SMILES preprocessing

Make sure your input SMILES strings fulfill the CDDD requirements, i.e. mono-constituent organic molecules, desalted, and neutralized.

Dependency on CDDD

The models in the project are built upon CDDD embeddings which depends on TF 1.X. The latest Python version supported TF1.X is 3.7. We have adjust all code dependencies to Python 3.7 to better fit different deployment platforms. Therefore, we suggest the following ways to install the CDDD package.

Option I - customized way of using the official CDDD repo

  1. Place the CDDD source code under ./src
  2. Place the CDDD model directory under ./models

Option II - an unofficial package

In this sister repo, an unofficial .whl file was packed with the CDDD source code and the default CDDD model. Please refer to the repo for installation. And use this installation method at your own risk.

The script src/attributor.py works with both ways of setting up CDDD, make sure you modify src/xbcf.py accordingly.

For any questions, feel free to open issues.

Model availability

Our MTL model as reported in our paper cannot be published due to the inclusion of internal secret data in the training set. Nevertheless, an SVR model with good performance is published in the repo. And a linear regression model is also available as a simple baseline.

Usage

The code has been tested with Python 3.7

Core dependencies: - Tensorflow 1.13.0

bash python3.7 -m venv bcf_env source ./bcf_env/bin/activate pip install -r requirements.txt python test.py If everything goes well, the last step should print the attribution and predictions for two SMILES.

Please cite

Zhao, L., Floriane, M., Heberle, H., & Schmidt, S. (2022). Modeling bioconcentration factors in fish with explainable deep learning. Artificial Intelligence in the Life Sciences, 2, 100047. https://doi.org/10.1016/j.ailsci.2022.100047

bibtex @article{Zhao_Modeling_bioconcentration_factors_2022, author = {Zhao, Linlin and Floriane, Montanari and Heberle, Henry and Schmidt, Sebastian}, doi = {10.1016/j.ailsci.2022.100047}, journal = {Artificial Intelligence in the Life Sciences}, month = {December}, pages = {100047}, title = {{Modeling bioconcentration factors in fish with explainable deep learning}}, volume = {2}, year = {2022} }

Owner

  • Name: Bayer Open Source
  • Login: Bayer-Group
  • Kind: organization

Science for a better life

Citation (CITATION.cff)

cff-version: 1.2.0
message: "please cite it as below."
authors:
- family-names: "Zhao"
  given-names: "Linlin"
  orcid: "https://orcid.org/0000-0001-8086-2812"
title: "Predicting Bioconcentration Factor with Explainable Deep Learning"
doi: 10.1016/j.ailsci.2022.100047
url: "https://doi.org/10.1016/j.ailsci.2022.100047"
date-released: 2022-09-23
preferred-citation:
  type: article
  authors:
  - family-names: "Zhao"
    given-names: "Linlin"
    orcid: "https://orcid.org/0000-0001-8086-2812"
  - family-names: "Floriane"
    given-names: "Montanari"
    orcid: "https://orcid.org/0000-0002-4676-6170"
  - family-names: "Heberle"
    given-names: "Henry"
    orcid: "https://orcid.org/0000-0003-1964-268X"
  - family-names: "Schmidt"
    given-names: "Sebastian"
    orcid: "https://orcid.org/0000-0002-1364-5436"
  doi: "10.1016/j.ailsci.2022.100047"
  journal: "Artificial Intelligence in the Life Sciences"
  month: December
  start:  100047
  end:  # Last page number
  title: "Modeling bioconcentration factors in fish with explainable deep learning"
  issue: 
  volume: 2
  year: 2022

GitHub Events

Total
Last Year

Committers

Last synced: about 1 year ago

All Time
  • Total Commits: 13
  • Total Committers: 2
  • Avg Commits per committer: 6.5
  • Development Distribution Score (DDS): 0.077
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
linlinzhao l****1@b****m 12
Sebastian Schmidt s****1@b****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 1
  • Total pull requests: 1
  • Average time to close issues: 12 days
  • Average time to close pull requests: 2 minutes
  • Total issue authors: 1
  • Total pull request authors: 1
  • Average comments per issue: 3.0
  • Average comments per pull request: 1.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 1
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • conrad-stork (1)
Pull Request Authors
  • dependabot[bot] (1)
Top Labels
Issue Labels
Pull Request Labels
dependencies (1)