deeppurpose

A Deep Learning Toolkit for DTI, Drug Property, PPI, DDI, Protein Function Prediction (Bioinformatics)

https://github.com/kexinhuang12345/deeppurpose

Science Score: 46.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 5 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org, ncbi.nlm.nih.gov
✓
Committers with academic emails
4 of 21 committers (19.0%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (15.2%) to scientific vocabulary

Keywords

bioinformatics covid19 ddi deep-learning drug-discovery drug-drug-interaction drug-property-prediction drug-repurposing drug-target-interaction drug-target-interactions dti-prediction ppi protein-function-prediction protein-protein-interaction qsar repurposing-drugs side-effects toolkit virtual-screening

Last synced: 6 months ago · JSON representation

Repository

A Deep Learning Toolkit for DTI, Drug Property, PPI, DDI, Protein Function Prediction (Bioinformatics)

Basic Info

Host: GitHub
Owner: kexinhuang12345
License: bsd-3-clause
Language: Jupyter Notebook
Default Branch: master
Homepage: https://doi.org/10.1093/bioinformatics/btaa1005
Size: 14.5 MB

Statistics

Stars: 1,077
Watchers: 30
Forks: 291
Open Issues: 18
Releases: 12

Topics

Created almost 6 years ago · Last pushed over 1 year ago

Metadata Files

Readme License

README.md

logo

A Deep Learning Library for Compound and Protein Modeling
DTI, Drug Property, PPI, DDI, Protein Function Prediction

Applications in Drug Repurposing, Virtual Screening, QSAR, Side Effect Prediction and More

This repository hosts DeepPurpose, a Deep Learning Based Molecular Modeling and Prediction Toolkit on Drug-Target Interaction Prediction, Compound Property Prediction, Protein-Protein Interaction Prediction, and Protein Function prediction (using PyTorch). We focus on DTI and its applications in Drug Repurposing and Virtual Screening, but support various other molecular encoding tasks. It allows very easy usage (several lines of codes only) to facilitate deep learning for life science research.

News!

[05/21] 0.1.2 Support 5 new graph neural network based models for compound encoding (DGLGCN, DGLNeuralFP, DGLGINAttrMasking, DGLGINContextPred, DGL_AttentiveFP), implemented using DGL Life Science! An example is provided here!
[12/20] DeepPurpose is now supported by TDC data loader, which contains a large collection of ML for therapeutics datasets, including many drug property, DTI datasets. Here is a tutorial!
[12/20] DeepPurpose can now be installed via pip!
[11/20] DeepPurpose is published in Bioinformatics!
[11/20] Added 5 more pretrained models on BindingDB IC50 Units (around 1Million data points).
[10/20] Google Colab Installation Instructions are provided here. Thanks to @hima111997 !
[10/20] Using DeepPurpose, we made a humans-in-the-loop molecular design web UI interface, check it out! [Website, paper]
[09/20] DeepPurpose has now supported three more tasks: DDI, PPI and Protein Function Prediction! You can simply call from DeepPurpose import DDI/PPI/ProteinPred to use, checkout examples below!
[07/20] A simple web UI for DTI prediction can be created under 10 lines using Gradio! A demo is provided here.
[07/20] A blog is posted on the Towards Data Science Medium column, check this out!
[07/20] Two tutorials are online to go through DeepPurpose's framework to do drug-target interaction prediction and drug property prediction (DTI, Drug Property).
[05/20] Support drug property prediction for screening data that does not have target proteins such as bacteria! An example using RDKit2D with DNN for training and repurposing for pseudomonas aeruginosa (MIT AI Cures's open task) is provided as a demo.
[05/20] Now supports hyperparameter tuning via Bayesian Optimization through the Ax platform! A demo is provided in here.

Features

15+ powerful encodings for drugs and proteins, ranging from deep neural network on classic cheminformatics fingerprints, CNN, transformers to message passing graph neural network, with 50+ combined models! Most of the combinations of the encodings are not yet in existing works. All of these under 10 lines but with lots of flexibility! Switching encoding is as simple as changing the encoding names!
Realistic and user-friendly design:
- support DTI, DDI, PPI, molecular property prediction, protein function predictions!
- automatic identification to do drug target binding affinity (regression) or drug target interaction prediction (binary) task.
- support cold target, cold drug settings for robust model evaluations and support single-target high throughput sequencing assay data setup.
- many dataset loading/downloading/unzipping scripts to ease the tedious preprocessing, including antiviral, COVID19 targets, BindingDB, DAVIS, KIBA, ...
- many pretrained checkpoints.
- easy monitoring of training process with detailed training metrics output such as test set figures (AUCs) and tables, also support early stopping.
- detailed output records such as rank list for repurposing result.
- various evaluation metrics: ROC-AUC, PR-AUC, F1 for binary task, MSE, R-squared, Concordance Index for regression task.
- label unit conversion for skewed label distribution such as Kd.
- time reference for computational expensive encoding.
- PyTorch based, support CPU, GPU, Multi-GPUs.

NOTE: We are actively looking for constructive advices/user feedbacks/experiences on using DeepPurpose! Please open an issue or contact us.

Cite Us

If you found this package useful, please cite our paper: @article{huang2020deeppurpose, title={DeepPurpose: A Deep Learning Library for Drug-Target Interaction Prediction}, author={Huang, Kexin and Fu, Tianfan and Glass, Lucas M and Zitnik, Marinka and Xiao, Cao and Sun, Jimeng}, journal={Bioinformatics}, year={2020} }

Installation

Try it on Binder! Binder is a cloud Jupyter Notebook interface that will install our environment dependency for you.

Video tutorial to install Binder.

We recommend to install it locally since Binder needs to be refreshed every time launching. To install locally, we recommend to install from pip:

`pip`

bash conda create -n DeepPurpose python=3.6 conda activate DeepPurpose conda install -c conda-forge notebook pip install git+https://github.com/bp-kelley/descriptastorus pip install DeepPurpose

Build from Source

First time: ```bash git clone https://github.com/kexinhuang12345/DeepPurpose.git ## Download code repository cd DeepPurpose ## Change directory to DeepPurpose conda env create -f environment.yml ## Build virtual environment with all packages installed using conda conda activate DeepPurpose ## Activate conda environment (use "source activate DeepPurpose" for anaconda 4.4 or earlier) jupyter notebook ## open the jupyter notebook with the conda env

run our code, e.g. click a file in the DEMO folder

... ...

conda deactivate ## when done, exit conda environment ```

In the future: ```bash cd DeepPurpose ## Change directory to DeepPurpose conda activate DeepPurpose ## Activate conda environment jupyter notebook ## open the jupyter notebook with the conda env

run our code, e.g. click a file in the DEMO folder

... ...

conda deactivate ## when done, exit conda environment ```

Video tutorial to install locally from source.

Example

Case Study 1(a): A Framework for Drug Target Interaction Prediction, with less than 10 lines of codes.

In addition to the DTI prediction, we also provide repurpose and virtual screening functions to rapidly generation predictions.

Click here for the code!

```python from DeepPurpose import DTI as models from DeepPurpose.utils import * from DeepPurpose.dataset import * SAVE_PATH='./saved_path' import os if not os.path.exists(SAVE_PATH): os.makedirs(SAVE_PATH) # Load Data, an array of SMILES for drug, an array of Amino Acid Sequence for Target and an array of binding values/0-1 label. # e.g. ['Cc1ccc(CNS(=O)(=O)c2ccc(s2)S(N)(=O)=O)cc1', ...], ['MSHHWGYGKHNGPEHWHKDFPIAKGERQSPVDIDTH...', ...], [0.46, 0.49, ...] # In this example, BindingDB with Kd binding score is used. X_drug, X_target, y = process_BindingDB(download_BindingDB(SAVE_PATH), y = 'Kd', binary = False, convert_to_log = True) # Type in the encoding names for drug/protein. drug_encoding, target_encoding = 'CNN', 'Transformer' # Data processing, here we select cold protein split setup. train, val, test = data_process(X_drug, X_target, y, drug_encoding, target_encoding, split_method='cold_protein', frac=[0.7,0.1,0.2]) # Generate new model using default parameters; also allow model tuning via input parameters. config = generate_config(drug_encoding, target_encoding, transformer_n_layer_target = 8) net = models.model_initialize(**config) # Train the new model. # Detailed output including a tidy table storing validation loss, metrics, AUC curves figures and etc. are stored in the ./result folder. net.train(train, val, test) # or simply load pretrained model from a model directory path or reproduced model name such as DeepDTA net = models.model_pretrained(MODEL_PATH_DIR or MODEL_NAME) # Repurpose using the trained model or pre-trained model # In this example, loading repurposing dataset using Broad Repurposing Hub and SARS-CoV 3CL Protease Target. X_repurpose, drug_name, drug_cid = load_broad_repurposing_hub(SAVE_PATH) target, target_name = load_SARS_CoV_Protease_3CL() _ = models.repurpose(X_repurpose, target, net, drug_name, target_name) # Virtual screening using the trained model or pre-trained model X_repurpose, drug_name, target, target_name = ['CCCCCCCOc1cccc(c1)C([O-])=O', ...], ['16007391', ...], ['MLARRKPVLPALTINPTIAEGPSPTSEGASEANLVDLQKKLEEL...', ...], ['P36896', 'P00374'] _ = models.virtual_screening(X_repurpose, target, net, drug_name, target_name) ```

Case Study 1(b): A Framework for Drug Property Prediction, with less than 10 lines of codes.

Many dataset is in the form of high throughput screening data, which have only drug and its activity score. It can be formulated as a drug property prediction task. We also provide a repurpose function to predict over large space of drugs.

Click here for the code!

```python from DeepPurpose import CompoundPred as models from DeepPurpose.utils import * from DeepPurpose.dataset import * SAVE_PATH='./saved_path' import os if not os.path.exists(SAVE_PATH): os.makedirs(SAVE_PATH) # load AID1706 Assay Data X_drugs, _, y = load_AID1706_SARS_CoV_3CL() drug_encoding = 'rdkit_2d_normalized' train, val, test = data_process(X_drug = X_drugs, y = y, drug_encoding = drug_encoding, split_method='random', random_seed = 1) config = generate_config(drug_encoding = drug_encoding, cls_hidden_dims = [512], train_epoch = 20, LR = 0.001, batch_size = 128, ) model = models.model_initialize(**config) model.train(train, val, test) X_repurpose, drug_name, drug_cid = load_broad_repurposing_hub(SAVE_PATH) _ = models.repurpose(X_repurpose, model, drug_name) ```

Case Study 1(c): A Framework for Drug-Drug Interaction Prediction, with less than 10 lines of codes.

DDI is very important for drug safety profiling and the success of clinical trials. This framework predicts interaction based on drug pairs chemical structure.

Click here for the code!

```python from DeepPurpose import DDI as models from DeepPurpose.utils import * from DeepPurpose.dataset import * # load DB Binary Data X_drugs, X_drugs_, y = read_file_training_dataset_drug_drug_pairs("toy_data/ddi.txt") drug_encoding = 'rdkit_2d_normalized' train, val, test = data_process(X_drug = X_drugs, X_drug_ = X_drugs_, y = y, drug_encoding = drug_encoding, split_method='random', random_seed = 1) config = generate_config(drug_encoding = drug_encoding, cls_hidden_dims = [512], train_epoch = 20, LR = 0.001, batch_size = 128, ) model = models.model_initialize(**config) model.train(train, val, test) ```

Case Study 1(d): A Framework for Protein-Protein Interaction Prediction, with less than 10 lines of codes.

PPI is important to study the relations among targets.

Click here for the code!

```python from DeepPurpose import PPI as models from DeepPurpose.utils import * from DeepPurpose.dataset import * # load DB Binary Data X_targets, X_targets_, y = read_file_training_dataset_protein_protein_pairs("toy_data/ppi.txt") target_encoding = 'CNN' train, val, test = data_process(X_target = X_targets, X_target_ = X_targets_, y = y, target_encoding = target_encoding, split_method='random', random_seed = 1) config = generate_config(target_encoding = target_encoding, cls_hidden_dims = [512], train_epoch = 20, LR = 0.001, batch_size = 128, ) model = models.model_initialize(**config) model.train(train, val, test) ```

Case Study 1(e): A Framework for Protein Function Prediction, with less than 10 lines of codes.

Protein function prediction help predict various useful functions such as GO terms, structural classification and etc. Also, for biologics drugs, it is also useful for screening.

Click here for the code!

```python from DeepPurpose import ProteinPred as models from DeepPurpose.utils import * from DeepPurpose.dataset import * # load DB Binary Data X_targets, y = read_file_protein_function() target_encoding = 'CNN' train, val, test = data_process(X_target = X_targets, y = y, target_encoding = target_encoding, split_method='random', random_seed = 1) config = generate_config(target_encoding = target_encoding, cls_hidden_dims = [512], train_epoch = 20, LR = 0.001, batch_size = 128, ) model = models.model_initialize(**config) model.train(train, val, test) ```

Case Study 2 (a): Antiviral Drugs Repurposing for SARS-CoV2 3CLPro, using One Line.

Given a new target sequence (e.g., SARS-CoV2 3CL Protease), retrieve a list of repurposing drugs from a curated drug library of 81 antiviral drugs. The Binding Score is the Kd values. Results aggregated from five pretrained model on BindingDB dataset! (Caution: this currently is for educational purposes. The pretrained DTI models only cover a small dataset and thus cannot generalize to every new unseen protein. For the best use case, train your own model with customized data.)

Click here for the code!

```python from DeepPurpose import oneliner from DeepPurpose.dataset import * oneliner.repurpose(*load_SARS_CoV2_Protease_3CL(), *load_antiviral_drugs(no_cid = True)) ``` ``` ----output---- Drug Repurposing Result for SARS-CoV2 3CL Protease +------+----------------------+------------------------+---------------+ | Rank | Drug Name | Target Name | Binding Score | +------+----------------------+------------------------+---------------+ | 1 | Sofosbuvir | SARS-CoV2 3CL Protease | 190.25 | | 2 | Daclatasvir | SARS-CoV2 3CL Protease | 214.58 | | 3 | Vicriviroc | SARS-CoV2 3CL Protease | 315.70 | | 4 | Simeprevir | SARS-CoV2 3CL Protease | 396.53 | | 5 | Etravirine | SARS-CoV2 3CL Protease | 409.34 | | 6 | Amantadine | SARS-CoV2 3CL Protease | 419.76 | | 7 | Letermovir | SARS-CoV2 3CL Protease | 460.28 | | 8 | Rilpivirine | SARS-CoV2 3CL Protease | 470.79 | | 9 | Darunavir | SARS-CoV2 3CL Protease | 472.24 | | 10 | Lopinavir | SARS-CoV2 3CL Protease | 473.01 | | 11 | Maraviroc | SARS-CoV2 3CL Protease | 474.86 | | 12 | Fosamprenavir | SARS-CoV2 3CL Protease | 487.45 | | 13 | Ritonavir | SARS-CoV2 3CL Protease | 492.19 | .... ```

Case Study 2(b): Repurposing using Customized training data, with One Line.

Given a new target sequence (e.g., SARS-CoV 3CL Pro), training on new data (AID1706 Bioassay), and then retrieve a list of repurposing drugs from a proprietary library (e.g., antiviral drugs). The model can be trained from scratch or finetuned from the pretraining checkpoint!

Click here for the code!

```python from DeepPurpose import oneliner from DeepPurpose.dataset import * oneliner.repurpose(*load_SARS_CoV_Protease_3CL(), *load_antiviral_drugs(no_cid = True), *load_AID1706_SARS_CoV_3CL(), \ split='HTS', convert_y = False, frac=[0.8,0.1,0.1], pretrained = False, agg = 'max_effect') ``` ``` ----output---- Drug Repurposing Result for SARS-CoV 3CL Protease +------+----------------------+-----------------------+-------------+-------------+ | Rank | Drug Name | Target Name | Interaction | Probability | +------+----------------------+-----------------------+-------------+-------------+ | 1 | Remdesivir | SARS-CoV 3CL Protease | YES | 0.99 | | 2 | Efavirenz | SARS-CoV 3CL Protease | YES | 0.98 | | 3 | Vicriviroc | SARS-CoV 3CL Protease | YES | 0.98 | | 4 | Tipranavir | SARS-CoV 3CL Protease | YES | 0.96 | | 5 | Methisazone | SARS-CoV 3CL Protease | YES | 0.94 | | 6 | Letermovir | SARS-CoV 3CL Protease | YES | 0.88 | | 7 | Idoxuridine | SARS-CoV 3CL Protease | YES | 0.77 | | 8 | Loviride | SARS-CoV 3CL Protease | YES | 0.76 | | 9 | Baloxavir | SARS-CoV 3CL Protease | YES | 0.74 | | 10 | Ibacitabine | SARS-CoV 3CL Protease | YES | 0.70 | | 11 | Taribavirin | SARS-CoV 3CL Protease | YES | 0.65 | | 12 | Indinavir | SARS-CoV 3CL Protease | YES | 0.62 | | 13 | Podophyllotoxin | SARS-CoV 3CL Protease | YES | 0.60 | .... ```

Demos

Checkout 10+ demos & tutorials to start:

| Name | Description | |-----------------|-------------| | Dataset Tutorial | Tutorial on how to use the dataset loader and read customized data| | Drug Repurposing for 3CLPro| Example of one-liner repurposing for 3CLPro| | Drug Repurposing with Customized Data| Example of one-liner repurposing with AID1706 Bioassay Data, training from scratch| | Virtual Screening for BindingDB IC50 | Example of one-liner virtual screening | |Reproduce DeepDTA|Reproduce DeepDTA with DAVIS dataset and show how to use the 10 lines framework| | Virtual Screening for DAVIS and Correlation Plot | Example of one-liner virtual screening and evaluate on unseen dataset by plotting correlation | | Binary Classification for DAVIS using CNNs| Binary Classification for DAVIS dataset using CNN encodings by using the 10 lines framework.| | Pretraining Model Tutorial| Tutorial on how to load pretraining models|

and more in the DEMO folder!

Contact

Please contact kexinhuang@hsph.harvard.edu or tfu42@gatech.edu for help or submit an issue.

Encodings

Currently, we support the following encodings:

| Drug Encodings | Description | |-----------------|-------------| | Morgan | Extended-Connectivity Fingerprints | | Pubchem| Pubchem Substructure-based Fingerprints| | Daylight | Daylight-type fingerprints | | rdkit2dnormalized| Normalized Descriptastorus| | ESPF | Explainable Substructure Partition Fingerprint | | ErG | 2D pharmacophore descriptions for scaffold hopping | | CNN | Convolutional Neural Network on SMILES| |CNNRNN| A GRU/LSTM on top of a CNN on SMILES| |Transformer| Transformer Encoder on ESPF| | MPNN | Message-passing neural network | | DGLGCN | Graph Convolutional Network | | DGLNeuralFP | Neural Fingerprint | | DGLGINAttrMasking | Pretrained GIN with Attribute Masking | | DGLGINContextPred | Pretrained GIN with Context Prediction | | DGLAttentiveFP | Attentive FP, Xiong et al. 2020 |

| Target Encodings | Description | |-----------------|-------------| | AAC | Amino acid composition up to 3-mers | | PseudoAAC| Pseudo amino acid composition| | Conjointtriad | Conjoint triad features | | Quasi-seq| Quasi-sequence order descriptor| | ESPF | Explainable Substructure Partition Fingerprint | | CNN | Convolutional Neural Network on target seq| |CNNRNN| A GRU/LSTM on top of a CNN on target seq| |Transformer| Transformer Encoder on ESPF|

Data

DeepPurpose supports the following dataset loaders for now and more will be added:

Public Drug-Target Binding Benchmark Dataset | Data | Function | |-------|----------| |BindingDB| download_BindingDB() to download the data and process_BindingDB() to process the data| |DAVIS|load_process_DAVIS() to download and process the data| |KIBA|load_process_KIBA() to download and process the data|

Repurposing Dataset | Data | Function | |-------|----------| |Curated Antiviral Drugs Library|load_antiviral_drugs() to load and process the data| |Broad Repurposing Hub|load_broad_repurposing_hub() downloads and process the data|

Bioassay Data for COVID-19 (Thanks to MIT AI Cures) | Data | Function | |-------|----------| |AID1706|load_AID1706_SARS_CoV_3CL() to load and process|

COVID-19 Targets | Data | Function | |-------|----------| |SARS-CoV 3CL Protease|load_SARS_CoV_Protease_3CL()| |SARS-CoV2 3CL Protease|load_SARS_CoV2_Protease_3CL()| |SARSCoV2 RNA Polymerase|```loadSARSCoV2RNApolymerase()| |SARS-CoV2 Helicase|loadSARSCoV2Helicase()| |SARS-CoV2 3to5_exonuclease|loadSARSCoV23to5exonuclease()| |SARS-CoV2 endoRNAse|loadSARSCoV2_endoRNAse()```|

DeepPurpose also supports reading from users' txt file. It assumes the following data format.

Click here for the format expected!

For drug target pairs: ``` Drug1_SMILES Target1_Seq Score/Label Drug2_SMILES Target2_Seq Score/Label .... ``` Then, use ```python from DeepPurpose import dataset X_drug, X_target, y = dataset.read_file_training_dataset_drug_target_pairs(PATH) ``` For bioassay training data: ``` Target_Seq Drug1_SMILES Score/Label Drug2_SMILES Score/Label .... ``` Then, use ```python from DeepPurpose import dataset X_drug, X_target, y = dataset.read_file_training_dataset_bioassay(PATH) ``` For drug property prediction training data: ``` Drug1_SMILES Score/Label Drug2_SMILES Score/Label .... ``` Then, use ```python from DeepPurpose import dataset X_drug, y = dataset.read_file_compound_property(PATH) ``` For protein function prediction training data: ``` Target1_Seq Score/Label Target2_Seq Score/Label .... ``` Then, use ```python from DeepPurpose import dataset X_drug, y = dataset.read_file_protein_function(PATH) ``` For drug-drug pairs: ``` Drug1_SMILES Drug1_SMILES_ Score/Label Drug2_SMILES Drug2_SMILES_ Score/Label .... ``` Then, use ```python from DeepPurpose import dataset X_drug, X_target, y = dataset.read_file_training_dataset_drug_drug_pairs(PATH) ``` For protein-protein pairs: ``` Target1_Seq Target1_Seq_ Score/Label Target2_Seq Target2_Seq_ Score/Label .... ``` Then, use ```python from DeepPurpose import dataset X_drug, X_target, y = dataset.read_file_training_dataset_protein_protein_pairs(PATH) ``` For drug repurposing library: ``` Drug1_Name Drug1_SMILES Drug2_Name Drug2_SMILES .... ``` Then, use ```python from DeepPurpose import dataset X_drug, X_drug_names = dataset.read_file_repurposing_library(PATH) ``` For target sequence to be repurposed: ``` Target_Name Target_seq ``` Then, use ```python from DeepPurpose import dataset Target_seq, Target_name = dataset.read_file_target_sequence(PATH) ``` For virtual screening library: ``` Drug1_SMILES Drug1_Name Target1_Seq Target1_Name Drug1_SMILES Drug1_Name Target1_Seq Target1_Name .... ``` Then, use ```python from DeepPurpose import dataset X_drug, X_target, X_drug_names, X_target_names = dataset.read_file_virtual_screening_drug_target_pairs(PATH) ```

Checkout Dataset Tutorial.

Pretrained models

We provide more than 10 pretrained models. Please see Pretraining Model Tutorial on how to load them. It is as simple as

python from DeepPurpose import DTI as models net = models.model_pretrained(model = 'MPNN_CNN_DAVIS') or net = models.model_pretrained(FILE_PATH) The list of available pretrained models:

Model name consists of first the drug encoding, then the target encoding and then the trained dataset.

Note that for DTI models, the BindingDB and DAVIS are trained on the log scale. But DeepPurpose allows you to specify conversion between log scale (e.g., pIC50) and original scale by the variable convert_y.

Click here for the models supported!

|Model Name| |------| |CNN_CNN_BindingDB_IC50| |Morgan_CNN_BindingDB_IC50| |Morgan_AAC_BindingDB_IC50| |MPNN_CNN_BindingDB_IC50| |Daylight_AAC_BindingDB_IC50| |CNN_CNN_DAVIS| |CNN_CNN_BindingDB| |Morgan_CNN_BindingDB| |Morgan_CNN_KIBA| |Morgan_CNN_DAVIS| |MPNN_CNN_BindingDB| |MPNN_CNN_KIBA| |MPNN_CNN_DAVIS| |Transformer_CNN_BindingDB| |Daylight_AAC_DAVIS| |Daylight_AAC_KIBA| |Daylight_AAC_BindingDB| |Morgan_AAC_BindingDB| |Morgan_AAC_KIBA| |Morgan_AAC_DAVIS|

Documentations

https://deeppurpose.readthedocs.io is under active development.

Disclaimer

The output list should be inspected manually by experts before proceeding to the wet-lab validation, and our work is still in active developement with limitations, please do not directly use the drugs.

Owner

Name: Kexin Huang
Login: kexinhuang12345
Kind: user
Location: Bay Area

Website: www.kexinhuang.com
Twitter: KexinHuang5
Repositories: 8
Profile: https://github.com/kexinhuang12345

PhD Student @ Stanford CS

GitHub Events

Total

Issues event: 1
Watch event: 104
Issue comment event: 1
Fork event: 16

Last Year

Issues event: 1
Watch event: 104
Issue comment event: 1
Fork event: 16

Committers

Last synced: over 2 years ago

All Time

Total Commits: 441
Total Committers: 21
Avg Commits per committer: 21.0
Development Distribution Score (DDS): 0.585

Past Year

Commits: 4
Committers: 3
Avg Commits per committer: 1.333
Development Distribution Score (DDS): 0.5

Top Committers

Name	Email	Commits
Kexin Huang	k**g@h**u	183
futianfan	f**n@g**m	157
kexinhuang12345	k**3@n**u	46
kexin huang	c**x@g**m	26
Po-Yu Kao	p**o@g**m	12
LucasMGlass	t**5@t**u	2
Alex Golts	a**x@g**t	1
Shengchao Liu	s**r@g**m	1
Lucas Hyunseung Kong	a**1@n**m	1
markcheung	m**7@g**m	1
LMGlass	l**s@i**m	1
Tamás Fenyvesi	t**i@d**u	1
Tianfan Fu	t**u@i**u	1
Dave Troiano	d**e@d**i	1
Navan Chauhan	n**n@g**m	1
Karthik Viswanathan	k**t@g**m	1
hima111997	5****7	1
0ling	3****g	1
cyrusmaher	c****r	1
HyeonJae	8****a	1
Jean-Paul R. Soucy	3****y	1

Committer Domains (Top 20 + Academic)

determined.ai: 1 ipsec-10-2-64-58.vpn.gatech.edu: 1 doknet.hu: 1 iqvia.com: 1 naver.com: 1 golts.net: 1 temple.edu: 1 nyu.edu: 1 hsph.harvard.edu: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 97
Total pull requests: 21
Average time to close issues: 15 days
Average time to close pull requests: about 2 months
Total issue authors: 64
Total pull request authors: 12
Average comments per issue: 3.08
Average comments per pull request: 1.0
Merged pull requests: 15
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 2
Pull requests: 0
Average time to close issues: 1 minute
Average time to close pull requests: N/A
Issue authors: 2
Pull request authors: 0
Average comments per issue: 0.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

xuzhang5788 (10)
pykao (5)
hyojin0912 (4)
rjd55 (4)
viko-3 (4)
albertma-evotec (3)
lonngxiang (3)
sailseem (3)
SteventheExorcist (2)
BAREJAA (2)
yrq3027 (2)
Jameel9 (2)
rosa840905 (2)
songyu2022 (1)
mislam5285 (1)

Pull Request Authors

pykao (8)
gittymo97 (2)
kexinhuang12345 (2)
yazdanimehdi (2)
agamemnonc (1)
sniperyyc (1)
alex-golts (1)
la1av1a (1)
printomi (1)
0ling (1)
Gumgo91 (1)
jeanpaulrsoucy (1)

Top Labels

Issue Labels

enhancement (3) bug (2)

Pull Request Labels

Packages

Total packages: 4
Total downloads:
- pypi 1,786 last-month

Total dependent packages: 0
(may contain duplicates)
Total dependent repositories: 2
(may contain duplicates)
Total versions: 37
Total maintainers: 1

pypi.org: deeppurpose

a Deep Learning Based Toolkit for Drug Discovery

Homepage: https://github.com/kexinhuang12345/DeepPurpose
Documentation: https://deeppurpose.readthedocs.io/
License: BSD-3-Clause
Latest release: 0.1.5
published over 4 years ago

Versions: 13
Dependent Packages: 0
Dependent Repositories: 2
Downloads: 1,786 Last month

Rankings

Stargazers count: 2.2%

Forks count: 3.3%

Average: 7.0%

Downloads: 7.9%

Dependent packages count: 10.1%

Dependent repos count: 11.6%

Maintainers (1)

kexinhuang

Last synced: 6 months ago

proxy.golang.org: github.com/kexinhuang12345/deeppurpose

Documentation: https://pkg.go.dev/github.com/kexinhuang12345/deeppurpose#section-documentation
License: bsd-3-clause
Latest release: v0.1.5
published over 4 years ago

Versions: 11
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 9.0%

Average: 9.6%

Dependent repos count: 10.2%

Last synced: 6 months ago

proxy.golang.org: github.com/kexinhuang12345/DeepPurpose

Documentation: https://pkg.go.dev/github.com/kexinhuang12345/DeepPurpose#section-documentation
License: bsd-3-clause
Latest release: v0.1.5
published over 4 years ago

Versions: 11
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 9.0%

Average: 9.6%

Dependent repos count: 10.2%

Last synced: 6 months ago

conda-forge.org: deeppurpose

Homepage: https://github.com/kexinhuang12345/DeepPurpose
License: BSD-3-Clause
Latest release: 0.0.5
published about 5 years ago

Versions: 2
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Forks count: 11.1%

Stargazers count: 14.1%

Average: 27.6%

Dependent repos count: 34.0%

Dependent packages count: 51.2%

Last synced: 6 months ago

Dependencies

requirements.txt pypi

ax-platform *
dgllife *
lifelines *
matplotlib *
numpy *
pandas *
pandas-flavor *
prettytable *
requests *
scikit-learn *
subword-nmt *
tensorboard *
torch *
tqdm *
wget *

environment.yml conda

attrs 19.3.0.*
backcall 0.1.0.*
blas 1.0.*
bleach 3.1.0.*
botorch 0.2.2.*
bzip2 1.0.8.*
ca-certificates 2020.1.1.*
cairo 1.14.12.*
certifi 2019.11.28.*
cffi 1.14.0.*
cycler 0.10.0.*
decorator 4.4.2.*
defusedxml 0.6.0.*
descriptastorus 2.2.0.*
entrypoints 0.3.*
fontconfig 2.13.0.*
freetype 2.9.1.*
gettext 0.19.8.1.*
gpytorch 1.1.1.*
icu 58.2.*
importlib-metadata 1.5.0.*
intel-openmp 2019.4.*
ipykernel 5.1.4.*
ipython 7.13.0.*
ipython_genutils 0.2.0.*
jedi 0.16.0.*
jinja2 2.11.1.*
joblib 0.14.1.*
jpeg 9b.*
jsonschema 3.2.0.*
jupyter_client 6.1.2.*
jupyter_core 4.6.3.*
kiwisolver 1.1.0.*
libblas 3.8.0.*
libboost 1.67.0.*
libcblas 3.8.0.*
libffi 3.2.1.*
libiconv 1.15.*
liblapack 3.8.0.*
libpng 1.6.37.*
libsodium 1.0.16.*
libtiff 4.1.0.*
libxml2 2.9.9.*
llvm-openmp 9.0.1.*
lz4 3.0.2.*
lz4-c 1.8.3.*
markupsafe 1.1.1.*
matplotlib 3.1.3.*
matplotlib-base 3.1.3.*
mistune 0.8.4.*
mkl 2019.4.*
mkl-service 2.3.0.*
mkl_fft 1.0.15.*
mkl_random 1.1.0.*
nbconvert 5.6.1.*
nbformat 5.0.4.*
ninja 1.9.0.*
notebook 6.0.3.*
numpy 1.18.1.*
numpy-base 1.18.1.*
olefile 0.46.*
openssl 1.1.1f.*
pandas 1.0.3.*
pandoc 2.2.3.2
pandocfilters 1.4.2.*
parso 0.6.2.*
pcre 8.43.*
pexpect 4.8.0.*
pickleshare 0.7.5.*
pillow 7.0.0.*
pip 20.0.2.*
pixman 0.38.0.*
prometheus_client 0.7.1.*
prompt-toolkit 3.0.4.*
ptyprocess 0.6.0.*
py-boost 1.67.0.*
pycparser 2.20.*
pygments 2.6.1.*
pyparsing 2.4.6.*
pyrsistent 0.16.0.*
python 3.7.7.*
python-dateutil 2.8.1.*
python_abi 3.7.*
pytorch 1.4.0.*
pytz 2019.3.*
pyzmq 18.1.1.*
scikit-learn 0.22.1.*
scikit-plot 0.3.7.*
scipy 1.4.1.*
seaborn 0.11.0.*
send2trash 1.5.0.*
setuptools 46.1.3.*
six 1.14.0
sqlite 3.31.1.*
tensorboard 2.1.0.*
terminado 0.8.3.*
testpath 0.4.4.*
tk 8.6.8.*
torchvision 0.5.0.*
tornado 6.0.4.*
tqdm 4.44.1.*
traitlets 4.3.3.*
wcwidth 0.1.9.*
webencodings 0.5.1.*
wheel 0.34.2.*
xz 5.2.4.*
zeromq 4.3.1.*
zipp 2.2.0.*
zlib 1.2.11.*
zstd 1.3.7.*

deeppurpose

Science Score: 46.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

A Deep Learning Library for Compound and Protein Modeling DTI, Drug Property, PPI, DDI, Protein Function Prediction

Applications in Drug Repurposing, Virtual Screening, QSAR, Side Effect Prediction and More

News!

Features

Cite Us

Installation

pip

Build from Source

run our code, e.g. click a file in the DEMO folder

run our code, e.g. click a file in the DEMO folder

Example

Case Study 1(a): A Framework for Drug Target Interaction Prediction, with less than 10 lines of codes.

Case Study 1(b): A Framework for Drug Property Prediction, with less than 10 lines of codes.

Case Study 1(c): A Framework for Drug-Drug Interaction Prediction, with less than 10 lines of codes.

Case Study 1(d): A Framework for Protein-Protein Interaction Prediction, with less than 10 lines of codes.

Case Study 1(e): A Framework for Protein Function Prediction, with less than 10 lines of codes.

Case Study 2 (a): Antiviral Drugs Repurposing for SARS-CoV2 3CLPro, using One Line.

Case Study 2(b): Repurposing using Customized training data, with One Line.

Demos

Contact

Encodings

Data

Pretrained models

Documentations

Disclaimer

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

pypi.org: deeppurpose

Rankings

Maintainers (1)

proxy.golang.org: github.com/kexinhuang12345/deeppurpose

Rankings

proxy.golang.org: github.com/kexinhuang12345/DeepPurpose

Rankings

conda-forge.org: deeppurpose

Rankings

Dependencies

A Deep Learning Library for Compound and Protein Modeling
DTI, Drug Property, PPI, DDI, Protein Function Prediction

`pip`