drug-repurposing-datasets
Drug Repurposing Datasets for Collaborative Filtering Methods. Notebooks and code to generate datasets for collaborative filtering-based drug repurposing.
https://github.com/recess-eu-project/drug-repurposing-datasets
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 11 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.8%) to scientific vocabulary
Keywords
Repository
Drug Repurposing Datasets for Collaborative Filtering Methods. Notebooks and code to generate datasets for collaborative filtering-based drug repurposing.
Basic Info
- Host: GitHub
- Owner: RECeSS-EU-Project
- License: mit
- Language: Jupyter Notebook
- Default Branch: master
- Homepage: https://recess-eu-project.github.io
- Size: 190 KB
Statistics
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 6
Topics
Metadata Files
README.md

Drug Repurposing Datasets for Collaborative Filtering Methods
This repository is a part of the EU-funded RECeSS project (#101102016), and hosts the notebooks and code files which helped to build the following novel drug repurposing datasets:
The latest versions (v2.0.0 and v2.0.1 of May 29th, 2023) of those datasets correspond to the current notebooks in folder notebooks/. The notebooks corresponding to the v1.0.0 and v1.0.1 (December 28th, 2022) are hosted in folder notebooks/v1.0.0/. To get more details about the datasets themselves, please refer to the corresponding Zenodo pages.
Running the notebooks
Global variables
Update to your liking the following file paths_global.py. The corresponding files and how to obtain them are detailed in notebooks/FEATURELESS_dataset.ipynb.
Environment
In order to run the notebooks:
conda create -n drug_repurposing python=3.8.5 -y
conda activate drug_repurposing
python3 -m pip install -r requirements.txt
python3 -m pip install notebook>=6.5.4 markupsafe==2.0.1 ## packages for Jupyter notebook
conda deactivate
conda activate drug_repurposing
cd notebooks/ && jupyter notebook
Licence
These datasets and code are under an OSI-approved MIT license. Note that the final matrices for PREDICT are not available, due to the usage rules of the DrugBank dataset. However, one can run the corresponding notebook PREDICT_dataset.ipynb in order to get the final matrices. A partial dataset is downloadable on Zenodo.
Citation
Clémence Réda. (2023).
Jupyter Notebooks for building drug repurposing datasets in collaborative filtering (v2.0.0).
Zenodo. https://doi.org/10.5281/zenodo.8014774
Community guidelines with respect to contributions, issue reporting, and support
Pull requests and issue flagging are welcome, and can be made through the GitHub interface. Support can be provided by reaching out to recess-project[at]proton.me.
Owner
- Name: RECeSS EU project
- Login: RECeSS-EU-Project
- Kind: user
- Location: Rostock, Germany
- Company: Universität Rostock
- Website: http://recess-eu-project.github.io/
- Twitter: recess_eu_proj
- Repositories: 1
- Profile: https://github.com/RECeSS-EU-Project
The RECeSS (Robust Explainable Controllable Standard for drug Screening) project is funded by a Marie Skłodowska-Curie Postdoctoral Fellowship 2022.
Citation (CITATION.cff)
cff-version: 1.0.0
message: "If you use these notebooks, please cite them as below."
authors:
- family-names: R\'{e}da
given-names: Cl\'{e}mence
orcid: https://orcid.org/0000-0003-3238-0258
title: "Jupyter Notebooks for building drug repurposing datasets in collaborative filtering"
version: 2.0.0
doi: "10.5281/zenodo.8014774"
date-released: 2023-06-07
CodeMeta (codemeta.json)
{
"@context": "https://doi.org/10.5063/schema/codemeta-2.0",
"@type": "SoftwareSourceCode",
"license": "https://spdx.org/licenses/MIT",
"codeRepository": "git+https://github.com/RECeSS-EU-Project/drug-repurposing-datasets.git",
"dateCreated": "2023-05-29",
"datePublished": "2023-05-29",
"dateModified": "2023-05-29",
"downloadUrl": "https://github.com/RECeSS-EU-Project/drug-repurposing-datasets/archive/refs/heads/master.zip",
"issueTracker": "https://github.com/RECeSS-EU-Project/drug-repurposing-datasets/issues",
"name": "drug-repurposing-datasets",
"version": "2.0.0",
"identifier": "10.5281/zenodo.8014774",
"description": "This repository is a part of the EU-funded RECeSS project (#101102016), and hosts the notebooks and code files which helped to build the following novel drug repurposing datasets: TRANSCRIPT dataset (https://doi.org/10.5281/zenodo.7982969) and PREDICT dataset (https://doi.org/10.5281/zenodo.7982964).",
"applicationCategory": "Drug development",
"releaseNotes": "This version is updated with versions of the different databases and knowledge as of May 2023.",
"funding": "RECeSS - Robust Explainable Controllable Standard for drug Screening (101102016)",
"developmentStatus": "active",
"isPartOf": "https://recess-eu-project.github.io/",
"readme": "https://github.com/RECeSS-EU-Project/drug-repurposing-datasets/blob/master/README.md",
"softwareVersion": "2.0.0",
"funder": {
"@type": "Organization",
"name": "European Union's Horizon 2020 research and innovation programme"
},
"keywords": [
"collaborative filtering",
"dataset",
"drug repurposing",
"science reproducibility"
],
"programmingLanguage": [
"Python 3"
],
"softwareRequirements": [
"nordic>=2.4.2",
"pandas>=1.1.4",
"numpy>=1.19.4",
"scipy>=1.5.4",
"bs4>=0.0.1",
"pubchempy>=1.0.4",
"joblib>=1.0.1",
"epam.indigo>=1.4.3",
"scikit-learn>=0.24.0",
"biopython>=1.78",
"cmapPy>=4.0.1",
"selenium>=4.9.1",
"tqdm>=4.58.0",
"requests-html>=0.10.0",
"obonet>=0.3.0",
"networkx>=2.6.3",
"nbconvert[qtpdf]"
],
"author": [
{
"@type": "Person",
"@id": "https://orcid.org/0000-0003-3238-0258",
"givenName": "Clmence",
"familyName": "Rda",
"email": "clemence.reda@uni-rostock.de",
"affiliation": {
"@type": "Organization",
"name": "Systems Biology and Informatics, University of Rostock, Rostock, Germany"
}
}
],
"maintainer": [
{
"@type": "Person",
"@id": "https://orcid.org/0000-0003-3238-0258",
"givenName": "Clmence",
"familyName": "Rda",
"email": "clemence.reda@uni-rostock.de",
"affiliation": {
"@type": "Organization",
"name": "Systems Biology and Informatics, University of Rostock, Rostock, Germany"
}
}
]
}
GitHub Events
Total
- Watch event: 2
Last Year
- Watch event: 2