awesome-drug-discovery-knowledge-graphs
A collection of research papers, datasets and software related to knowledge graphs for drug discovery. Accompanies the paper "A review of biomedical datasets relating to drug discovery: a knowledge graph perspective" (Briefings in Bioinformatics, 2022)
https://github.com/astrazeneca/awesome-drug-discovery-knowledge-graphs
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.6%) to scientific vocabulary
Keywords
Repository
A collection of research papers, datasets and software related to knowledge graphs for drug discovery. Accompanies the paper "A review of biomedical datasets relating to drug discovery: a knowledge graph perspective" (Briefings in Bioinformatics, 2022)
Basic Info
- Host: GitHub
- Owner: AstraZeneca
- License: apache-2.0
- Default Branch: main
- Homepage: https://academic.oup.com/bib/article-abstract/23/6/bbac404/6712301
- Size: 413 KB
Statistics
- Stars: 236
- Watchers: 13
- Forks: 26
- Open Issues: 0
- Releases: 2
Topics
Metadata Files
README.md
Awesome Drug Discovery Knowledge Graphs
A collection of datasets and associated research papers related to knowledge graphs suitable for use in drug discovery.
Overview
Drug discovery and development is a complex and costly process. Machine learning approaches are being investigated to help improve the effectiveness and speed of multiple stages of the drug discovery pipeline. Of these, those that use Knowledge Graphs (KG) have promise in many tasks, including drug repurposing, drug toxicity prediction and target gene-disease prioritisation. In a drug discovery KG, crucial elements including genes, diseases and drugs are represented as entities, whilst relationships between them indicate an interaction. However, to construct high-quality KGs, suitable data is required. In this review, we detail publicly available sources suitable for use in constructing drug discovery focused KGs. We aim to help guide machine learning and KG practitioners who are interested in applying new techniques to the drug discovery field, but who may be unfamiliar with the relevant data sources. The datasets are selected via strict criteria, categorised according to the primary type of information contained within and are considered based upon what information could be extracted to build a KG. We then present a comparative analysis of existing public drug discovery KGs and a evaluation of selected motivating case studies from the literature. Additionally, we raise numerous and unique challenges and issues associated with the domain and its datasets, whilst also highlighting key future research directions. We hope this review will motivate KGs use in solving key and emerging questions in the drug discovery domain.
The Survey Paper
This repository accompanies our survey paper A Review of Biomedical Datasets Relating to Drug Discovery: A Knowledge Graph Perspective.
Please consider citing the associated paper for this resource if you find it useful:
@article{bonner2022review,
title={A review of biomedical datasets relating to drug discovery: A knowledge graph perspective},
author={Bonner, Stephen and Barrett, Ian P and Ye, Cheng and Swiers, Rowan and Engkvist, Ola and Bender, Andreas and Hoyt, Charles Tapley and Hamilton, William L},
journal={Briefings in Bioinformatics},
volume={23},
number={6},
year={2022},
publisher={Oxford Academic}
}
Contents
This repository primarily collects together public knowledge graph which could be used for drug discovery. We provide a list of such resources with links to the associated manuscripts, download locations and, wherever possible, the code used to create or update the resources. The list can be found using the link below:
Drug Discovery Knowledge Graphs
Additionally, we provide separate lists of key biomedical resources which are often used to compose these graphs, as well as some notable applications of KG use within drug discovery:
Contributing
We welcome the addition of new resources, please see our contributing guide for information on how to do this.
Note On Publication Version
This list will continue to evolve as new resources are made available. If you want to view the list which matches the version of the published manuscript, please use this link.
License
- Apache 2.0
Owner
- Name: AstraZeneca
- Login: AstraZeneca
- Kind: organization
- Location: Global
- Website: https://www.astrazeneca.com/
- Repositories: 33
- Profile: https://github.com/AstraZeneca
Data and AI: Unlocking new science insights
Citation (CITATION.bib)
@article{bonner2022review,
title={A review of biomedical datasets relating to drug discovery: A knowledge graph perspective},
author={Bonner, Stephen and Barrett, Ian P and Ye, Cheng and Swiers, Rowan and Engkvist, Ola and Bender, Andreas and Hoyt, Charles Tapley and Hamilton, William L},
journal={Briefings in Bioinformatics},
volume={23},
number={6},
year={2022},
publisher={Oxford Academic}
}
GitHub Events
Total
- Watch event: 29
- Delete event: 2
- Push event: 2
- Pull request event: 5
- Fork event: 2
- Create event: 2
Last Year
- Watch event: 29
- Delete event: 2
- Push event: 2
- Pull request event: 5
- Fork event: 2
- Create event: 2
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| sbonner0 | s****0@i****m | 33 |
| ethanknights | e****s@h****k | 1 |
| Ben Busby | D****s | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 9 months ago
All Time
- Total issues: 0
- Total pull requests: 6
- Average time to close issues: N/A
- Average time to close pull requests: 6 days
- Total issue authors: 0
- Total pull request authors: 3
- Average comments per issue: 0
- Average comments per pull request: 0.33
- Merged pull requests: 6
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 3
- Average time to close issues: N/A
- Average time to close pull requests: less than a minute
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- sbonner0 (7)
- DCGenomics (1)
- ethanknights (1)