dbparser

Source code for the R package, "dbparser" (i.e. DrugBank Parser)

https://github.com/ropensci/dbparser

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (19.2%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Source code for the R package, "dbparser" (i.e. DrugBank Parser)

Basic Info
Statistics
  • Stars: 61
  • Watchers: 4
  • Forks: 19
  • Open Issues: 3
  • Releases: 10
Created over 7 years ago · Last pushed almost 2 years ago
Metadata Files
Readme Changelog Contributing License Code of conduct Codemeta

README.Rmd

---
output: github_document
---



```{r setup, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
library(dplyr)
library(ggplot2)
```
# dbparser 

[![CRAN\_Status\_Badge](http://www.r-pkg.org/badges/version/dbparser)](https://cran.r-project.org/package=dbparser)
[![codecov](https://codecov.io/gh/ropensci/dbparser/branch/master/graph/badge.svg)](https://app.codecov.io/gh/ropensci/dbparser)
[![Project Status: Active – The project has reached a stable, usable state and is being actively developed.](https://www.repostatus.org/badges/latest/active.svg)](https://www.repostatus.org/#active)
[![Lifecycle: stable](https://img.shields.io/badge/lifecycle-stable-brightgreen.svg)](https://lifecycle.r-lib.org/articles/stages.html)
[![metacran downloads](https://cranlogs.r-pkg.org/badges/grand-total/dbparser)](https://cran.r-project.org/package=dbparser)
[![](https://img.shields.io/badge/Doc-Rdoc-blue.svg)](https://www.rdocumentation.org/packages/dbparser)
[![CII Best Practices](https://bestpractices.coreinfrastructure.org/projects/3311/badge)](https://bestpractices.coreinfrastructure.org/projects/3311)
[![](https://badges.ropensci.org/347_status.svg)](https://github.com/ropensci/software-review/issues/347)

## Overview
Drugs databases vary too much in their formats and structures which making related 
data analysis not a very easy job and requires a lot of efforts to work on only
two databases together such as [DrugBank](https://go.drugbank.com/) and [KEGG](https://www.genome.jp/kegg/).

Hence, `dbparser` package aims to parse different public drugs databases as [DrugBank](https://go.drugbank.com/) or [KEGG](https://www.genome.jp/kegg/) into
single and unified format R object called `dvobject` (stands for drugverse object).

That should help in:

- working with single data object and not multiple databases in different formats,
- using R analysis capabilities easily on drugs data,
- ease of transferring data between researchers after performing required data
analysis or `dvobject` and storing results in the same object in a very easy manner

### dvobject Structure
`dvobject` introduces a unified and compressed format of drugs data.
It is an R list object that contains one or more of the following sub-lists:

- **drugs**: list of data.frames that contain drugs information (i.e. synonyms, classifications, ...) and it is the only mandatory list
- **salts**: data.frame contains drugs salts information
- **products**: data.frame of commercially available drugs products in the world
- **references**: data.frame of articles, links and textbooks about drugs or CETT data
- **cett**: list of data.frames contain targets, enzymes, carriers and transporters information

## Drug Databases
Parsers are available for the following databases (it is in progress list)

### DrugBank

[DrugBank](https://go.drugbank.com/) database is a comprehensive, freely
accessible, online database containing information on drugs and drug
targets. As both a bioinformatics and a cheminformatics resource,
DrugBank combines detailed drug (i.e. chemical, pharmacological and
pharmaceutical) data with comprehensive drug target (i.e. sequence,
structure, and pathway) information. More information about DrugBank can
be found [here](https://go.drugbank.com/about).

In its raw form, the DrugBank database is a single
XML file. Users must create an [account](https://go.drugbank.com/public_users/sign_up)
with DrugBank and request permission to [download](https://go.drugbank.com/releases/latest)
the database. Note that this may take a couple of days.

The `dbparser` package parses the DrugBank XML database into `R` tibbles that can be explored and analyzed by the user, check [this tutorial](https://docs.ropensci.org/dbparser/articles/dbparser.html) for more details.

If you are waiting for access to the DrugBank database, or do not intend to do a deep dive with
the data, you may wish to use the `dbdataset`
[package](https://interstellar-consultation-services.github.io/dbdataset/), which contains
the DrugBank database already parsed into `dvobject`. Note that this is a large package that
exceeds the limit set by CRAN. It is only available on GitHub.

`dbparser` is tested against DrugBank versions *5.1.0* through *5.1.12*
successfully. If you find errors with these versions or any other
version please submit an issue
[here](https://github.com/ropensci/dbparser/issues).

## Installation

You can install the released version of dbparser from 
[CRAN](https://CRAN.R-project.org) with:

``` r
install.packages("dbparser")
```
or you can install the latest updates directly from the repo

``` r
library(devtools)
devtools::install_github("ropensci/dbparser")
```

## Code of Conduct
Please note that the 'dbparser' project is released with a
  [Contributor Code of Conduct](https://docs.ropensci.org/dbparser/CODE_OF_CONDUCT.html).
  By contributing to this project, you agree to abide by its terms.
  
## Contributing Guide
👍🎉 First off, thanks for taking the time to contribute! 🎉👍
Please review our [Contributing Guide](https://docs.ropensci.org/dbparser/CONTRIBUTING.html).

## Share the love ❤️

Think **dbparser** is useful? Let others discover it, by telling them in person, via Twitter or a blog post.

Using **dbparser** for a paper you are writing? Consider citing it
```{r}
citation("dbparser")
```

Owner

  • Name: rOpenSci
  • Login: ropensci
  • Kind: organization
  • Email: info@ropensci.org
  • Location: Berkeley, CA

JOSS Publication

dbparser: An R Package for Parsing and Integrating Pharmacological Databases
Published
February 13, 2026
Volume 11, Issue 118, Page 9950
Authors
Mohammed Ali ORCID
Independent Researcher
Ali Ezzat
Independent Researcher
Editor
Arfon Smith ORCID
Tags
bioinformatics pharmacology pharmacovigilance drug discovery DrugBank reproducible research

CodeMeta (codemeta.json)

{
  "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
  "@type": "SoftwareSourceCode",
  "identifier": "dbparser",
  "description": "This tool is for parsing public drug databases such as 'DrugBank' XML database <https://go.drugbank.com/>. The parsed data are then returned in a proper 'R' object called 'dvobject'.",
  "name": "dbparser: Drugs Databases Parser",
  "relatedLink": [
    "https://docs.ropensci.org/dbparser/",
    "https://CRAN.R-project.org/package=dbparser"
  ],
  "codeRepository": "https://github.com/ropensci/dbparser/",
  "issueTracker": "https://github.com/ropensci/dbparser/issues",
  "license": "https://spdx.org/licenses/MIT",
  "version": "2.0.1",
  "programmingLanguage": {
    "@type": "ComputerLanguage",
    "name": "R",
    "url": "https://r-project.org"
  },
  "runtimePlatform": "R version 4.2.2 (2022-10-31 ucrt)",
  "provider": {
    "@id": "https://cran.r-project.org",
    "@type": "Organization",
    "name": "Comprehensive R Archive Network (CRAN)",
    "url": "https://cran.r-project.org"
  },
  "author": [
    {
      "@type": "Person",
      "givenName": "Mohammed",
      "familyName": "Ali",
      "email": "moh_fcis@yahoo.com"
    },
    {
      "@type": "Person",
      "givenName": "Ali",
      "familyName": "Ezzat",
      "email": "ali_ezzat85@yahoo.com"
    }
  ],
  "maintainer": [
    {
      "@type": "Person",
      "givenName": "Mohammed",
      "familyName": "Ali",
      "email": "moh_fcis@yahoo.com"
    }
  ],
  "softwareSuggestions": [
    {
      "@type": "SoftwareApplication",
      "identifier": "knitr",
      "name": "knitr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=knitr"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "rmarkdown",
      "name": "rmarkdown",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=rmarkdown"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "testthat",
      "name": "testthat",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=testthat"
    }
  ],
  "softwareRequirements": {
    "1": {
      "@type": "SoftwareApplication",
      "identifier": "dplyr",
      "name": "dplyr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=dplyr"
    },
    "2": {
      "@type": "SoftwareApplication",
      "identifier": "progress",
      "name": "progress",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=progress"
    },
    "3": {
      "@type": "SoftwareApplication",
      "identifier": "purrr",
      "name": "purrr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=purrr"
    },
    "4": {
      "@type": "SoftwareApplication",
      "identifier": "tibble",
      "name": "tibble",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=tibble"
    },
    "5": {
      "@type": "SoftwareApplication",
      "identifier": "tools",
      "name": "tools"
    },
    "6": {
      "@type": "SoftwareApplication",
      "identifier": "XML",
      "name": "XML",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=XML"
    },
    "7": {
      "@type": "SoftwareApplication",
      "identifier": "R",
      "name": "R",
      "version": ">= 3.5"
    },
    "SystemRequirements": null
  },
  "fileSize": "754.17KB",
  "citation": [
    {
      "@type": "SoftwareSourceCode",
      "author": [
        {
          "@type": "Person",
          "givenName": "Mohammed",
          "familyName": "Ali"
        },
        {
          "@type": "Person",
          "givenName": "Ali",
          "familyName": "Ezzat"
        }
      ],
      "name": "DrugBank Database XML Parser",
      "url": "https://CRAN.R-project.org/package=dbparser",
      "description": "R package version 2.0.1"
    }
  ],
  "releaseNotes": "https://github.com/ropensci/dbparser/blob/master/NEWS.md",
  "readme": "https://github.com/ropensci/dbparser/blob/master/README.md",
  "contIntegration": "https://app.codecov.io/gh/ropensci/dbparser",
  "developmentStatus": [
    "https://www.repostatus.org/#active",
    "https://lifecycle.r-lib.org/articles/stages.html"
  ],
  "review": {
    "@type": "Review",
    "url": "https://github.com/ropensci/software-review/issues/347",
    "provider": "https://ropensci.org"
  }
}

GitHub Events

Total
  • Issues event: 2
  • Watch event: 2
  • Issue comment event: 7
Last Year
  • Issues event: 2
  • Watch event: 2
  • Issue comment event: 7

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 472
  • Total Committers: 8
  • Avg Commits per committer: 59.0
  • Development Distribution Score (DDS): 0.15
Past Year
  • Commits: 47
  • Committers: 1
  • Avg Commits per committer: 47.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Mohammed Ali m****i@u****m 401
Mohammed Ali m****d@a****m 47
Mohammed Ali m****s@y****m 15
alizat a****5@y****m 4
Emma Mendelsohn m****n@e****g 2
Mohammed Ali m****i@d****m 1
amrrs a****s@o****m 1
Noam Ross n****s@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 99
  • Total pull requests: 22
  • Average time to close issues: 9 months
  • Average time to close pull requests: 7 days
  • Total issue authors: 17
  • Total pull request authors: 4
  • Average comments per issue: 0.93
  • Average comments per pull request: 0.09
  • Merged pull requests: 22
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 2
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • MohammedFCIS (72)
  • alizat (9)
  • maelle (2)
  • mostafiz67 (2)
  • aycomp (2)
  • dgcovell (1)
  • AnjaConev (1)
  • lamnvcnsh (1)
  • penglimn (1)
  • ag1805x (1)
  • kcmtest (1)
  • JustinGibbons (1)
  • iDemirsoy (1)
  • miguecass (1)
  • dsasson49 (1)
Pull Request Authors
  • MohammedFCIS (25)
  • maelle (2)
  • noamross (1)
  • emmamendelsohn (1)
Top Labels
Issue Labels
enhancement (18) documentation (10) wontfix (10) bug (7) ropensci (7) question (5) release (4) help wanted (2) duplicate (2) not reproducible (1) dependencies (1) good first issue (1)
Pull Request Labels

Dependencies

DESCRIPTION cran
  • R >= 2.10 depends
  • DBI * imports
  • RSQLite * imports
  • XML * imports
  • dplyr * imports
  • odbc * imports
  • progress * imports
  • purrr * imports
  • readr * imports
  • tibble * imports
  • tools * imports
  • knitr * suggests
  • rmarkdown * suggests
  • testthat * suggests