graphsim

graphsim: An R package for simulating gene expression data from graph structures of biological pathways - Published in JOSS (2020)

https://github.com/tomkellygenetics/graphsim

Science Score: 98.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 16 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: biorxiv.org, joss.theoj.org, zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

benchmarking gene-expression gene-regulatory-networks genetics genomic-data-analysis genomics graph-algorithms igraph-networks joss ngs-analysis package r-stats simulated-data simulation-modeling

Scientific Fields

Sociology Social Sciences - 31% confidence
Last synced: 4 months ago · JSON representation ·

Repository

R package: Simulate Expression data from igraph network using mvtnorm (CRAN; JOSS)

Basic Info
  • Host: GitHub
  • Owner: TomKellyGenetics
  • License: gpl-3.0
  • Language: HTML
  • Default Branch: master
  • Homepage:
  • Size: 35.8 MB
Statistics
  • Stars: 24
  • Watchers: 4
  • Forks: 7
  • Open Issues: 0
  • Releases: 9
Topics
benchmarking gene-expression gene-regulatory-networks genetics genomic-data-analysis genomics graph-algorithms igraph-networks joss ngs-analysis package r-stats simulated-data simulation-modeling
Created over 9 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog Contributing License Code of conduct Citation

README.md

Travis-CI Build Status CircleCI build status AppVeyor Build Status Project Status: Active – The project has reached a stable, usable state and is being actively developed.

CRAN_Status_Badge Downloads Total Downloads

DOI bioRxiv status


graphsim

Version 1.0.3

Simulate Expression Data from 'igraph' Networks

This package provides functions to develop simulated continuous data (e.g., gene expression) from a sigma covariance matrix derived from a graph structure in 'igraph' objects. Intended to extend 'mvtnorm' to take 'igraph' structures rather than sigma matrices as input. This allows the use of simulated data that correctly accounts for pathway relationships and correlations. Here we present a versatile statistical framework to simulate correlated gene expression data from biological pathways, by sampling from a multivariate normal distribution derived from a graph structure. This package allows the simulation of biological pathways from a graph structure based on a statistical model of gene expression, such as simulation of expression profiles that of log-transformed and normalised data from microarray and RNA-Seq data.

Motivation

Network analysis of molecular biological pathways is important for insights into biology and medical genetics. Gene expression profiles capture the regulatory state of a cell and can be used to analyse complex molecular states with genome-scale data. Biological pathways are more than simply sets of genes involved in functions, they are rich in information of relationships defined by pathway structure.

Methods to infer biological pathways and gene regulatory networks from gene expression data can be tested on simulated datasets using this framework. This also allows for pathway structures to be considered as a confounding variable when simulating gene expression data to test the performance of genomics analyses.

This package enables the generation of simulated gene expression datasets containing pathway relationships from a known underlying network. These simulated datasets can be used to evaluate various bioinformatics methodologies, including statistical and network inference procedures.

Network analysis techniques have an important role in understanding of biological pathways and interpretation of genomics studies. Modelling biological pathways allows the evaluation of gene regulatory network inference techniques (which so far rely on experimental validation or resampling). This technique also enables modelling datasets with correlated pathway-structures to assess whether other genomics analysis techniques perform as expected with the background of complex pathways.

Installation

To install the latest release from CRAN:

R install.packages("graphsim")

To install the stable release of this package from github:

```R

install.packages("devtools")

devtools::install_github("TomKellyGenetics/graphsim", ref = "master") ```

To get the development version of this package from github:

```R

install.packages("devtools")

devtools::install_github("TomKellyGenetics/graphsim", ref = "dev") ```

Usage

Please see the vignettes for demonstrations of this package on examples of simple simulated networks and the reactome pathway TGF-β receptor signaling activates SMADs (R-HSA-2173789). An article with further details has been published in the Journal of Open Source Software.

A help menu can also be accessed within the R environment:

?graphsim

help("graphsim-package")

This will display a help page and link to documentation for each function.


Citation

To cite package 'graphsim' in publications use:

S. Thomas Kelly and Michael A. Black (2020). graphsim: Simulate Expression Data from 'igraph' Networks. R package version 1.0.3. https://github.com/TomKellyGenetics/graphsim doi:10.5281/zenodo.3931288

A BibTeX entry for LaTeX users is:

@Manual{, title = {{graphsim}: Simulate Expression Data from 'igraph' Networks }, author = {S. Thomas Kelly and Michael A. Black}, year = {2020}, note = {R package version R package version 1.0.3.}, url = {https://github.com/TomKellyGenetics/graphsim}, doi = {10.5281/zenodo.3931288}, }

Please also cite the publication describing use of this package where appropriate.

Kelly, S.T. and Black, M.A. (2020). graphsim: An R package for simulating gene expression data from graph structures of biological pathways. Journal of Open Source Software, 5(51), 2161, https://doi.org/10.21105/joss.02161

@article{Kelly2020joss02161, doi = {10.21105/joss.02161}, url ={https://doi.org/10.21105/joss.02161}, year = {2020}, publisher = {The Open Journal}, volume = {5}, number = {51}, pages = {2161}, author = {S. Thomas Kelly and Michael A. Black}, title = {graphsim: An R package for simulating gene expression data from graph structures of biological pathways}, journal = {Journal of Open Source Software} }

This article is also avaliable as a preprint.

S. Thomas Kelly, Michael A. Black (2020) graphsim: An R package for simulating gene expression data from graph structures of biological pathways bioRxiv 2020.03.02.972471; doi:https://doi.org/10.1101/2020.03.02.972471

@article {Kelly2020.03.02.972471, author = {Kelly, S Thomas and Black, Michael A}, title = {graphsim: An R package for simulating gene expression data from graph structures of biological pathways}, elocation-id = {2020.03.02.972471}, year = {2020}, doi = {10.1101/2020.03.02.972471}, publisher = {Cold Spring Harbor Laboratory}, URL = {https://www.biorxiv.org/content/early/2020/03/04/2020.03.02.972471}, eprint = {https://www.biorxiv.org/content/early/2020/03/04/2020.03.02.972471.full.pdf}, journal = {bioRxiv} }


Contributions and Bug Reports

Please submit issues on GitHub to report problems or suggest features. Pull requests to the dev branch on GitHub are also welcome to add features or correct problems. Please see the contributor guide for more details.

Owner

  • Name: Tom Kelly
  • Login: TomKellyGenetics
  • Kind: user
  • Location: Akiruno, Tokyo, Japan
  • Company: RIKEN IMS; H.U. Group @hugp-ri

Bioinformatician. PhD 2014-2017: @mikblacklab Otago Uni. Postdoc 2018-2021: @minoda-lab RIKEN. The @carpentries. Genomics | Health | Math | Stats | AWS Dev

JOSS Publication

graphsim: An R package for simulating gene expression data from graph structures of biological pathways
Published
July 09, 2020
Volume 5, Issue 51, Page 2161
Authors
S. Thomas Kelly ORCID
Department of Biochemistry, University of Otago, PO Box 56, Dunedin 9054, New Zealand , RIKEN Center for Integrative Medical Sciences, Suehiro-cho-1-7-22, Tsurumi Ward, Yokohama, Kanagawa 230-0045, Japan
Michael A. Black ORCID
Department of Biochemistry, University of Otago, PO Box 56, Dunedin 9054, New Zealand
Editor
Mark A. Jensen ORCID
Tags
gene-expression simulation genomics pathway network

Citation (CITATION.cff)

cff-version: 1.2.0
title: >-
  graphsim: An R package for simulating gene
  expression data from graph structures of biological
  pathways
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
version: 1.0.4
authors:
  - given-names: S. Thomas
    family-names: Kelly
    email: tom.kelly@postgrad.otago.ac.nz
    affiliation1: Department of Biochemistry, University of Otago, PO Box 56, Dunedin 9054, New Zealand
    affiliation2: Center for Integrative Medical Sciences, RIKEN, Suehiro-cho-1-7-22, Tsurumi Ward, Yokohama, Japan
    orcid: 'https://orcid.org/0000-0003-3904-6690'
  - given-names: Michael A.
    family-names: Black
    email: mik.black@otago.ac.nz
    affiliation1: Department of Biochemistry, University of Otago, PO Box 56, Dunedin 9054, New Zealand
    orcid: 'https://orcid.org/0000-0003-1174-6054'
doi: 10.21105/joss.02161
date-released: 2020-07-09
url: "https://github.com/TomKellyGenetics/graphsim"

GitHub Events

Total
  • Release event: 1
  • Watch event: 1
  • Push event: 2
  • Create event: 1
Last Year
  • Release event: 1
  • Watch event: 1
  • Push event: 2
  • Create event: 1

Committers

Last synced: 5 months ago

All Time
  • Total Commits: 422
  • Total Committers: 3
  • Avg Commits per committer: 140.667
  • Development Distribution Score (DDS): 0.036
Past Year
  • Commits: 2
  • Committers: 1
  • Avg Commits per committer: 2.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
TomKellyGenetics t****s@g****m 407
= = 13
Robrecht Cannoodt r****d@g****m 2

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 11
  • Total pull requests: 3
  • Average time to close issues: 2 months
  • Average time to close pull requests: 3 months
  • Total issue authors: 3
  • Total pull request authors: 3
  • Average comments per issue: 4.27
  • Average comments per pull request: 1.67
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • corybrunson (7)
  • rcannood (2)
  • TomKellyGenetics (2)
Pull Request Authors
  • rcannood (1)
  • kozo2 (1)
  • TomKellyGenetics (1)
Top Labels
Issue Labels
wip (2) enhancement (2) cran (1)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 445 last-month
  • Total docker downloads: 21,642
  • Total dependent packages: 1
  • Total dependent repositories: 2
  • Total versions: 6
  • Total maintainers: 1
cran.r-project.org: graphsim

Simulate Expression Data from 'igraph' Networks

  • Versions: 6
  • Dependent Packages: 1
  • Dependent Repositories: 2
  • Downloads: 445 Last month
  • Docker Downloads: 21,642
Rankings
Forks count: 8.7%
Stargazers count: 11.5%
Average: 17.5%
Dependent packages count: 18.1%
Dependent repos count: 19.2%
Downloads: 21.7%
Docker downloads count: 26.0%
Maintainers (1)
Last synced: 4 months ago

Dependencies

DESCRIPTION cran
  • R >= 2.10 depends
  • Matrix * imports
  • gplots * imports
  • graphics * imports
  • igraph * imports
  • matrixcalc * imports
  • mvtnorm * imports
  • utils * imports
  • R.rsp * suggests
  • devtools * suggests
  • knitr >= 1.5 suggests
  • markdown * suggests
  • prettydoc * suggests
  • rmarkdown * suggests
  • scales * suggests
  • testthat * suggests
  • vdiffr * suggests