graphsim
graphsim: An R package for simulating gene expression data from graph structures of biological pathways - Published in JOSS (2020)
Science Score: 98.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 16 DOI reference(s) in README and JOSS metadata -
✓Academic publication links
Links to: biorxiv.org, joss.theoj.org, zenodo.org -
○Committers with academic emails
-
○Institutional organization owner
-
✓JOSS paper metadata
Published in Journal of Open Source Software
Keywords
Scientific Fields
Repository
R package: Simulate Expression data from igraph network using mvtnorm (CRAN; JOSS)
Basic Info
Statistics
- Stars: 24
- Watchers: 4
- Forks: 7
- Open Issues: 0
- Releases: 9
Topics
Metadata Files
README.md
graphsim
Version 1.0.3
Simulate Expression Data from 'igraph' Networks
This package provides functions to develop simulated continuous data (e.g., gene expression) from a sigma covariance matrix derived from a graph structure in 'igraph' objects. Intended to extend 'mvtnorm' to take 'igraph' structures rather than sigma matrices as input. This allows the use of simulated data that correctly accounts for pathway relationships and correlations. Here we present a versatile statistical framework to simulate correlated gene expression data from biological pathways, by sampling from a multivariate normal distribution derived from a graph structure. This package allows the simulation of biological pathways from a graph structure based on a statistical model of gene expression, such as simulation of expression profiles that of log-transformed and normalised data from microarray and RNA-Seq data.
Motivation
Network analysis of molecular biological pathways is important for insights into biology and medical genetics. Gene expression profiles capture the regulatory state of a cell and can be used to analyse complex molecular states with genome-scale data. Biological pathways are more than simply sets of genes involved in functions, they are rich in information of relationships defined by pathway structure.
Methods to infer biological pathways and gene regulatory networks from gene expression data can be tested on simulated datasets using this framework. This also allows for pathway structures to be considered as a confounding variable when simulating gene expression data to test the performance of genomics analyses.
This package enables the generation of simulated gene expression datasets containing pathway relationships from a known underlying network. These simulated datasets can be used to evaluate various bioinformatics methodologies, including statistical and network inference procedures.
Network analysis techniques have an important role in understanding of biological pathways and interpretation of genomics studies. Modelling biological pathways allows the evaluation of gene regulatory network inference techniques (which so far rely on experimental validation or resampling). This technique also enables modelling datasets with correlated pathway-structures to assess whether other genomics analysis techniques perform as expected with the background of complex pathways.
Installation
To install the latest release from CRAN:
R
install.packages("graphsim")
To install the stable release of this package from github:
```R
install.packages("devtools")
devtools::install_github("TomKellyGenetics/graphsim", ref = "master") ```
To get the development version of this package from github:
```R
install.packages("devtools")
devtools::install_github("TomKellyGenetics/graphsim", ref = "dev") ```
Usage
Please see the vignettes for demonstrations of this package on examples of simple simulated networks and the reactome pathway TGF-β receptor signaling activates SMADs (R-HSA-2173789). An article with further details has been published in the Journal of Open Source Software.
A help menu can also be accessed within the R environment:
?graphsim
help("graphsim-package")
This will display a help page and link to documentation for each function.
Citation
To cite package 'graphsim' in publications use:
S. Thomas Kelly and Michael A. Black (2020). graphsim: Simulate Expression Data from 'igraph' Networks. R package version 1.0.3. https://github.com/TomKellyGenetics/graphsim doi:10.5281/zenodo.3931288
A BibTeX entry for LaTeX users is:
@Manual{,
title = {{graphsim}: Simulate Expression Data from 'igraph' Networks },
author = {S. Thomas Kelly and Michael A. Black},
year = {2020},
note = {R package version R package version 1.0.3.},
url = {https://github.com/TomKellyGenetics/graphsim},
doi = {10.5281/zenodo.3931288},
}
Please also cite the publication describing use of this package where appropriate.
Kelly, S.T. and Black, M.A. (2020). graphsim: An R package for simulating gene expression data from graph structures of biological pathways. Journal of Open Source Software, 5(51), 2161, https://doi.org/10.21105/joss.02161
@article{Kelly2020joss02161,
doi = {10.21105/joss.02161},
url ={https://doi.org/10.21105/joss.02161},
year = {2020},
publisher = {The Open Journal},
volume = {5},
number = {51},
pages = {2161},
author = {S. Thomas Kelly and Michael A. Black},
title = {graphsim: An R package for simulating gene expression data from graph structures of biological pathways},
journal = {Journal of Open Source Software} }
This article is also avaliable as a preprint.
S. Thomas Kelly, Michael A. Black (2020) graphsim: An R package for simulating gene expression data from graph structures of biological pathways bioRxiv 2020.03.02.972471; doi:https://doi.org/10.1101/2020.03.02.972471
@article {Kelly2020.03.02.972471,
author = {Kelly, S Thomas and Black, Michael A},
title = {graphsim: An R package for simulating gene expression data from graph structures of biological pathways},
elocation-id = {2020.03.02.972471},
year = {2020},
doi = {10.1101/2020.03.02.972471},
publisher = {Cold Spring Harbor Laboratory},
URL = {https://www.biorxiv.org/content/early/2020/03/04/2020.03.02.972471},
eprint = {https://www.biorxiv.org/content/early/2020/03/04/2020.03.02.972471.full.pdf},
journal = {bioRxiv}
}
Contributions and Bug Reports
Please submit issues on GitHub to report
problems or suggest features. Pull requests
to the dev branch on GitHub are also welcome to add features or correct problems. Please see
the contributor guide for more details.
Owner
- Name: Tom Kelly
- Login: TomKellyGenetics
- Kind: user
- Location: Akiruno, Tokyo, Japan
- Company: RIKEN IMS; H.U. Group @hugp-ri
- Website: https://orcid.org/0000-0003-3904-6690
- Twitter: tomkxy
- Repositories: 138
- Profile: https://github.com/TomKellyGenetics
Bioinformatician. PhD 2014-2017: @mikblacklab Otago Uni. Postdoc 2018-2021: @minoda-lab RIKEN. The @carpentries. Genomics | Health | Math | Stats | AWS Dev
JOSS Publication
graphsim: An R package for simulating gene expression data from graph structures of biological pathways
Authors
Tags
gene-expression simulation genomics pathway networkCitation (CITATION.cff)
cff-version: 1.2.0
title: >-
graphsim: An R package for simulating gene
expression data from graph structures of biological
pathways
message: >-
If you use this software, please cite it using the
metadata from this file.
type: software
version: 1.0.4
authors:
- given-names: S. Thomas
family-names: Kelly
email: tom.kelly@postgrad.otago.ac.nz
affiliation1: Department of Biochemistry, University of Otago, PO Box 56, Dunedin 9054, New Zealand
affiliation2: Center for Integrative Medical Sciences, RIKEN, Suehiro-cho-1-7-22, Tsurumi Ward, Yokohama, Japan
orcid: 'https://orcid.org/0000-0003-3904-6690'
- given-names: Michael A.
family-names: Black
email: mik.black@otago.ac.nz
affiliation1: Department of Biochemistry, University of Otago, PO Box 56, Dunedin 9054, New Zealand
orcid: 'https://orcid.org/0000-0003-1174-6054'
doi: 10.21105/joss.02161
date-released: 2020-07-09
url: "https://github.com/TomKellyGenetics/graphsim"
GitHub Events
Total
- Release event: 1
- Watch event: 1
- Push event: 2
- Create event: 1
Last Year
- Release event: 1
- Watch event: 1
- Push event: 2
- Create event: 1
Committers
Last synced: 5 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| TomKellyGenetics | t****s@g****m | 407 |
| = | = | 13 |
| Robrecht Cannoodt | r****d@g****m | 2 |
Issues and Pull Requests
Last synced: 4 months ago
All Time
- Total issues: 11
- Total pull requests: 3
- Average time to close issues: 2 months
- Average time to close pull requests: 3 months
- Total issue authors: 3
- Total pull request authors: 3
- Average comments per issue: 4.27
- Average comments per pull request: 1.67
- Merged pull requests: 2
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- corybrunson (7)
- rcannood (2)
- TomKellyGenetics (2)
Pull Request Authors
- rcannood (1)
- kozo2 (1)
- TomKellyGenetics (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- cran 445 last-month
- Total docker downloads: 21,642
- Total dependent packages: 1
- Total dependent repositories: 2
- Total versions: 6
- Total maintainers: 1
cran.r-project.org: graphsim
Simulate Expression Data from 'igraph' Networks
- Homepage: https://github.com/TomKellyGenetics/graphsim/
- Documentation: http://cran.r-project.org/web/packages/graphsim/graphsim.pdf
- License: GPL-3
-
Latest release: 1.0.4
published 8 months ago
Rankings
Maintainers (1)
Dependencies
- R >= 2.10 depends
- Matrix * imports
- gplots * imports
- graphics * imports
- igraph * imports
- matrixcalc * imports
- mvtnorm * imports
- utils * imports
- R.rsp * suggests
- devtools * suggests
- knitr >= 1.5 suggests
- markdown * suggests
- prettydoc * suggests
- rmarkdown * suggests
- scales * suggests
- testthat * suggests
- vdiffr * suggests
