scMultiSim

A simulator for single cell multi-omics and spatial omics data that provides ground truth to benchmark a wide range of methods.

https://github.com/zhanglabgt/scmultisim

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.8%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

A simulator for single cell multi-omics and spatial omics data that provides ground truth to benchmark a wide range of methods.

Basic Info
Statistics
  • Stars: 60
  • Watchers: 1
  • Forks: 11
  • Open Issues: 11
  • Releases: 1
Created over 4 years ago · Last pushed over 1 year ago
Metadata Files
Readme Changelog

README.md

scMultiSim

Table of contents

scMultiSim is an in silico simulator that generates multi-modality data of single-cells, including gene expression, chromatin accessibility, RNA velocity, and spatial location of cells. It takes a cell differential tree and a gene regulatory network (GRN) as input, and simulates spliced and unspliced counts while accounting for the relationships between modalities. The output single cell gene expression data is determined by three factors: cell-cell interactions, within-cell GRNs and chromatin accessibility. Users can tune the effect of each factor on the output data and set various parameters for the underlying model. Furthermore, the GRN can be set in a time-varying mode where the network's structure changes temporally to reflect the dynamic nature of biological networks. We also provide options to simulate technical variations such as batch effects. scMultiSim can be used to benchmark challenging computational tasks on single-cell multi-omics data, including the inference of GRNs, estimation of RNA velocity, integration of single-cell datasets from multiple batches and modalities, and analysis of cell-cell interaction using the cell spatial location data.

Overview

The following figure briefly shows results from the same cell differential tree:

  1. Connected scATAC-seq and scRNA-seq, in continuous or discrete mode. Visualized by t-SNE.
  2. GRN correlation heatmap, where genes regulated by the same regulator have similar correlations with others.
  3. Unspliced counts and RNA velocity ground truth visualized by t-SNE.
  4. Spatial cell locations and cell-cell interaction ground truth.
  5. Discrete cell population with added batch effects.

Results

Tutorials and documentation

Please check out the tutorials for detailed instructions on how to use scMultiSim.

Installation

scMultiSim can be installed from BioConductor using the following command:

```R if (!require("BiocManager")) { install.packages("BiocManager") }

BiocManager::install("scMultiSim") ```

Shiny App

A Shiny app is provided to help users visualize the effect of each parameter and adjust the simulation options. To run the app, simply call run_shiny().

FAQ

Running Speed

Simulations should finish in a reasonable time in most cases. On a machine with an i7-12700K CPU and 64GB RAM, using 1000 cells, 100 genes and 50 CIFs, the simulation took under 1 mimute to generate both scRNA-seq and scATAC-seq data. If also generating unspliced and spliced counts, or enabling cell-cell interactions, the running time is longer (~3 minutes when RNA velocity is enabled, and 30 minutes for 500 cells with spatial cell-cell interaction enabled).

Contact and reference

GitHub issues are welcomed. It is also possible to send email to the main author Hechen Li (hli691 at gatech.edu).

Please cite

Hechen Li, Ziqi Zhang, Michael Squires, Xi Chen, and Xiuwei Zhang. 2023. “scMultiSim: Simulation of Multi-Modality Single Cell Data Guided by Cell-Cell Interactions and Gene Regulatory Networks.” bioRxiv.

Owner

  • Name: ZhangLabGT
  • Login: ZhangLabGT
  • Kind: organization

GitHub Events

Total
  • Create event: 1
  • Release event: 1
  • Issues event: 5
  • Watch event: 33
  • Issue comment event: 5
  • Push event: 1
  • Fork event: 3
Last Year
  • Create event: 1
  • Release event: 1
  • Issues event: 5
  • Watch event: 33
  • Issue comment event: 5
  • Push event: 1
  • Fork event: 3

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 14
  • Total pull requests: 1
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 2 months
  • Total issue authors: 13
  • Total pull request authors: 1
  • Average comments per issue: 1.29
  • Average comments per pull request: 1.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 5
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 5
  • Pull request authors: 0
  • Average comments per issue: 0.2
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • HelloWorldLTY (2)
  • SammiLyu (1)
  • AlessiaLeclercq (1)
  • TomVuod (1)
  • ankur26197-acog (1)
  • chiwwong (1)
  • Peg-Wu (1)
  • r-trimbour (1)
  • fanyue322 (1)
  • Zwygit2 (1)
  • ivanovaos (1)
  • AnneHartebrodt (1)
  • xfchen0912 (1)
  • vkorobeynyk (1)
Pull Request Authors
  • ankur26197-acog (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 4,039 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 3
  • Total maintainers: 1
bioconductor.org: scMultiSim

Simulation of Multi-Modality Single Cell Data Guided By Gene Regulatory Networks and Cell-Cell Interactions

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 4,039 Total
Rankings
Dependent repos count: 0.0%
Average: 15.6%
Dependent packages count: 31.3%
Maintainers (1)
Last synced: 10 months ago

Dependencies

DESCRIPTION cran
  • Biobase * depends
  • IRanges * depends
  • KernelKnn * depends
  • MASS * depends
  • R >= 3.5.0 depends
  • RColorBrewer * depends
  • Rtsne * depends
  • S4Vectors * depends
  • SummarizedExperiment * depends
  • ape * depends
  • crayon * depends
  • dplyr * depends
  • foreach * depends
  • ggplot2 * depends
  • gplots * depends
  • grid * depends
  • gridExtra * depends
  • gtools * depends
  • parallel * depends
  • phytools * depends
  • repr * depends
  • reshape * depends
  • rlang * depends
  • roxygen2 * depends
  • stringi * depends
  • zeallot * depends
  • knitr * suggests
  • rmarkdown * suggests