dreamlet

Perform differential expression analysis on cohort-scale single cell datasets using linear mixed models

https://github.com/diseaseneurogenomics/dreamlet

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.0%) to scientific vocabulary
Last synced: 7 months ago

Repository

Perform differential expression analysis on cohort-scale single cell datasets using linear mixed models

Basic Info
Statistics
  • Stars: 14
  • Watchers: 0
  • Forks: 0
  • Open Issues: 1
  • Releases: 2
Fork of GabrielHoffman/dreamlet
Created about 3 years ago · Last pushed 11 months ago

https://github.com/DiseaseNeuroGenomics/dreamlet/blob/devel/


 
### Scalable differential expression analysis of single cell transcriptomics datasets with complex study designs

The `dreamlet` package enables differential expression analysis on multi-sample single cell datasets using linear (mixed) models with precision weights.

Major functionality of the `dreamlet` package, using the [Bioconductor](https://www.bioconductor.org) [`SingleCellExperiment`](https://www.bioconductor.org/packages/SingleCellExperiment/) interface:

+ [`aggregateToPseudoBulk()`](reference/aggregateToPseudoBulk.html): Fast evaluation of pseudobulk from raw or estimated counts
+ [`processAssays()`](reference/processAssays.html): Normalize aggregated counts, compute precision weights
+ [`fitVarPart()`](reference/fitVarPart.html): Variance partitioning analysis
+ [`dreamlet()`](reference/dreamlet.html): Differential expression analysis across samples
+ [`dreamletCompareClusters()`](reference/dreamletCompareClusters.html): Differential expression analysis across cell clusters
+ [`zenith_gsa()`](reference/zenith_gsa-methods.html): Gene set analysis with full spectrum of test statistics
+ [`compositePosteriorTest()`](reference/compositePosteriorTest.html): Test cell type specificity of effects with Bayesian meta-analysis
+ [`meta_analysis()`](reference/meta_analysis.html): Frequentist meta-analysis across cohorts
+ [`outlierByAssay()`](reference/outlierByAssay.html): Outlier detection based on gene expression

## Resources

+ [Preprint](https://doi.org/10.1101/2023.03.17.533005) on bioRxiv
+ Full [reproducible analysis code](https://github.com/GabrielHoffman/dreamlet_analysis) for 4 large-scale datasets

## Motivation
Recent advances in single cell/nucleus transcriptomic technology have enabled collection of population-level datasets to study cell type-specific gene expression differences associated with disease state, stimulus, and genetic regulation. The scale of these data, complex study designs, and low read count per cell mean that characterizing cell type-specific molecular mechanisms requires a user-friendly, purpose-built analytical framework. We have developed the dreamlet package, which applies a pseudobulk approach and fits a regression model for each gene and cell cluster to test differential expression across individuals associated with a trait of interest. Use of precision-weighted linear mixed models enables accounting for repeated measures study designs, high dimensional batch effects, and varying sequencing depth or observed cells per biosample.

## Dreamlet workflow

## Technical intro

Dreamlet further enables analysis of massive-scale single cell/nucleus transcriptome datasets by addressing both CPU and memory usage limitations. Dreamlet performs preprocessing and statistical analysis in parallel on multicore machines, and can distribute work across multiple nodes on a compute cluster. Dreamlet also uses the [H5AD format](https://anndata.readthedocs.io/en/latest/index.html) for on-disk data storage, so that data can be processed in smaller chunks, which dramatically reduces memory usage.

The dreamlet workflow easily integrates into the [Bioconductor](https://www.bioconductor.org) ecosystem, and uses the [`SingleCellExperiment`](https://www.bioconductor.org/packages/SingleCellExperiment/) class to facilitate compatibility with other analyses. Beyond differential expression testing, dreamlet provides seamless integration of downstream analyses, including quantifying sources of expression variation, gene set analysis using the full spectrum of gene-level t-statistics, testing differences in cell type composition, and visualizing results.

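
The pseudobulk approach mentioned under Motivation can be pictured with a small base R sketch. This is only an illustration of the idea on toy data: the matrix, the sample and cluster labels, and all variable names are hypothetical, and dreamlet's own `aggregateToPseudoBulk()` works on a `SingleCellExperiment` and is implemented differently.

```r
# Toy data: 6 genes x 12 cells (all names here are hypothetical)
set.seed(1)
counts <- matrix(
  rpois(6 * 12, lambda = 5),
  nrow = 6,
  dimnames = list(paste0("gene", 1:6), paste0("cell", 1:12))
)

# Per-cell annotation: which sample and which cell cluster each cell belongs to
cell_info <- data.frame(
  sample_id  = rep(c("s1", "s2", "s3"), each = 4),
  cluster_id = rep(c("A", "B"), times = 6)
)

# One pseudobulk column per cluster / sample combination:
# sum the counts of all cells that share the same label
grp <- paste(cell_info$cluster_id, cell_info$sample_id, sep = "_")
pseudobulk <- t(rowsum(t(counts), group = grp))

# Number of cells contributing to each pseudobulk column
n_cells <- table(grp)

print(pseudobulk)
print(n_cells)
```

The resulting matrix has one row per gene and one column per cluster and sample pair, which is the unit on which a per-gene regression can then be fit. The cell counts are kept because the number of observed cells per biosample varies in real data and is one of the factors the precision weights are meant to account for.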
Dreamlet builds on previous work on variance partitioning and differential expression using precision-weighted linear mixed models in the [variancePartition](https://bioconductor.org/packages/variancePartition/) package. The dreamlet package is designed to be easily adopted by users of [variancePartition](https://bioconductor.org/packages/variancePartition/) and [limma](https://bioconductor.org/packages/limma/).
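
As a rough orientation, the main functions listed above can be chained as in the following minimal sketch. It assumes the `muscat` package is installed and uses its `example_sce` dataset, whose `cluster_id`, `sample_id` and `group_id` columns and the `group_idstim` coefficient name are properties of that example data rather than requirements of dreamlet. Argument defaults may differ between versions, so treat this as a starting point and consult the function reference pages.

```r
library(dreamlet)
library(muscat)               # assumed installed; provides example_sce
library(SingleCellExperiment)

# Small example dataset shipped with muscat
data(example_sce)

# Aggregate raw counts into pseudobulk for each sample and cell cluster
pb <- aggregateToPseudoBulk(example_sce,
  assay = "counts",
  cluster_id = "cluster_id",
  sample_id = "sample_id",
  verbose = FALSE
)

# Normalize aggregated counts and compute precision weights
res.proc <- processAssays(pb, ~group_id)

# Variance partitioning analysis
vp.lst <- fitVarPart(res.proc, ~group_id)

# Differential expression analysis within each cell cluster
res.dl <- dreamlet(res.proc, ~group_id)

# Extract top results for the stimulation effect
tab <- topTable(res.dl, coef = "group_idstim", number = 3)
print(tab)
```

The formula interface follows the conventions of variancePartition, so a random effect such as `(1 | batch)` (a hypothetical variable here) would be written in the same formula for a mixed model.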
## Install

`dreamlet >= 1.0.0` is compatible with `BioC v3.18` for `R v4.3`.

```r
# 1) Make sure Bioconductor is installed
if (!requireNamespace("BiocManager", quietly = TRUE)) {
  install.packages("BiocManager")
}

# 2) Install dreamlet and dependencies:
devtools::install_github("DiseaseNeurogenomics/dreamlet")

# 3) Install zellkonverter >= v1.10.1
BiocManager::install("zellkonverter")
```

`dreamlet` is also compatible with earlier versions of R and Bioconductor after installing these dependencies:

### Dependencies

In case these aren't installed automatically:

```r
devtools::install_github("DiseaseNeurogenomics/variancePartition")
devtools::install_github("DiseaseNeurogenomics/zenith")
```
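
After installation, a quick check along the following lines can confirm which of the packages named above are available and at what version. This is a convenience sketch using base R only; the package list and the variable names are illustrative, not part of dreamlet.

```r
# Packages mentioned in the install instructions above
pkgs <- c("dreamlet", "variancePartition", "zenith", "zellkonverter")

# TRUE for each package that can be loaded from the current library paths
is_installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)

# Installed version as a string, or NA if the package is missing
versions <- vapply(pkgs, function(p) {
  if (is_installed[[p]]) as.character(packageVersion(p)) else NA_character_
}, character(1))

print(data.frame(
  package = pkgs,
  installed = is_installed,
  version = versions,
  row.names = NULL
))

# The install notes ask for zellkonverter >= v1.10.1
if (is_installed[["zellkonverter"]] &&
  packageVersion("zellkonverter") < "1.10.1") {
  message("zellkonverter is older than v1.10.1; consider updating it")
}
```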

Owner

  • Name: Center for Disease Neurogenomics @ Mount Sinai
  • Login: DiseaseNeuroGenomics
  • Kind: organization
  • Location: United States of America

Open source software, documentation and applications for analysis of functional genomics datasets

GitHub Events

Total
  • Issues event: 1
  • Push event: 5
Last Year
  • Issues event: 1
  • Push event: 5

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 1
  • Total pull requests: 0
  • Average time to close issues: 11 months
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: 11 months
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • alexclarke7 (1)
  • tweagles1 (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 6,138 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 6
  • Total maintainers: 1
bioconductor.org: dreamlet

Scalable differential expression analysis of single cell transcriptomics datasets with complex study designs

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 6,138 Total
Rankings
Dependent repos count: 0.0%
Stargazers count: 17.7%
Forks count: 30.1%
Dependent packages count: 31.7%
Average: 35.4%
Downloads: 97.5%
Maintainers (1)