countsimQC
countsimQC - Compare characteristic features of count data sets
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
○Academic publication links
-
✓Committers with academic emails
2 of 7 committers (28.6%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.9%) to scientific vocabulary
Keywords from Contributors
Repository
countsimQC - Compare characteristic features of count data sets
Basic Info
- Host: GitHub
- Owner: csoneson
- Language: R
- Default Branch: devel
- Homepage: https://csoneson.github.io/countsimQC/index.html
- Size: 2.82 MB
Statistics
- Stars: 28
- Watchers: 3
- Forks: 4
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
countsimQC
countsimQC is an R package that provides functionality to create a
comprehensive report comparing many different characteristics across multiple
count data sets. One important use case is comparing one or more
synthetic (e.g., RNA-seq) count matrices to a real count matrix, possibly the
one based on which the synthetic data sets were generated. However, any
collection of one or more count matrices can be visualized and compared.
If you use countsimQC for your work, we appreciate if you cite the
accompanying paper:
- Soneson C and Robinson MD: Towards unified quality verification of synthetic count data with countsimQC. Bioinformatics 34(4):691-692 (2018).
Installation
countsimQC can be installed from
Bioconductor with the
following commands. Note that R version >= 3.5 and Bioconductor
version >= 3.8 are required in order to use the BiocManager package.
If you have an older version of R (3.4), you can still install
countsimQC v0.5.4 (see the Releases tab in the GitHub repository).
Please see the NEWS file for differences between versions.
```
Install BiocManager if needed
if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager")
Install countsimQC
BiocManager::install("countsimQC") ```
Getting started
To run countsimQC and generate a report, you simply need to call the
function countsimQCReport(), with an input consisting of a named list of
DESeqDataSets (see the
DESeq2
package for a description of this class). Each DESeqDataSet should
correspond to one data set and contain a count matrix, a data frame with sample
information and a design formula, which is needed for proper dispersion
calculations. To generate a DESeqDataSet from a count matrix counts, a
sample information data frame sample_df and a design formula formula
(of the form ~ predictors), you can do as follows:
library(DESeq2)
dds <- DESeqDataSetFromMatrix(countData = counts,
colData = sample_df,
design = formula)
There are many other ways of generating valid DESeqDataSets, depending on in
what form your counts are (e.g., reading directly from
HTSeq output, or from a tximport
output object (see the
DESeq2 vignette).
countsimQC contains an small example list with subsets of three data sets:
two synthetic ones and the real data set that was used to generate them. The
following code generates a comparative report for these three data sets:
library(countsimQC)
data(countsimExample)
countsimQCReport(ddsList = countsimExample,
outputFile = "countsimReport.html",
outputDir = "./",
description = "This is a comparison of three count data sets.")
For more detailed information about how to use the package, we refer to the vignette:
browseVignettes("countsimQC")
Example reports
- Comparison of 16S microbiome species count matrices for four body subsites from the Human Microbiome Project
- Comparison of three real bulk RNA-seq data sets
- Comparison of gene- and transcript-level count matrices for a single-cell RNA-seq data set
- Comparison of four real scRNA-seq data sets
- Comparison of two simulated scRNA-seq data sets to the underlying real data set
- Comparison of six simulated bulk RNA-seq data set with different number of genes
Owner
- Name: Charlotte Soneson
- Login: csoneson
- Kind: user
- Website: http://csoneson.github.io/
- Twitter: CSoneson
- Repositories: 110
- Profile: https://github.com/csoneson
GitHub Events
Total
- Issue comment event: 1
- Push event: 12
- Create event: 1
Last Year
- Issue comment event: 1
- Push event: 12
- Create event: 1
Committers
Last synced: 10 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Charlotte Soneson | c****n@g****m | 171 |
| Nitesh Turaga | n****a@g****m | 14 |
| J Wokaty | j****y@s****u | 10 |
| vobencha | v****a@g****m | 2 |
| A Wokaty | a****y@s****u | 2 |
| Hervé Pagès | h****b@g****m | 1 |
| Darío Hereñú | m****a@g****m | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 2
- Total pull requests: 5
- Average time to close issues: 5 days
- Average time to close pull requests: 25 minutes
- Total issue authors: 2
- Total pull request authors: 2
- Average comments per issue: 6.0
- Average comments per pull request: 0.6
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- ariannafebbo (1)
- mikelove (1)
Pull Request Authors
- csoneson (4)
- kant (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- bioconductor 16,122 total
- Total dependent packages: 1
- Total dependent repositories: 0
- Total versions: 5
- Total maintainers: 1
bioconductor.org: countsimQC
Compare Characteristic Features of Count Data Sets
- Homepage: https://github.com/csoneson/countsimQC
- Documentation: https://bioconductor.org/packages/release/bioc/vignettes/countsimQC/inst/doc/countsimQC.pdf
- License: GPL (>=2)
-
Latest release: 1.26.0
published 11 months ago
Rankings
Maintainers (1)
Dependencies
- R >= 3.5 depends
- DESeq2 >= 1.16.0 imports
- DT * imports
- GenomeInfoDbData * imports
- SummarizedExperiment * imports
- caTools * imports
- dplyr * imports
- edgeR * imports
- genefilter * imports
- ggplot2 * imports
- grDevices * imports
- methods * imports
- ragg * imports
- randtests * imports
- rmarkdown >= 2.5 imports
- stats * imports
- tidyr * imports
- tools * imports
- utils * imports
- knitr * suggests
- testthat * suggests