countsimQC

countsimQC - Compare characteristic features of count data sets

https://github.com/csoneson/countsimqc

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    2 of 7 committers (28.6%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.9%) to scientific vocabulary

Keywords from Contributors

bioconductor-package motif-analysis motif-enrichment-analysis sequence-logo open-science outlier-detection bioconductor bioinformatics rnaseq derfinder
Last synced: 7 months ago · JSON representation

Repository

countsimQC - Compare characteristic features of count data sets

Basic Info
Statistics
  • Stars: 28
  • Watchers: 3
  • Forks: 4
  • Open Issues: 0
  • Releases: 1
Created almost 9 years ago · Last pushed 9 months ago
Metadata Files
Readme Changelog

README.md

countsimQC

R build status

countsimQC is an R package that provides functionality to create a comprehensive report comparing many different characteristics across multiple count data sets. One important use case is comparing one or more synthetic (e.g., RNA-seq) count matrices to a real count matrix, possibly the one based on which the synthetic data sets were generated. However, any collection of one or more count matrices can be visualized and compared.

If you use countsimQC for your work, we appreciate if you cite the accompanying paper:

Installation

countsimQC can be installed from Bioconductor with the following commands. Note that R version >= 3.5 and Bioconductor version >= 3.8 are required in order to use the BiocManager package. If you have an older version of R (3.4), you can still install countsimQC v0.5.4 (see the Releases tab in the GitHub repository). Please see the NEWS file for differences between versions.

```

Install BiocManager if needed

if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager")

Install countsimQC

BiocManager::install("countsimQC") ```

Getting started

To run countsimQC and generate a report, you simply need to call the function countsimQCReport(), with an input consisting of a named list of DESeqDataSets (see the DESeq2 package for a description of this class). Each DESeqDataSet should correspond to one data set and contain a count matrix, a data frame with sample information and a design formula, which is needed for proper dispersion calculations. To generate a DESeqDataSet from a count matrix counts, a sample information data frame sample_df and a design formula formula (of the form ~ predictors), you can do as follows:

library(DESeq2) dds <- DESeqDataSetFromMatrix(countData = counts, colData = sample_df, design = formula) There are many other ways of generating valid DESeqDataSets, depending on in what form your counts are (e.g., reading directly from HTSeq output, or from a tximport output object (see the DESeq2 vignette).

countsimQC contains an small example list with subsets of three data sets: two synthetic ones and the real data set that was used to generate them. The following code generates a comparative report for these three data sets:

library(countsimQC) data(countsimExample) countsimQCReport(ddsList = countsimExample, outputFile = "countsimReport.html", outputDir = "./", description = "This is a comparison of three count data sets.")

For more detailed information about how to use the package, we refer to the vignette:

browseVignettes("countsimQC")

Example reports

Owner

  • Name: Charlotte Soneson
  • Login: csoneson
  • Kind: user

GitHub Events

Total
  • Issue comment event: 1
  • Push event: 12
  • Create event: 1
Last Year
  • Issue comment event: 1
  • Push event: 12
  • Create event: 1

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 201
  • Total Committers: 7
  • Avg Commits per committer: 28.714
  • Development Distribution Score (DDS): 0.149
Past Year
  • Commits: 6
  • Committers: 3
  • Avg Commits per committer: 2.0
  • Development Distribution Score (DDS): 0.667
Top Committers
Name Email Commits
Charlotte Soneson c****n@g****m 171
Nitesh Turaga n****a@g****m 14
J Wokaty j****y@s****u 10
vobencha v****a@g****m 2
A Wokaty a****y@s****u 2
Hervé Pagès h****b@g****m 1
Darío Hereñú m****a@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 2
  • Total pull requests: 5
  • Average time to close issues: 5 days
  • Average time to close pull requests: 25 minutes
  • Total issue authors: 2
  • Total pull request authors: 2
  • Average comments per issue: 6.0
  • Average comments per pull request: 0.6
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • ariannafebbo (1)
  • mikelove (1)
Pull Request Authors
  • csoneson (4)
  • kant (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 16,122 total
  • Total dependent packages: 1
  • Total dependent repositories: 0
  • Total versions: 5
  • Total maintainers: 1
bioconductor.org: countsimQC

Compare Characteristic Features of Count Data Sets

  • Versions: 5
  • Dependent Packages: 1
  • Dependent Repositories: 0
  • Downloads: 16,122 Total
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Average: 22.4%
Downloads: 67.2%
Maintainers (1)
Last synced: 8 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.5 depends
  • DESeq2 >= 1.16.0 imports
  • DT * imports
  • GenomeInfoDbData * imports
  • SummarizedExperiment * imports
  • caTools * imports
  • dplyr * imports
  • edgeR * imports
  • genefilter * imports
  • ggplot2 * imports
  • grDevices * imports
  • methods * imports
  • ragg * imports
  • randtests * imports
  • rmarkdown >= 2.5 imports
  • stats * imports
  • tidyr * imports
  • tools * imports
  • utils * imports
  • knitr * suggests
  • testthat * suggests