tximeta

Transcript quantification import with automatic metadata detection

https://github.com/thelovelab/tximeta

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    1 of 7 committers (14.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.9%) to scientific vocabulary

Keywords from Contributors

bioconductor-package bioconductor rnaseq deseq2 mass-spectrometry bioinformatics proteomics annotation-agnostic count exon
Last synced: 6 months ago

Repository

Statistics
  • Stars: 67
  • Watchers: 12
  • Forks: 11
  • Open Issues: 10
  • Releases: 0
Created over 8 years ago · Last pushed 6 months ago
Metadata Files
  • Readme
  • Changelog
  • Code of conduct

README.md

tximeta

Automatic metadata for RNA-seq

tximeta provides a set of functions for conveniently working with metadata for transcript quantification data in Bioconductor. The tximeta() function imports quantification data from Salmon or other quantifiers, and returns a SummarizedExperiment object. tximeta works natively with Salmon, alevin, or piscem-infer, but can easily be configured to work with any transcript quantification tool.

If tximeta() recognizes the reference transcripts used for quantification, it will automatically download relevant information about the location of the transcripts in the correct genome. These actions happen in the background without requiring any extra effort or information from the user.

This metadata is attached to the SummarizedExperiment in the metadata() and rowRanges() slots.

For a list of the reference transcriptomes supported by tximeta(), see the "Pre-computed digests" section of the vignette in the Get started tab. We call the computed identifier for the reference transcriptome a "digest" or sometimes a "checksum".

tximeta also facilitates further steps, such as summarizeToGene() and addIds(), as well as retrieveCDNA(), which retrieves the transcripts used for quantification, and retrieveDb(), which retrieves the correct TxDb or EnsDb to match the quantification data.

How it works

The key idea behind tximeta is that Salmon, alevin, and piscem-infer propagate a hash value summarizing the reference transcripts into each quantification directory they output. tximeta can be used with other tools as long as the hash of the transcripts is also included in their output directories. See the customMetaInfo argument of tximeta() for more details.
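The lookup described above can be illustrated with a minimal Python sketch. It is not tximeta's implementation (tximeta is an R package); it only shows the idea of reading a stored digest from a quantification directory and matching it against a table of known reference transcriptomes. The sketch assumes a Salmon-style layout in which the digest sits under the `index_seq_hash` key of `aux_info/meta_info.json`. The table `KNOWN_DIGESTS`, its single entry, and the function names are hypothetical and exist only for illustration.

```python
import json
from pathlib import Path
from typing import Optional

# Hypothetical lookup table: digest -> description of a reference transcriptome.
# The digest string and the metadata values below are made up for illustration.
KNOWN_DIGESTS = {
    "0123abcd": {
        "source": "ExampleSource",
        "organism": "Example organism",
        "release": "1",
    },
}


def read_index_digest(quant_dir: Path) -> Optional[str]:
    """Return the reference-transcript digest stored in a quantification directory.

    Assumption: the digest is stored under the "index_seq_hash" key of
    aux_info/meta_info.json (Salmon-style layout). Returns None if the
    file or the key is missing.
    """
    meta_path = quant_dir / "aux_info" / "meta_info.json"
    if not meta_path.is_file():
        return None
    with meta_path.open() as handle:
        meta = json.load(handle)
    return meta.get("index_seq_hash")


def identify_reference(quant_dir: Path) -> Optional[dict]:
    """Match the stored digest against the table of known transcriptomes.

    Returns the matching metadata record, or None when the digest is
    missing or not recognized.
    """
    digest = read_index_digest(quant_dir)
    if digest is None:
        return None
    return KNOWN_DIGESTS.get(digest)
```

Calling `identify_reference(Path("quant/sample1"))` on a directory whose stored digest appears in the table returns the matching record; an unrecognized or missing digest yields `None`, which corresponds to the case where the reference transcripts are not recognized and no metadata can be attached automatically.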

Reference

A reference for the tximeta package is:

Michael I. Love, Charlotte Soneson, Peter F. Hickey, Lisa K. Johnson, N. Tessa Pierce, Lori Shepherd, Martin Morgan, Rob Patro. "Tximeta: reference sequence checksums for provenance identification in RNA-seq" PLOS Computational Biology (2020) doi: 10.1371/journal.pcbi.1007664

Feedback

We would love to hear your feedback. Please post to the Bioconductor support site for software usage help, or open an Issue on GitHub for software development questions.

Funding

tximeta was developed as part of NIH NHGRI R01-HG009937.

tximeta was also supported by the Chan Zuckerberg Initiative as part of the EOSS grants.

Owner

  • Name: The Love Lab
  • Login: thelovelab
  • Kind: organization
  • Email: michaelisaiahlove [at] gmail
  • Location: United States of America

Software produced by members of the Love Lab (UNC-Chapel Hill)

GitHub Events

Total
  • Issues event: 9
  • Watch event: 1
  • Issue comment event: 13
  • Push event: 20
  • Create event: 1
Last Year
  • Issues event: 9
  • Watch event: 1
  • Issue comment event: 13
  • Push event: 20
  • Create event: 1

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 302
  • Total Committers: 7
  • Avg Commits per committer: 43.143
  • Development Distribution Score (DDS): 0.096
Past Year
  • Commits: 26
  • Committers: 4
  • Avg Commits per committer: 6.5
  • Development Distribution Score (DDS): 0.231
Top Committers
Name                 Email            Commits
mikelove             m****e@g****m    273
Nitesh Turaga        n****a@g****m    14
Charlotte Soneson    c****n@g****m    7
J Wokaty             j****y@s****u    2
vobencha             v****a@g****m    2
Mike Love            m****e           2
J Wokaty             j****y           2

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 80
  • Total pull requests: 7
  • Average time to close issues: 4 months
  • Average time to close pull requests: about 3 hours
  • Total issue authors: 49
  • Total pull request authors: 1
  • Average comments per issue: 4.24
  • Average comments per pull request: 1.71
  • Merged pull requests: 7
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 5
  • Pull requests: 0
  • Average time to close issues: 2 months
  • Average time to close pull requests: N/A
  • Issue authors: 5
  • Pull request authors: 0
  • Average comments per issue: 1.4
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • mikelove (23)
  • kvittingseerup (4)
  • csoneson (4)
  • matmu (3)
  • karlaarz (2)
  • AdeleMangelinck (1)
  • danilotat (1)
  • JudithR (1)
  • lmweber (1)
  • ttriche (1)
  • horyunchung (1)
  • k3yavi (1)
  • csgroen (1)
  • herrroaa84 (1)
  • Knievl (1)
Pull Request Authors
  • csoneson (7)
Top Labels
Issue Labels
  • enhancement (10)
  • bug (1)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 132,107 total
  • Total dependent packages: 3
  • Total dependent repositories: 0
  • Total versions: 10
  • Total maintainers: 1
bioconductor.org: tximeta

Transcript Quantification Import with Automatic Metadata

  • Versions: 10
  • Dependent Packages: 3
  • Dependent Repositories: 0
  • Downloads: 132,107 Total
Rankings
Dependent repos count: 0.0%
Stargazers count: 2.8%
Forks count: 5.5%
Average: 6.0%
Downloads: 10.6%
Dependent packages count: 11.0%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • AnnotationDbi * imports
  • AnnotationHub * imports
  • BiocFileCache * imports
  • Biostrings * imports
  • GenomeInfoDb * imports
  • GenomicFeatures * imports
  • GenomicRanges * imports
  • IRanges * imports
  • Matrix * imports
  • S4Vectors * imports
  • SummarizedExperiment * imports
  • ensembldb * imports
  • jsonlite * imports
  • methods * imports
  • tibble * imports
  • tools * imports
  • tximport * imports
  • utils * imports
  • DESeq2 * suggests
  • devtools * suggests
  • edgeR * suggests
  • fishpond * suggests
  • knitr * suggests
  • limma * suggests
  • org.Dm.eg.db * suggests
  • rmarkdown * suggests
  • testthat * suggests
  • tximportData * suggests