https://github.com/alleninstitute/scrattch
Single cell RNA-seq analysis for transcriptomic type characterization
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: pubmed.ncbi, ncbi.nlm.nih.gov, zenodo.org -
✓Committers with academic emails
3 of 6 committers (50.0%) from academic institutions -
✓Institutional organization owner
Organization alleninstitute has institutional domain (alleninstitute.org) -
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.5%) to scientific vocabulary
Keywords
Repository
Single cell RNA-seq analysis for transcriptomic type characterization
Basic Info
- Host: GitHub
- Owner: AllenInstitute
- License: other
- Language: HTML
- Default Branch: master
- Size: 97.9 MB
Statistics
- Stars: 37
- Watchers: 15
- Forks: 3
- Open Issues: 2
- Releases: 2
Topics
Metadata Files
README.md
scrattch
Single-cell RNA-seq analysis for transcriptomic type characterization
This is the umbrella package for the scrattch suite of R packages from the Allen Institute for Brain Science. It is modeled after the tidyverse package. You can use scrattch to automatically install or update some of the underlying packages and can run the remaining packages in docker environments.
Scrattch packages
Scrattch includes several packages for clustering, mapping, and data formatting and visualization, along with example data for demos. These include:
Data preparation: file formats and schema
scrattch.taxonomy- Taxonomy building scripts for RNA-seq based taxonomies following the Allen Institute (AIT) schema. A table of available AIT-formatted taxonomies can be found herescrattch.io- [deprecated]. Library for file handling and data formatting, replaced byscrattch.taxonomyin 2024
Data analysis: cell clustering and mapping (also called label transfer)
scrattch.hicat- Hierarchical, iterative clustering for analysis of transcriptomicsscrattch.bigcat- Clustering analysis for extremely large single cell datasetscrattch.mapping- Generalized mapping scripts for single cell RNA-seq, Patch-seq, spatial transcriptomics, or related data typesscrattch.patchseq- Functions for generating additional QC metrics and output files for patch-seq analysis
Data visualization
scrattch.vis- Plotting functions for visualization of single cell RNA-seq data
Example data: small RNA-seq data sets
tasic2016data- Data from Tasic, et al. (2016), which is used for demoshodge2019data- Data subset from Hodge, et al. (2019), which is used for demos- A table of actual AIT-formatted taxonomies can be found here
If you're interested in only one of these modules, you can install them separately. That said, we recommend using the installation instructions below to install combinations of scrattch packages to ensure they interact properly.
Related content
Several related websites and R and python libraries are outside of the scrattch suite, but are either used as part of scrattch libraries or directly work with scrattch outputs. These include (but are not limited to):
bmark- Standardized strategies for benchmarking clustering and mapping resultstranscriptomic_clustering- Python implementation of scrattch.hicat clusteringcell_type_mapper- Python implementation of hierarchical mapping algorithm used inscrattch.mappingandMapMyCellsACE- R Shiny and web-based app for comparison of annotations, including clustering and mapping resultsmfishtools- Functions for gene selection and analysis of spatial transcriptomics data
Installation
We strongly encourage the use of docker to install the scrattch suite. In particular, several functions in scrattch.taxonomy and scrattch.mapping have known issues in certain R environments. That said, we provide options for installing and running R in both a docker environment and through standard R approaches.
Using docker (RECOMMENDED)
The current docker version is accessible through Docker Hub. As of 26 March 2025 the Docker version is docker://alleninst/scrattch:1.1.2. This corresponds to AIT (v1.1.2) (see the Allen Institute Taxonomy GitHub respository for details).
Docker can be run on some HPC environments that use singularity as follows:
- Non-interactive:
singularity shell --cleanenv [Docker version] Rscript YOUR_CODE.R - Interactive:
singularity shell --cleanenv [Docker version] - To create a sif file for use in other environments:
singularity pull scrattch:[#.#.#].sif [Docker version]
If you cannot figure out how to use Docker in your specific environment, please post an issue.
--WARNING-- The 1.1.2 docker listed above provides all the tooling for AIT (v1.1.2) and some functionality for scrattch.mapping, but is broken for hierarchical mapping and all scrattch.patchseq functionality. An update mid-April will bring both of these packages back up to speed with the AIT schema / format.
Running scrattch in R
While we advise using the provided docker, you can install all scrattch packages along with their GitHub and BioConductor dependencies, as follows:
devtools::install_github("AllenInstitute/scrattch")
scrattch::install_scrattch()
Note that doMC may need to be installed manually from the download link at https://r-forge.r-project.org/R/?group_id=947 if you use Windows.
Installing previous versions
Two historical versions of scrattch are included in this package. These can be safely run without using docker, but are missing several recent components of the scrattch suite.
- scrattch_2023 is the stable version of the package prior to the release of
scrattch.mapping,scrattch.taxonomy,scrattch.patchseq, andhodge2019data - archive is the original package from ~2018, and should not be used for most folks
Should you need one of these previous versions, they can still be installed using:
devtools::install_github("AllenInstitute/scrattch", ref = "scrattch_2023") # -OR-
devtools::install_github("AllenInstitute/scrattch", ref = "archive")
Usage and contribution
Scrattch is under active development. Please reach out if you have any challenges or suggestions!
Documentation
There is no specific documentation for scrattch. However, each child package has extensive documentation available via help commands within R and/or through the corresponding GitHub page. Conversion of documentation to web-based versions is complete for scrattch.taxonomy, scrattch.mapping, and scrattch.patchseq.
License
The license for this package is available on Github at: https://github.com/AllenInstitute/scrattch/blob/master/LICENSE.
Contribution Agreement
If you contribute code to this repository through pull requests or other mechanisms, you are subject to the Allen Institute Contribution Agreement, which is available in full at: https://github.com/AllenInstitute/scrattch/blob/master/CONTRIBUTION
Level of Support
We frequently update the child packages of this tool with no fixed schedule. Community involvement is encouraged through both issues and pull requests. We encourage community involvement in child packages directly, rather than through the scrattch umbrella package, when appropriate.
Owner
- Name: Allen Institute
- Login: AllenInstitute
- Kind: organization
- Location: Seattle, WA
- Website: https://alleninstitute.org
- Repositories: 184
- Profile: https://github.com/AllenInstitute
Please visit http://alleninstitute.github.io/ for more information.
GitHub Events
Total
- Issues event: 2
- Watch event: 6
- Issue comment event: 2
- Push event: 20
- Pull request event: 2
Last Year
- Issues event: 2
- Watch event: 6
- Issue comment event: 2
- Push event: 20
- Pull request event: 2
Committers
Last synced: 10 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Lucas Gray | s****t@g****m | 142 |
| Jeremy Miller | j****m@a****g | 13 |
| Cindy van Velthoven | 4****t | 5 |
| Nelson Johansen | n****n@a****g | 4 |
| Adriana Sedeno-Cortes | a****s@a****g | 2 |
| yzizhen | 3****n | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 2
- Total pull requests: 7
- Average time to close issues: N/A
- Average time to close pull requests: 4 minutes
- Total issue authors: 2
- Total pull request authors: 3
- Average comments per issue: 1.0
- Average comments per pull request: 0.0
- Merged pull requests: 7
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 2
- Pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: 28 minutes
- Issue authors: 2
- Pull request authors: 1
- Average comments per issue: 1.0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- renniek (1)
- jeremymiller (1)
Pull Request Authors
- hypercompetent (4)
- cvanvelt (4)
- jeremymiller (2)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- BiocManager * depends
- devtools * depends
- dplyr * imports
- ggplot2 * imports