https://github.com/alexslemonade/scpca-nf
scpca-nf is the Nextflow workflow for processing Single-cell Pediatric Cancer Atlas Portal data
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.3%) to scientific vocabulary
Keywords
Repository
scpca-nf is the Nextflow workflow for processing Single-cell Pediatric Cancer Atlas Portal data
Basic Info
Statistics
- Stars: 15
- Watchers: 5
- Forks: 2
- Open Issues: 54
- Releases: 45
Topics
Metadata Files
README.md
scpca-nf
This repository holds a Nextflow workflow (scpca-nf) that is used to process 10X single-cell data as part of the Single-cell Pediatric Cancer Atlas (ScPCA) project.
All dependencies for the workflow outside of the Nextflow workflow engine itself are handled automatically; setup generally requires only organizing the input files and configuring Nextflow for your computing environment.
Nextflow will also handle parallelizing sample processing as allowed by your environment, minimizing total run time.
The workflow processes fastq files from single-cell and single-nuclei RNA-seq samples using alevin-fry to create gene by cell matrices.
The workflow outputs gene expression data in two formats: as SingleCellExperiment objects and as AnnData objects.
Reads from samples are aligned using selective alignment, to an index with transcripts corresponding to spliced cDNA and to intronic regions, denoted by alevin-fry as splici.
These matrices are filtered and additional processing is performed to calculate quality control statistics, create reduced-dimension transformations, assign cell types using both SingleR and CellAssign, and create output reports.
scpca-nf can also process libraries with ADT tags (e.g., CITE-seq), multiplexed libraries (e.g., cell hashing), bulk RNA-seq, and spatial transcriptomics samples.
For more information on the contents of the output files and the processing of all modalities, please see the ScPCA Portal docs.

Using scpca-nf to process your samples
The default configuration of the scpca-nf workflow is currently set up to process samples as part of the ScPCA portal and requires access to AWS through the Data Lab.
For all other users, scpca-nf can be set up for your computing environment with a few configuration files.
| Instructions for using scpca-nf |
| ------------------------------------------------------------- |
:warning: Please note that processing single-cell and single-nuclei RNA-seq samples requires access to a high performance computing (HPC) environment with nodes that can accommodate jobs requiring up to 24 GB of RAM and 12 CPUs.
To run scpca-nf on your own samples, you will need to complete the following steps:
- Organize your files so that each folder contains fastq files relevant to a single sequencing run.
- Prepare a run metadata file with one row per library containing all information needed to process your samples.
- Prepare a sample metadata file with one row per sample containing any relevant metadata about each sample (e.g., diagnosis, age, sex, cell line).
- Set up a configuration file, including the definition of a profile, dictating where Nextflow should execute the workflow.
You may also test your configuration file using example data.
For ALSF Data Lab users, please refer to the internal instructions for how to run the workflow on our systems.
Owner
- Name: Alex's Lemonade Stand Foundation
- Login: AlexsLemonade
- Kind: organization
- Website: https://www.alexslemonade.org
- Repositories: 70
- Profile: https://github.com/AlexsLemonade
Childhood Cancer Data Lab of ALSF
Issues and Pull Requests
Last synced: 5 months ago
All Time
- Total issues: 147
- Total pull requests: 169
- Average time to close issues: about 2 months
- Average time to close pull requests: 3 days
- Total issue authors: 3
- Total pull request authors: 6
- Average comments per issue: 1.36
- Average comments per pull request: 1.01
- Merged pull requests: 132
- Bot issues: 0
- Bot pull requests: 8
Past Year
- Issues: 82
- Pull requests: 75
- Average time to close issues: 9 days
- Average time to close pull requests: 3 days
- Issue authors: 3
- Pull request authors: 5
- Average comments per issue: 0.54
- Average comments per pull request: 0.57
- Merged pull requests: 55
- Bot issues: 0
- Bot pull requests: 3
Top Authors
Issue Authors
- allyhawkins (57)
- jashapiro (54)
- sjspielman (36)
Pull Request Authors
- allyhawkins (76)
- jashapiro (48)
- sjspielman (35)
- pre-commit-ci[bot] (8)
- arkid15r (1)
- jaclyn-taroni (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- actions/checkout v3 composite
- docker://nextflow/nextflow 21.10.6 composite
- actions/checkout v3 composite
- actions/upload-artifact v3 composite
- docker://nextflow/nextflow 21.10.6 composite
- actions/checkout v3 composite
- actions/upload-artifact v3 composite