https://github.com/cellgeni/nf-reprocessing-public-10x
Nextflow pipeline of Alex's reprocessing scripts
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.1%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
Nextflow pipeline of Alex's reprocessing scripts
Basic Info
- Host: GitHub
- Owner: cellgeni
- License: gpl-3.0
- Language: Shell
- Default Branch: main
- Size: 118 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 4
Created about 3 years ago
· Last pushed 10 months ago
Metadata Files
Readme
License
README.md
nf-reprocessing-public-10x
Nextflow pipeline of our reprocessing pipeline.
Contents of Repo:
main.nf- the Nextflow pipeline that executes reprocessing.nextflow.config- the configuration script that allows the processes to be submitted to IBM LSF on Sanger's HPC and ensures correct environment is set via singularity container (this is an absolute path). Global default parameters are also set in this file and some contain absolute paths.examples/samples.list- example of the expected samplefile containing series IDs and optionally sample IDs.examples/RESUME- an example run script that executes the pipeline it has 1 hardcoded argument:/path/to/sample.listwhich you will need to change on your installation.bin- a directory containing various scripts the pipeline uses to download project data and realign the data with STARsolo.Dockerfile- a dockerfile to reproduce the environment for step3, step5 has its own container that has its Dockerfile located here
Pipeline Arguments:
--samplefile- The path to the sample file provided to the pipeline which contains one sample ID per line. This sample is assumed to have CRAM files stored on IRODS.--outdir- The path to where the results will be saved.--run_starsolo- Tells pipeline whether to realign data with STARsolo or not--keep_bams- Tells the pipeline whether to generate BAM files (default false means do not generate).--sort_bam_mem- Input memory (IN BYTES) for starsolo to use for sorting BAM files if BAM files are kept (default 60GB = 60000000000B).
Owner
- Name: Cellular Genetics Informatics
- Login: cellgeni
- Kind: organization
- Location: United Kingdom
- Website: https://www.sanger.ac.uk/science/groups/cellular-genetics-informatics
- Repositories: 19
- Profile: https://github.com/cellgeni
Wellcome Sanger Institute
GitHub Events
Total
- Push event: 15
- Create event: 2
Last Year
- Push event: 15
- Create event: 2