https://github.com/chelauk/nf-core-umi_preprocessing
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 7 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: chelauk
- Language: Nextflow
- Default Branch: master
- Size: 2.57 MB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
An open-source pipeline to prepare umi fastqs for variant detection.
Introduction
nf-core/umipreprocessing is a bioinformatics best-practise analysis pipeline for preparing bam files for variant calling from umi fastqs
The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.
Quick Start
Install
nextflowInstall any of
Docker,SingularityorPodmanfor full pipeline reproducibility (please only useCondaas a last resort; see docs)Download the pipeline and test it on a minimal dataset with a single command:
bash nextflow run nf-core/umipreprocessing -profile test,<docker/singularity/podman/conda/institute>Please check nf-core/configs to see if a custom config file to run nf-core pipelines already exists for your Institute. If so, you can simply use
-profile <institute>in your command. This will enable eitherdockerorsingularityand set the appropriate execution settings for your local compute environment.Start running your own analysis!
bash nextflow run nf-core/umipreprocessing -profile <docker/singularity/podman/conda/institute> --input 'input.tsv' --genome GRCh37
See usage docs for all of the available options when running the pipeline.
Pipeline Summary
By default, the pipeline currently performs the following:
- Sequencing quality control (
FastQC) - Overall pipeline run summaries (
MultiQC)
Documentation
The nf-core/umipreprocessing pipeline comes with documentation about the pipeline: usage and output.
Credits
nf-core/umipreprocessing was originally written by Chela James.
We thank the following people for their extensive assistance in the development of this pipeline:
Contributions and Support
If you would like to contribute to this pipeline, please see the contributing guidelines.
For further information or help, don't hesitate to get in touch on the Slack #umipreprocessing channel (you can join with this invite).
Citations
You can cite the nf-core publication as follows:
The nf-core framework for community-curated bioinformatics pipelines.
Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.
Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x. ReadCube: Full Access Link
In addition, references of tools and data used in this pipeline are as follows:
Owner
- Name: Chela James
- Login: chelauk
- Kind: user
- Location: Milan
- Company: Fondazione Human Technopole
- Website: http://www.sottorivalab.org/
- Twitter: chelauk
- Repositories: 61
- Profile: https://github.com/chelauk
Senior Bioinformatician Fondazione Human Technopole
GitHub Events
Total
Last Year
Dependencies
- bwa 0.7.17.*
- fastqc 0.11.9.*
- fgbio 1.3.0.*
- gatk4-spark 4.1.7.0.*
- htslib 1.10.*
- markdown 3.1.1.*
- multiqc 1.9.*
- picard 2.23.8.*
- pigz 2.3.4.*
- pygments 2.5.2.*
- pymdown-extensions 6.0.*
- r-ggplot2 3.3.0.*
- samtools 1.10.*
- zlib 1.2.11.*
