https://github.com/chelauk/nf-core-umi_preprocessing

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 7 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.7%) to scientific vocabulary

Last synced: 9 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: chelauk
Language: Nextflow
Default Branch: master
Size: 2.57 MB

Statistics

Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Releases: 0

Created over 5 years ago · Last pushed about 4 years ago

Metadata Files

Readme

README.md

An open-source pipeline to prepare umi fastqs for variant detection.

Introduction

nf-core/umipreprocessing is a bioinformatics best-practise analysis pipeline for preparing bam files for variant calling from umi fastqs

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.

Quick Start

Install nextflow
Install any of Docker, Singularity or Podman for full pipeline reproducibility (please only use Conda as a last resort; see docs)
Download the pipeline and test it on a minimal dataset with a single command:

bash nextflow run nf-core/umipreprocessing -profile test,<docker/singularity/podman/conda/institute>

Please check nf-core/configs to see if a custom config file to run nf-core pipelines already exists for your Institute. If so, you can simply use -profile <institute> in your command. This will enable either docker or singularity and set the appropriate execution settings for your local compute environment.
Start running your own analysis!

bash nextflow run nf-core/umipreprocessing -profile <docker/singularity/podman/conda/institute> --input 'input.tsv' --genome GRCh37

See usage docs for all of the available options when running the pipeline.

Pipeline Summary

By default, the pipeline currently performs the following:

Sequencing quality control (FastQC)
Overall pipeline run summaries (MultiQC)

Documentation

The nf-core/umipreprocessing pipeline comes with documentation about the pipeline: usage and output.

Credits

nf-core/umipreprocessing was originally written by Chela James.

We thank the following people for their extensive assistance in the development of this pipeline:

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

For further information or help, don't hesitate to get in touch on the Slack #umipreprocessing channel (you can join with this invite).

Citations

You can cite the nf-core publication as follows:

The nf-core framework for community-curated bioinformatics pipelines.

Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.

Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x. ReadCube: Full Access Link

In addition, references of tools and data used in this pipeline are as follows:

Owner

Name: Chela James
Login: chelauk
Kind: user
Location: Milan
Company: Fondazione Human Technopole

Website: http://www.sottorivalab.org/
Twitter: chelauk
Repositories: 61
Profile: https://github.com/chelauk

Senior Bioinformatician Fondazione Human Technopole

GitHub Events

Total

Last Year

Dependencies

environment.yml conda

bwa 0.7.17.*
fastqc 0.11.9.*
fgbio 1.3.0.*
gatk4-spark 4.1.7.0.*
htslib 1.10.*
markdown 3.1.1.*
multiqc 1.9.*
picard 2.23.8.*
pigz 2.3.4.*
pygments 2.5.2.*
pymdown-extensions 6.0.*
r-ggplot2 3.3.0.*
samtools 1.10.*
zlib 1.2.11.*

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science