baitcapture

A Nextflow workflow for processing data obtained from targeted resistome bait-capture sequencing

https://github.com/olc-bioinformatics/baitcapture

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓ CITATION.cff file: Found CITATION.cff file
✓ codemeta.json file: Found codemeta.json file
○ .zenodo.json file
✓ DOI references: Found 10 DOI reference(s) in README
✓ Academic publication links: Links to zenodo.org
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (14.4%) to scientific vocabulary

Last synced: 10 months ago

Repository

Basic Info

Host: GitHub
Owner: OLC-Bioinformatics
License: mit
Language: Nextflow
Default Branch: main
Size: 29.4 MB

Statistics

Stars: 2
Watchers: 1
Forks: 0
Open Issues: 1
Releases: 2

Created almost 3 years ago · Last pushed 11 months ago

Metadata Files

Readme Changelog Contributing License Citation

BaitCapture

BaitCapture is a bioinformatics workflow designed for processing sequencing data obtained from targeted resistome bait-capture sequencing, built using Nextflow.

Introduction
Quick start
Pipeline summary
Usage
Output
Testing the workflow
Running the workflow on high-performance compute clusters
- Waffles
Advanced usage
Contributions and support
Citations

Introduction

BaitCapture is based on a Bash script workflow originally created by Shay et al. (2023). Although it was designed with bait-capture sequencing data in mind, BaitCapture can be used for any paired-end Illumina sequencing dataset in which many sequence reads must be aligned to a reference database of gene targets.

BaitCapture offers the following features:

- Quality control: Assess the quality of raw and pre-processed sequence data.
- Pre-processing:
  - Read decontamination using a host reference genome
  - Quality-based trimming
  - Adapter removal
- Read alignment: Align reads against a reference database of gene targets using KMA, BWA-MEM2, or BWA.
- Alignment reports:
  - mapstats.tsv: A table of read alignment statistics against each gene target for each sample, including KMA-specific alignment statistics.
  - sumstats.tsv: A table of summary statistics for each sample, including the on-target alignment rate and the number of reads lost from host decontamination and filtering.
  - presence_absence.tsv: A table of presence-absence calls for each gene target in each sample, based upon user-defined thresholds.
  - presence_absence_clusters.tsv: A table of presence-absence calls for each gene target cluster in each sample, with clusters defined by a target metadata file (e.g. resistance mechanism).

Quick start

BaitCapture terminal demo

Pipeline summary

The steps of the workflow are:

1. Report the quality of the raw sequence data using FastQC.
2. (Optional) Trim the raw sequence reads using fastp.
3. (Optional) Decontaminate the trimmed sequence reads using a host reference genome with BWA-MEM2.
4. Report the quality of the pre-processed sequence data using FastQC.
5. Obtain total read and bp counts from the raw and pre-processed sequence data using fastq-scan.
6. Align trimmed and/or decontaminated reads against the database of gene targets using KMA, BWA-MEM2, or BWA.
7. Obtain sequence coverage and depth statistics from the alignments using AlignCov, Mosdepth, and SAMtools.
8. Create a MultiQC report and other summary reports with custom scripts.
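
To make the coverage and depth statistics mentioned in the steps more concrete, the following is a minimal Python sketch of how such values could be derived from per-base read depths for a single target. It is not the code used by AlignCov, Mosdepth, or SAMtools; the function name `coverage_stats`, the dictionary keys, and the exact definitions (covered length as positions with depth above zero, fold-coverage as mean depth) are assumptions made for illustration.

```python
def coverage_stats(depths):
    """Summarise per-base read depths for one gene target.

    depths: list of integers, one read depth per base of the target.
    All key names below are hypothetical, chosen for illustration only.
    """
    seq_len = len(depths)
    if seq_len == 0:
        raise ValueError("target has no positions")
    # Number of bases covered by at least one read (assumed definition).
    len_cov = sum(1 for depth in depths if depth > 0)
    return {
        "seq_len": seq_len,
        "len_cov": len_cov,
        # Proportion of the target length that is covered.
        "prop_cov": len_cov / seq_len,
        # Fold-coverage, taken here as the mean depth across the target.
        "fold_cov": sum(depths) / seq_len,
    }
```

For a four-base toy target with depths `[0, 2, 2, 4]`, three bases are covered, so the covered proportion is 0.75 and the mean depth is 2.0.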

Usage

If you are new to Nextflow and the nf-core framework, please refer to this page on how to set up Nextflow.

Please provide pipeline parameters via the CLI or Nextflow -params-file option. Custom config files including those provided by the -c Nextflow option can be used to provide any configuration except for parameters; see docs.
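
As a minimal sketch, a parameters file for the `-params-file` option could be written with a few lines of Python. Nextflow accepts parameter files in JSON or YAML; JSON is used here because it needs only the standard library. The parameter names (`input`, `targets`, `outdir`) come from the commands in this README, while the helper `write_params` and the file name `params.json` are hypothetical.

```python
import json
from pathlib import Path


def write_params(path, params):
    """Write pipeline parameters to a JSON file for use with -params-file.

    Keys are parameter names without the leading "--".
    """
    path = Path(path)
    path.write_text(json.dumps(params, indent=2) + "\n")
    return path
```

Calling `write_params("params.json", {"input": "samplesheet.csv", "targets": "targets.fa", "outdir": "results"})` would produce a file that could then be passed as `-params-file params.json` in place of the corresponding `--input`, `--targets`, and `--outdir` options.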

BaitCapture can be run using two different input types:

A samplesheet, including sample names and paths to paired-end gzipped FASTQ files, or
A folder containing paired-end gzipped FASTQ files.

Additionally, a target metadata file can optionally be provided to generate a presence_absence_clusters.tsv file.

Input type: Samplesheet

First, prepare a samplesheet with your input data that looks as follows:

samplesheet.csv:

    sample,fastq_1,fastq_2
    CONTROL_REP1,AEG588A1_S1_L002_R1_001.fastq.gz,AEG588A1_S1_L002_R2_001.fastq.gz

Each row represents one sample and its pair of gzipped FASTQ files (forward reads in fastq_1, reverse reads in fastq_2).
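
For larger projects it may be easier to generate the samplesheet than to type it. The sketch below is one possible way to do so in Python, assuming the three-column layout shown above; the function name `make_samplesheet` and the check that file names end in `.gz` are illustrative assumptions, not part of BaitCapture.

```python
import csv
import io


def make_samplesheet(samples):
    """Return samplesheet CSV text.

    samples: dict mapping a sample name to a (fastq_1, fastq_2) tuple.
    """
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["sample", "fastq_1", "fastq_2"])
    for sample, (fastq_1, fastq_2) in samples.items():
        for path in (fastq_1, fastq_2):
            # The workflow expects gzipped FASTQ files.
            if not path.endswith(".gz"):
                raise ValueError(f"not a gzipped file: {path}")
        writer.writerow([sample, fastq_1, fastq_2])
    return out.getvalue()
```

The returned text can be written to `samplesheet.csv` and passed to `--input`.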

Now, you can run the pipeline using:

    nextflow run OLC-Bioinformatics/BaitCapture \
      -profile <docker/singularity/.../institute> \
      --input samplesheet.csv \
      --targets targets.fa \
      --outdir <OUTDIR>

Input type: Folder

Instead of a samplesheet, the user can provide a path to a directory containing gzipped FASTQ files. In this case, the sample name will be the name of the file up until the first underscore (_).
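
The naming rule can be illustrated with a short Python sketch. This is not the workflow's own implementation; the function name `sample_name` is hypothetical and only mirrors the rule as stated, taking everything in the file name before the first underscore.

```python
from pathlib import PurePosixPath


def sample_name(fastq_path):
    """Return the file name up to, but not including, the first underscore."""
    file_name = PurePosixPath(fastq_path).name
    return file_name.split("_", 1)[0]
```

One consequence of this rule is that sample identifiers containing underscores are truncated: a hypothetical file `plate_1_R1_001.fastq.gz` would yield the sample name `plate`.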

Case #1: Default file name pattern

The default file naming pattern that --input_folder searches for is "/*_R{1,2}_001.fastq.gz". For example, for a folder data/ containing sequencing files that looks as follows:

    data
    ├── ERR9958133_R1_001.fastq.gz
    ├── ERR9958133_R2_001.fastq.gz
    ├── ERR9958134_R1_001.fastq.gz
    └── ERR9958134_R2_001.fastq.gz

The workflow can be run simply by using:

    nextflow run OLC-Bioinformatics/BaitCapture \
      -profile <docker/singularity/.../institute> \
      --input_folder data/ \
      --targets targets.fa \
      --outdir <OUTDIR>

And the sample names will be:

- ERR9958133
- ERR9958134

Case #2: Alternate file name pattern

If the names of the gzipped FASTQ files do not end with _R{1,2}_001.fastq.gz, an alternate sequencing file pattern must be specified using --pattern. For example, for a folder more-data/ that looks as follows:

    more-data
    ├── SAMN000214_R1.fastq.gz
    ├── SAMN000214_R2.fastq.gz
    ├── SAMN000215_R1.fastq.gz
    └── SAMN000215_R2.fastq.gz

The workflow can be run using:

    nextflow run OLC-Bioinformatics/BaitCapture \
      -profile <docker/singularity/.../institute> \
      --input_folder more-data/ \
      --pattern "/*_R{1,2}.fastq.gz" \
      --targets targets.fa \
      --outdir <OUTDIR>

And the sample names will be:

- SAMN000214
- SAMN000215

> [!NOTE]
> When providing an argument to --pattern, the string must be enclosed in double quotes ("") and must begin with a forward slash (/).

(Optional) Input type: Target metadata

The path to a target metadata file can optionally be provided with --target_metadata to generate a presence_absence_clusters.tsv file, which describes the presence or absence of clusters of gene targets (e.g. AMR resistance mechanism, target drug class, replicon type). The requirements for this file are:

A comma-separated value (CSV) file with a header row, with one column header named target.
At least one column in addition to the target column, which describes a categorical variable to cluster the targets by.
Each value in the target column must exactly match the first word of a sequence header in the --targets FASTA database.
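
The three requirements can be checked before launching a run. The following Python sketch is an illustration only and is not the validation code shipped with BaitCapture; the function names `fasta_ids` and `check_target_metadata` and the wording of the messages are hypothetical.

```python
import csv
import io


def fasta_ids(fasta_text):
    """Return the first word of every FASTA header line (without '>')."""
    ids = set()
    for line in fasta_text.splitlines():
        words = line[1:].split() if line.startswith(">") else []
        if words:
            ids.add(words[0])
    return ids


def check_target_metadata(csv_text, fasta_text):
    """Return a list of problems; an empty list means all checks passed."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = reader.fieldnames or []
    if "target" not in columns:
        return ["missing 'target' column"]
    problems = []
    if len(columns) < 2:
        problems.append("need at least one column besides 'target'")
    known = fasta_ids(fasta_text)
    for row in reader:
        # Each target must exactly match the first word of a FASTA header.
        if row["target"] not in known:
            problems.append(f"target not in FASTA: {row['target']}")
    return problems
```

Because the match is exact, a metadata entry such as `CMY-2` would be reported as missing if the corresponding FASTA header begins with a longer identifier.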

For example, if we provide a --targets database containing two sequences:

```fasta
>gb|X91840.1|+|1-1146|CMY-2 [Klebsiella pneumoniae]
ATGATGAAAAAATCGTTATGCTGCGCTCTGCTGCTGACAGCCTCTTTCTCCACATTTGCTGCCGCAAAAACAGAACAACAGATTGCCGAT
ATCGTTAATCGCACCATCACCCCGTTGATGCAGGAGCAGGCTATTCCGGGTATGGCCGTTGCCGTTATCTACCAGGGAAAACCCTATTAT
TTCACCTGGGGTAAAGCCGATATCGCCAATAACCACCCAGTCACGCAGCAAACGCTGTTTGAGCTAGGATCGGTTAGTAAGACGTTTAAC
GGCGTGTTGGGCGGCGATGCTATCGCCCGCGGCGAAATTAAGCTCAGCGATCCGGTCACGAAATACTGGCCAGAACTGACAGGCAAACAG
TGGCAGGGTATCCGCCTGCTGCACTTAGCCACCTATACGGCAGGCGGCCTACCGCTGCAGATCCCCGATGACGTTAGGGATAAAGCCGCA
TTACTGCATTTTTATCAAAACTGGCAGCCGCAATGGACTCCGGGCGCTAAGCGACTTTACGCTAACTCCAGCATTGGTCTGTTTGGCGCG
CTGGCGGTGAAACCCTCAGGAATGAGTTACGAAGAGGCAATGACCAGACGCGTCCTGCAACCATTAAAACTGGCGCATACCTGGATTACG
GTTCCGCAGAACGAACAAAAAGATTATGCCTGGGGCTATCGCGAAGGGAAGCCCGTACACGTTTCTCCGGGACAACTTGACGCCGAAGCC
TATGGCGTGAAATCCAGCGTTATTGATATGGCCCGCTGGGTTCAGGCCAACATGGATGCCAGCCACGTTCAGGAGAAAACGCTCCAGCAG
GGCATTGCGCTTGCGCAGTCTCGCTACTGGCGTATTGGCGATATGTACCAGGGATTAGGCTGGGAGATGCTGAACTGGCCGCTGAAAGCT
GATTCGATCATCAACGGCAGCGACAGCAAAGTGGCATTGGCAGCGCTTCCCGCCGTTGAGGTAAACCCGCCCGCCCCCGCAGTGAAAGCC
TCATGGGTGCATAAAACGGGCTCCACTGGTGGATTTGGCAGCTACGTAGCCTTCGTTCCAGAAAAAAACCTTGGCATCGTGATGCTGGCA
AACAAAAGCTATCCTAACCCTGTCCGTGTCGAGGCGGCCTGGCGCATTCTTGAAAAGCTGCAATAA
>gb|AF231986.2|+|3308-4522|floR [Escherichia coli]
ATGACCACCACACGCCCCGCGTGGGCCTATACGCTGCCGGCAGCACTGCTGCTGATGGCTCCTTTCGACATCCTCGCTTCACTGGCGATG
GATATTTATCTCCCTGTCGTTCCAGCGATGCCCGGCATCCTGAACACGACGCCCGCTATGATCCAACTCACGTTGAGCCTCTATATGGTG
ATGCTCGGCGTGGGCCAGGTGATTTTTGGTCCGCTCTCAGACAGAATCGGGCGACGGCCAATTCTACTTGCGGGCGCAACGGCTTTCGTC
ATTGCGTCTCTGGGAGCAGCTTGGTCTTCAACTGCACCGGCCTTTGTCGCTTTCCGTCTACTTCAAGCAGTGGGCGCGTCGGCCATGCTG
GTGGCGACGTTCGCGACGGTTCGCGACGTTTATGCCAACCGTCCTGAGGGTGTCGTCATCTACGGCCTTTTCAGTTCGGTGCTGGCGTTC
GTGCCTGCGCTCGGCCCTATCGCCGGAGCATTGATCGGCGAGTTCTTGGGATGGCAGGCGATATTCATTACTTTGGCTATACTGGCGATG
CTCGCACTCCTAAATGCGGGTTTCAGGTGGCACGAAACCCGCCCTCTGGATCAAGTCAAGACGCGCCGATCTGTCTTGCCGATCTTCGCG
AGTCCGGCTTTTTGGGTTTACACTGTCGGCTTTAGCGCCGGTATGGGCACCTTCTTCGTCTTCTTCTCGACGGCTCCCCGTGTGCTCATA
GGCCAAGCGGAATATTCCGAGATCGGATTCAGCTTTGCCTTCGCCACTGTCGCGCTTGTAATGATCGTGACAACCCGTTTCGCGAAGTCC
TTTGTCGCCAGATGGGGCATCGCAGGATGCGTGGCGCGTGGGATGGCGTTGCTTGTTTGCGGAGCGGTCCTGTTGGGGATCGGCGAACTT
TACGGCTCGCCGTCATTCCTCACCTTCATCCTACCGATGTGGGTTGTCGCGGTCGGTATTGTCTTCACGGTGTCCGTTACCGCGAACGGC
GCTTTGGCAGAGTTCGACGACATCGCGGGATCAGCGGTCGCGTTCTACTTCTGCGTTCAAAGCCTGATAGTCAGCATTGTCGGGACATTG
GCGGTGGCACTTTTAAACGGTGACACAGCGTGGCCCGTGATCTGTTACGCCACGGCGATGGCGGTACTGGTTTCGTTGGGGCTGGTGCTC
CTTCGGCTCCGTGGGGCTGCCACCGAGAAGTCGCCAGTCGTCTAA
```

We might optionally provide a --target_metadata CSV file which looks as follows:

| target | drugclass | resistancemechanism |
|------------------------------------|-------------|-------------------------|
| gb\|X91840.1\|+\|1-1146\|CMY-2 | betalactam | antibioticinactivation |
| gb\|AF231986.2\|+\|3308-4522\|floR | phenicol | antibiotic_efflux |

So, BaitCapture would generate the following presence_absence_clusters.tsv file if only the CMY-2 target is detected in sampleA:

| sampleid | metavarname | metavarvalue | presenceabsence |
|----------|----------------------|-------------------------|------------------|
| sampleA | drugclass | betalactam | 1 |
| sampleA | drugclass | phenicol | 0 |
| sampleA | resistancemechanism | antibioticinactivation | 1 |
| sampleA | resistancemechanism | antibiotic_efflux | 0 |
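
The logic behind this example can be sketched in Python. The sketch assumes that a cluster is called present when at least one of its member targets is detected, which is consistent with the example but is not confirmed as BaitCapture's exact rule; the function name `cluster_presence` and the tuple layout are hypothetical.

```python
def cluster_presence(sample_id, detected_targets, metadata_rows):
    """Return (sample, variable, value, presence) tuples for one sample.

    detected_targets: set of target names called present in the sample.
    metadata_rows: list of dicts, each with a 'target' key plus one or
    more categorical columns (as read from the target metadata CSV).
    """
    if not metadata_rows:
        return []
    variables = [col for col in metadata_rows[0] if col != "target"]
    results = []
    for variable in variables:
        present = {}  # cluster value -> 0 or 1, in first-seen order
        for row in metadata_rows:
            hit = int(row["target"] in detected_targets)
            # A cluster is present if any of its targets is detected.
            present[row[variable]] = max(present.get(row[variable], 0), hit)
        results.extend(
            (sample_id, variable, value, flag) for value, flag in present.items()
        )
    return results
```

Given the two metadata rows from the example and a detected set containing only the CMY-2 target, this returns the four rows shown in the table, in the same order.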

Output

The pipeline will output the following directories:

    results/
    ├── fastp
    ├── fastqc
    │   ├── preprocessed
    │   └── raw
    ├── multiqc
    │   ├── multiqc_data
    │   └── multiqc_plots
    ├── pipeline_info
    └── summary

fastp/: Contains fastp reports for trimmed FASTQ files.
fastqc/: Contains FastQC reports for raw and pre-processed FASTQ files.
multiqc/: Contains MultiQC reports.
pipeline_info/: Contains Nextflow logs and reports.
summary/: Contains alignment reports for each sample.

Testing the workflow

To check if BaitCapture, Nextflow, and your container manager have been configured properly, a test run of the workflow can be performed by first cloning the GitHub repository and then running the test workflow as follows:

    git clone https://github.com/OLC-Bioinformatics/BaitCapture
    cd BaitCapture/
    nextflow run . \
      -profile test,<docker/singularity/.../institute> \
      --outdir <OUTDIR>

If your <OUTDIR> was results/, you could then run the following command to inspect the summary statistics for the test sample:

    $ cat results/summary/sumstats.tsv
    sampleid raw_total_reads raw_total_bp fastp_total_reads fastp_total_bp decontam_total_reads decontam_total_bp mapped_total_reads mapped_total_bp percent_reads_lost_fastp percent_reads_lost_decontam percent_reads_on_target
    SRR14739083 553624 83043600 464776 69619648 455764 68278988 98485 14753774 16.05 1.94 21.61
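
The three percentage columns can be reproduced from the read counts in the same row. The Python sketch below uses formulas inferred from the test output, not taken from the workflow's source: reads lost at each step are expressed relative to the reads entering that step, and the on-target rate is expressed relative to the decontaminated reads. The function names are hypothetical.

```python
def percent(part, whole):
    """Return part as a percentage of whole, rounded to two decimals."""
    return round(100 * part / whole, 2)


def summarise_reads(raw, fastp, decontam, mapped):
    """Recompute the percentage columns of sumstats.tsv from read counts."""
    return {
        # Reads removed by trimming, relative to the raw reads.
        "percent_reads_lost_fastp": percent(raw - fastp, raw),
        # Reads removed by host decontamination, relative to trimmed reads.
        "percent_reads_lost_decontam": percent(fastp - decontam, fastp),
        # Reads aligned to targets, relative to pre-processed reads.
        "percent_reads_on_target": percent(mapped, decontam),
    }
```

For the test sample, `summarise_reads(553624, 464776, 455764, 98485)` gives 16.05, 1.94, and 21.61, matching the reported values.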

Running the workflow on high-performance compute clusters

Nextflow is capable of running several jobs in parallel using job submission managers (e.g. SLURM) that have been configured on high-performance compute (HPC) clusters. For your convenience, profiles have been added to simplify running the workflow on commonly used clusters.

Waffles

To run BaitCapture on the National Microbiology Laboratory's HPC cluster Waffles using Singularity, use -profile waffles. For example:

    nextflow run OLC-Bioinformatics/BaitCapture \
      -profile waffles \
      --input samplesheet.csv \
      --targets targets.fa \
      --outdir <OUTDIR>

Advanced usage

More usage information can be obtained at any time by running nextflow run OLC-Bioinformatics/BaitCapture --help:

```
$ nextflow run OLC-Bioinformatics/BaitCapture --help
N E X T F L O W ~ version 23.10.1
Launching `https://github.com/OLC-Bioinformatics/BaitCapture` [stupefied_bassi] DSL2 - revision: 0fd15d548b [dev]

olc/baitcapture v1.0.0-g0fd15d5

Typical pipeline command:

  nextflow run olc/baitcapture --input samplesheet.csv --targets targets.fa --outdir results/ -profile singularity

Input/output options
  --input                        [string]  Path to comma-separated file containing information about the samples in the experiment.
  --input_folder                 [string]  Path to folder containing paired-end gzipped FASTQ files.
  --pattern                      [string]  Naming of sequencing files for `--input_folder`. Must use double-quotes ("") and a prepended slash (`/`). [default: "/*_R{1,2}_001.fastq.gz"]
  --targets                      [string]  Path to FASTA file of gene targets for alignment.
  --outdir                       [string]  The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
  --host                         [string]  Path to FASTA file of host genome to use for host DNA removal (decontamination).
  --adapters                     [string]  Path to FASTA file of adapter sequences to use for adapter removal with FASTP.
  --target_metadata              [string]  Path to comma-separated file containing information about the metadata for targets used in the experiment.
  --email                        [string]  Email address for completion summary.
  --multiqc_title                [string]  MultiQC report title. Printed as page header, used for filename if not otherwise specified.

Workflow execution options
  --aligner                      [string]  Alignment tool to use for aligning (preprocessed) reads to the provided database of gene targets). (accepted: bwamem2, kma, bwa) [default: kma]
  --skip_trimming                [boolean] Indicate whether to skip trimming of raw reads.
  --report_all                   [boolean] Report undetected targets in merged results files.

Target detection thresholds
  --fold_cov_threshold           [number]  The minimum fold-coverage of a target that must be achieved to call a positive detection. [default: 0.9]
  --len_cov_threshold            [integer] The minimum length (in bp) that a target must be covered by to call a positive detection. [default: 0]
  --mapped_reads_threshold       [integer] The minimum number of reads that must be mapped to a target to call a positive detection. [default: 2]
  --prop_cov_threshold           [number]  The minimum percentage of length (in bp) that a target must be covered by to call a positive detection. [default: 0.9]
  --pident_threshold             [number]  The minimum percentage identity match to a target that must be achieved to call a positive detection (only available with --aligner kma).

Generic options
  --multiqc_methods_description  [string]  Custom MultiQC yaml file containing HTML including a methods description.

!! Hiding 23 params, use the 'validation.showHiddenParams' config value to show them !!

If you use olc/baitcapture for your analysis please cite:

* The nf-core framework
  https://doi.org/10.1038/s41587-020-0439-x

* Software dependencies
  https://github.com/olc/baitcapture/blob/master/CITATIONS.md
```
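
To illustrate how the target detection thresholds in the help output might combine into a presence-absence call, here is a minimal Python sketch. It assumes that every threshold must be met for a positive detection, which this README does not state explicitly. The default values are those listed in the help output; the class `Thresholds`, the function `is_present`, and the statistic names are hypothetical, and the KMA-only percentage identity threshold is left out.

```python
from dataclasses import dataclass


@dataclass
class Thresholds:
    """Detection thresholds, with defaults copied from the help output."""
    fold_cov: float = 0.9
    len_cov: int = 0
    mapped_reads: int = 2
    prop_cov: float = 0.9


def is_present(stats, thresholds=None):
    """Return True if a target meets every threshold (assumed AND logic).

    stats: dict with 'fold_cov', 'len_cov', 'mapped_reads', 'prop_cov'.
    """
    t = thresholds or Thresholds()
    return (
        stats["fold_cov"] >= t.fold_cov
        and stats["len_cov"] >= t.len_cov
        and stats["mapped_reads"] >= t.mapped_reads
        and stats["prop_cov"] >= t.prop_cov
    )
```

Under these assumptions, a target covered along its full length by a single mapped read would be called absent with the defaults, because the minimum number of mapped reads is 2.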

Contributions and support

If you would like to contribute to this pipeline, please see the contributing guidelines.

Citations

If you use BaitCapture in a publication, please consider citing the software using the Zenodo DOI: 10.5281/zenodo.11283946.

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

This pipeline uses code and infrastructure developed and maintained by the nf-core initiative, including nf-core/mag and nf-core/ampliseq, and reused here under the MIT license.

Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031.

Owner

Name: OLC-Bioinformatics
Login: OLC-Bioinformatics
Kind: organization

Repositories: 26
Profile: https://github.com/OLC-Bioinformatics

Citation (CITATIONS.md)

# olc/baitcapture: Citations

## [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/)

> Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031.

## [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/)

> Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311.

## Pipeline tools

- [AlignCov](https://github.com/pcrxn/aligncov)

  > Brown LP. (2023) AlignCov [Online].

- [BWA](https://arxiv.org/abs/1303.3997)

  > Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv. 2013:1303.3997v2. doi: 10.48550/arXiv.1303.3997

- [BWA-MEM2](https://ieeexplore.ieee.org/document/8820962)

  > Vasimuddin M, Misra S, Li H, Aluru S. Efficient architecture-aware acceleration of BWA-MEM for multicore systems. IEEE Parallel and Distributed Processing Symposium (IPDPS). 2019;19028010. doi: 10.1109/IPDPS.2019.00041

- [fastp](https://onlinelibrary.wiley.com/doi/10.1002/imt2.107)

  > Chen, S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta. 2023;2:e107. doi: 10.1002/imt2.107

- [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)

  > Andrews, S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data [Online].

- [fastq-scan](https://github.com/rpetit3/fastq-scan)
  
    > Petit, R. (2023) fastq-scan [Online].

- [KMA](https://bitbucket.org/genomicepidemiology/kma)

  > Clausen PTLC, Aarestrup FM, Lund O. Rapid and precise alignment of raw reads against redundant databases with KMA. BMC Bioinformatics. 2018;19:307. doi: 10.1186/s12859-018-2336-6

- [Mosdepth](https://academic.oup.com/bioinformatics/article/34/5/867/4583630)

  > Pedersen BS, Quinlan AR. Mosdepth: quick coverage calculation for genomes and exomes. Bioinformatics. 2018;34(5):867-868. doi: 10.1093/bioinformatics/btx699

- [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/)

  > Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924.

- [SAMtools](https://doi.org/10.1093/bioinformatics/btp352)

  > Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–2079. doi: 10.1093/bioinformatics/btp352

## Software packaging/containerisation tools

- [Anaconda](https://anaconda.com)

  > Anaconda Software Distribution. Computer software. Vers. 2-2.4.0. Anaconda, Nov. 2016. Web.

- [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/)

  > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506.

- [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/)

  > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671.

- [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241)

  > Merkel, D. (2014). Docker: lightweight linux containers for consistent development and deployment. Linux Journal, 2014(239), 2. doi: 10.5555/2600239.2600241.

- [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/)

  > Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.

GitHub Events

Total

Watch event: 1
Push event: 1

Last Year

Watch event: 1
Push event: 1

Dependencies

modules/nf-core/bwamem2/index/meta.yml cpan

modules/nf-core/bwamem2/mem/meta.yml cpan

modules/nf-core/custom/dumpsoftwareversions/meta.yml cpan

modules/nf-core/fastqc/meta.yml cpan

modules/nf-core/multiqc/meta.yml cpan

pyproject.toml pypi

modules/nf-core/bwa/index/meta.yml cpan

modules/nf-core/bwa/mem/meta.yml cpan

modules/nf-core/samtools/flagstat/meta.yml cpan

modules/nf-core/samtools/idxstats/meta.yml cpan

modules/nf-core/samtools/index/meta.yml cpan

modules/nf-core/samtools/stats/meta.yml cpan

subworkflows/nf-core/bam_stats_samtools/meta.yml cpan

modules/local/aligncov/environment.yml conda

aligncov 0.0.2.*

modules/local/bwamem2_host_mem/environment.yml conda

bwa-mem2 2.2.1.*
samtools 1.16.1.*

modules/nf-core/bwamem2/index/environment.yml conda

bwa-mem2 2.2.1.*

modules/nf-core/bwamem2/mem/environment.yml conda

bwa-mem2 2.2.1.*
htslib 1.19.1.*
samtools 1.19.2.*

modules/nf-core/custom/dumpsoftwareversions/environment.yml conda

multiqc 1.20.*

modules/nf-core/fastqc/environment.yml conda

fastqc 0.12.1.*

modules/nf-core/multiqc/environment.yml conda

multiqc 1.21.*

modules/nf-core/fastp/meta.yml cpan

modules/nf-core/fastqscan/meta.yml cpan

modules/nf-core/mosdepth/meta.yml cpan

modules/local/cat_results/environment.yml pypi

modules/local/kma/align/environment.yml pypi

modules/local/kma/index/environment.yml pypi

modules/local/merge_mapping_results/environment.yml pypi

modules/local/summarize_stats/environment.yml pypi

modules/local/validate_target_metadata/environment.yml pypi

modules/nf-core/bwa/index/environment.yml pypi

modules/nf-core/bwa/mem/environment.yml pypi

modules/nf-core/fastp/environment.yml pypi

modules/nf-core/fastqscan/environment.yml pypi

modules/nf-core/mosdepth/environment.yml pypi

modules/nf-core/samtools/flagstat/environment.yml pypi

modules/nf-core/samtools/idxstats/environment.yml pypi

modules/nf-core/samtools/index/environment.yml pypi

modules/nf-core/samtools/stats/environment.yml pypi