rsv_ampseq
Nextflow pipeline for genome assembly and variants calling of human respiratory syncytial virus amplicon sequencing
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.2%) to scientific vocabulary
Repository
Nextflow pipeline for genome assembly and variants calling of human respiratory syncytial virus amplicon sequencing
Basic Info
- Host: GitHub
- Owner: asereewit
- License: mit
- Language: Nextflow
- Default Branch: main
- Size: 2.65 MB
Statistics
- Stars: 2
- Watchers: 1
- Forks: 1
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
RSV-AMPSEQ
A nextflow pipeline for genome assembly and variants calling (fusion gene) of human respiratory syncytial virus amplicon sequencing
Usage
Install Nextflow
Install Docker
Running the pipeline:
nextflow run asereewit/rsv_ampseq -r main -latest --input input_samplesheet.csv --output pipeline_output -profile docker
To run it on AWS, add your nextflow config for AWS using -c:
nextflow run asereewit/rsv_ampseq -r main -latest --input input_samplesheet.csv --output pipeline_output -profile docker -with-tower -c nextflow_aws.config
You can create a samplesheet using the bundled python script fastqdirsamplesheet.py in bin folder, like this:
python fastq_dir_samplesheet.py fastq_dir samplesheet_name.csv
Options
|Option|Explanation|
|------|-----------|
| --input | samplesheet in csv format with fastq information, samplesheet example: assets/samplesheet.csv |
| --output | output directory |
| --skip_fastqc | skip quality control using FastQC (default: false) |
| --skip_fastp | skip adapters and reads trimming using fastp (default: false) |
| --trim_len | minimum read length to keep (default:50) |
| --min_trim_reads | mininum number of trimmed reads required for downstream processes (default: 0) |
| --min_mapped_reads | minimum number of mapped reads for variants and consensus calling (default: 1000) |
| --skip_ivar_trim | skip primer clipping step |
Custom primers options
|Option|Explanation|
|------|-----------|
| --ref_RSVA | RSVA reference fasta |
| --primer_bed_RSVA | RSVA primer bed file with "RSVA" as the header in the first column, "LEFT" denotes forward primers, "RIGHT" denotes reverse primers |
| --ref_gff_RSVA | RSVA gff file with "RSVA" as the header in the first column |
| --RSVA_F_coord | RSVA reference's fusion gene coordinates in the format of xxxx-yyyy (e.g. 5682-7406) |
| --RSVA_F_length | RSVA fusion gene length |
| --ref_RSVB | RSVB reference fasta |
| --primer_bed_RSVB | RSVB primer bed file with "RSVB" as the header in the first column, "LEFT" denotes forward primers, "RIGHT" denotes reverse primers |
| --ref_gff_RSVB | RSVB gff file with "RSVB" as the header in the first column |
| --RSVB_F_coord | RSVB reference's fusion gene coordinates in the format of xxxx-yyyy (e.g. 5717-7441) |
| --RSVB_F_length | RSVB fusion gene length |
Owner
- Login: asereewit
- Kind: user
- Repositories: 1
- Profile: https://github.com/asereewit
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use `nf-core tools` in your work, please cite the `nf-core` publication"
authors:
- family-names: Ewels
given-names: Philip
- family-names: Peltzer
given-names: Alexander
- family-names: Fillinger
given-names: Sven
- family-names: Patel
given-names: Harshil
- family-names: Alneberg
given-names: Johannes
- family-names: Wilm
given-names: Andreas
- family-names: Garcia
given-names: Maxime Ulysse
- family-names: Di Tommaso
given-names: Paolo
- family-names: Nahnsen
given-names: Sven
title: "The nf-core framework for community-curated bioinformatics pipelines."
version: 2.4.1
doi: 10.1038/s41587-020-0439-x
date-released: 2022-05-16
url: https://github.com/nf-core/tools
prefered-citation:
type: article
authors:
- family-names: Ewels
given-names: Philip
- family-names: Peltzer
given-names: Alexander
- family-names: Fillinger
given-names: Sven
- family-names: Patel
given-names: Harshil
- family-names: Alneberg
given-names: Johannes
- family-names: Wilm
given-names: Andreas
- family-names: Garcia
given-names: Maxime Ulysse
- family-names: Di Tommaso
given-names: Paolo
- family-names: Nahnsen
given-names: Sven
doi: 10.1038/s41587-020-0439-x
journal: nature biotechnology
start: 276
end: 278
title: "The nf-core framework for community-curated bioinformatics pipelines."
issue: 3
volume: 38
year: 2020
url: https://dx.doi.org/10.1038/s41587-020-0439-x