rsv_ampseq

Nextflow pipeline for genome assembly and variants calling of human respiratory syncytial virus amplicon sequencing

https://github.com/asereewit/rsv_ampseq

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.2%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Nextflow pipeline for genome assembly and variants calling of human respiratory syncytial virus amplicon sequencing

Basic Info
  • Host: GitHub
  • Owner: asereewit
  • License: mit
  • Language: Nextflow
  • Default Branch: main
  • Size: 2.65 MB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme Changelog Contributing License Code of conduct Citation

README.md

RSV-AMPSEQ

A nextflow pipeline for genome assembly and variants calling (fusion gene) of human respiratory syncytial virus amplicon sequencing

Usage

Install Nextflow

Install Docker

Running the pipeline:

nextflow run asereewit/rsv_ampseq -r main -latest --input input_samplesheet.csv --output pipeline_output -profile docker

To run it on AWS, add your nextflow config for AWS using -c:

nextflow run asereewit/rsv_ampseq -r main -latest --input input_samplesheet.csv --output pipeline_output -profile docker -with-tower -c nextflow_aws.config

You can create a samplesheet using the bundled python script fastqdirsamplesheet.py in bin folder, like this:

python fastq_dir_samplesheet.py fastq_dir samplesheet_name.csv

Options

|Option|Explanation| |------|-----------| | --input | samplesheet in csv format with fastq information, samplesheet example: assets/samplesheet.csv | | --output | output directory | | --skip_fastqc | skip quality control using FastQC (default: false) | | --skip_fastp | skip adapters and reads trimming using fastp (default: false) | | --trim_len | minimum read length to keep (default:50) | | --min_trim_reads | mininum number of trimmed reads required for downstream processes (default: 0) | | --min_mapped_reads | minimum number of mapped reads for variants and consensus calling (default: 1000) | | --skip_ivar_trim | skip primer clipping step |

Custom primers options

|Option|Explanation| |------|-----------| | --ref_RSVA | RSVA reference fasta | | --primer_bed_RSVA | RSVA primer bed file with "RSVA" as the header in the first column, "LEFT" denotes forward primers, "RIGHT" denotes reverse primers | | --ref_gff_RSVA | RSVA gff file with "RSVA" as the header in the first column | | --RSVA_F_coord | RSVA reference's fusion gene coordinates in the format of xxxx-yyyy (e.g. 5682-7406) | | --RSVA_F_length | RSVA fusion gene length | | --ref_RSVB | RSVB reference fasta | | --primer_bed_RSVB | RSVB primer bed file with "RSVB" as the header in the first column, "LEFT" denotes forward primers, "RIGHT" denotes reverse primers | | --ref_gff_RSVB | RSVB gff file with "RSVB" as the header in the first column | | --RSVB_F_coord | RSVB reference's fusion gene coordinates in the format of xxxx-yyyy (e.g. 5717-7441) | | --RSVB_F_length | RSVB fusion gene length |

Owner

  • Login: asereewit
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use `nf-core tools` in your work, please cite the `nf-core` publication"
authors:
  - family-names: Ewels
    given-names: Philip
  - family-names: Peltzer
    given-names: Alexander
  - family-names: Fillinger
    given-names: Sven
  - family-names: Patel
    given-names: Harshil
  - family-names: Alneberg
    given-names: Johannes
  - family-names: Wilm
    given-names: Andreas
  - family-names: Garcia
    given-names: Maxime Ulysse
  - family-names: Di Tommaso
    given-names: Paolo
  - family-names: Nahnsen
    given-names: Sven
title: "The nf-core framework for community-curated bioinformatics pipelines."
version: 2.4.1
doi: 10.1038/s41587-020-0439-x
date-released: 2022-05-16
url: https://github.com/nf-core/tools
prefered-citation:
  type: article
  authors:
    - family-names: Ewels
      given-names: Philip
    - family-names: Peltzer
      given-names: Alexander
    - family-names: Fillinger
      given-names: Sven
    - family-names: Patel
      given-names: Harshil
    - family-names: Alneberg
      given-names: Johannes
    - family-names: Wilm
      given-names: Andreas
    - family-names: Garcia
      given-names: Maxime Ulysse
    - family-names: Di Tommaso
      given-names: Paolo
    - family-names: Nahnsen
      given-names: Sven
  doi: 10.1038/s41587-020-0439-x
  journal: nature biotechnology
  start: 276
  end: 278
  title: "The nf-core framework for community-curated bioinformatics pipelines."
  issue: 3
  volume: 38
  year: 2020
  url: https://dx.doi.org/10.1038/s41587-020-0439-x

GitHub Events

Total
Last Year