assemblyqc

A Nextflow pipeline for evaluating assembly quality

https://github.com/plant-food-research-open/assemblyqc

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 10 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.0%) to scientific vocabulary

Keywords

adaptor assembly busco contamination fcs genome hi-c k-mer merqury n50 phasing quality-control repeat report statistics synteny taxonomy telomere
Last synced: 6 months ago · JSON representation ·

Repository

A Nextflow pipeline for evaluating assembly quality

Basic Info
Statistics
  • Stars: 37
  • Watchers: 13
  • Forks: 8
  • Open Issues: 80
  • Releases: 0
Topics
adaptor assembly busco contamination fcs genome hi-c k-mer merqury n50 phasing quality-control repeat report statistics synteny taxonomy telomere
Created about 3 years ago · Last pushed 6 months ago
Metadata Files
Readme Changelog Contributing License Citation

README.md

plant-food-research-open/assemblyqc

GitHub Actions CI Status GitHub Actions Linting StatusCite Article nf-test

Nextflow run with conda ❌ run with docker run with singularity Launch on Seqera Platform

Introduction

plant-food-research-open/assemblyqc is a Nextflow pipeline which evaluates assembly quality with multiple QC tools and presents the results in a unified html report. The tools are shown in the Pipeline Flowchart and their references are listed in CITATIONS.md. The pipeline includes skip flags to disable execution of various tools.

Pipeline Flowchart

Usage

Refer to usage, parameters and output documents for details.

[!NOTE] If you are new to Nextflow and nf-core, please refer to this page on how to set-up Nextflow. Make sure to test your setup with -profile test before running the workflow on actual data.

Prepare an assemblysheet.csv file with following columns representing target assemblies and associated meta-data.

  • tag: A unique tag which represents the target assembly throughout the pipeline and in the final report
  • fasta: FASTA file

Now, you can run the pipeline using:

bash nextflow run plant-food-research-open/assemblyqc \ -revision <version> \ -profile <docker/singularity/.../institute> \ --input assemblysheet.csv \ --outdir <OUTDIR>

[!WARNING] Please provide pipeline parameters via the CLI or Nextflow -params-file option. Custom config files including those provided by the -c Nextflow option can be used to provide any configuration except for parameters; see docs.

Plant&Food Users

Download the pipeline to your /workspace/$USER folder. Change the parameters defined in the pfr/params.json file. Submit the pipeline to SLURM for execution.

bash sbatch ./pfr_assemblyqc

Credits

plant-food-research-open/assemblyqc was originally written by Usman Rashid (@gallvp) and Ken Smith (@hzlnutspread).

Ross Crowhurst (@rosscrowhurst), Chen Wu (@christinawu2008) and Marcus Davy (@mdavy86) generously contributed their QC scripts.

Mahesh Binzer-Panchal (@mahesh-panchal) and Simon Pearce (@SPPearce) helped port the pipeline modules and sub-workflows to nf-core schema.

We thank the following people for their extensive assistance in the development of this pipeline:

The pipeline uses nf-core modules contributed by following authors:

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

Citations

If you use plant-food-research-open/assemblyqc for your analysis, please cite it as:

AssemblyQC: A Nextflow pipeline for reproducible reporting of assembly quality.

Usman Rashid, Chen Wu, Jason Shiller, Ken Smith, Ross Crowhurst, Marcus Davy, Ting-Hsuan Chen, Ignacio Carvajal, Sarah Bailey, Susan Thomson & Cecilia H Deng.

Bioinformatics. 2024 July 30. doi: 10.1093/bioinformatics/btae477.

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

This pipeline uses code and infrastructure developed and maintained by the nf-core community, reused here under the MIT license.

The nf-core framework for community-curated bioinformatics pipelines.

Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.

Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.

Owner

  • Name: Plant-Food-Research-Open
  • Login: Plant-Food-Research-Open
  • Kind: organization

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this pipeline, please cite it as below."
authors:
  - family-names: "Rashid"
    given-names: "Usman"
    orcid: "https://orcid.org/0000-0002-1109-5493"
  - family-names: "Wu"
    given-names: "Chen"
  - family-names: "Shiller"
    given-names: "Jason"
  - family-names: "Smith"
    given-names: "Ken"
  - family-names: "Crowhurst"
    given-names: "Ross"
  - family-names: "Davy"
    given-names: "Marcus"
  - family-names: "Chen"
    given-names: "Ting-Hsuan"
  - family-names: "Carvajal"
    given-names: "Ignacio"
  - family-names: "Bailey"
    given-names: "Sarah"
  - family-names: "Thomson"
    given-names: "Susan"
  - family-names: "Deng"
    given-names: "Cecilia"
title: "AssemblyQC: A Nextflow pipeline for reproducible reporting of assembly quality"
version: 2.2.1
date-released: 2024-07-30
url: "https://github.com/Plant-Food-Research-Open/assemblyqc"
doi: 10.1093/bioinformatics/btae477

GitHub Events

Total
  • Create event: 50
  • Release event: 3
  • Issues event: 97
  • Watch event: 9
  • Delete event: 47
  • Member event: 1
  • Issue comment event: 179
  • Push event: 115
  • Pull request review comment event: 6
  • Pull request review event: 8
  • Pull request event: 89
  • Fork event: 4
Last Year
  • Create event: 50
  • Release event: 3
  • Issues event: 97
  • Watch event: 9
  • Delete event: 47
  • Member event: 1
  • Issue comment event: 179
  • Push event: 115
  • Pull request review comment event: 6
  • Pull request review event: 8
  • Pull request event: 89
  • Fork event: 4

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 120
  • Total pull requests: 88
  • Average time to close issues: 3 months
  • Average time to close pull requests: 4 days
  • Total issue authors: 8
  • Total pull request authors: 2
  • Average comments per issue: 0.63
  • Average comments per pull request: 1.08
  • Merged pull requests: 80
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 80
  • Pull requests: 65
  • Average time to close issues: 23 days
  • Average time to close pull requests: about 11 hours
  • Issue authors: 6
  • Pull request authors: 2
  • Average comments per issue: 0.38
  • Average comments per pull request: 1.15
  • Merged pull requests: 60
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • GallVp (101)
  • CeciliaDeng (7)
  • ignacio3437 (4)
  • rosscrowhurst (3)
  • christinawu2008 (2)
  • AlcaArctica (1)
  • SarahBailey1998 (1)
  • tseemann (1)
Pull Request Authors
  • GallVp (85)
  • ignacio3437 (3)
Top Labels
Issue Labels
enhancement (47) bug (28) done on dev (23) documentation (4) v2 (3) wontfix (3) awaiting-feedback (2) Stale (2) question (2) feature request (1) critical (1) fatal (1)
Pull Request Labels
enhancement (2) done on dev (2) feature request (1)