Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 7 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.2%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: MzDiaz824
- License: mit
- Language: Nextflow
- Default Branch: main
- Size: 26.2 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Getting Started
Install
Nextflow(>=21.10.3)Install
SingularityStart running your own analysis!
```console
nextflow run main.nf -profile singularity --input
--outdir ```
Please specify a directory for your results if you'd like to keep data for long term storage. Omitting the <--outdir> command will require the user to copy the pipeline results directory to the desired directory. This can consume a large amount of time when more than 5 samples are run.
Running Quaisar:
Note: First-time users will need to run the following command and allow the required databases to download:
console
nextflow run nf-core/quaisar -profile singularity -entry <databasedownloads>
Pipeline summary:
Preprocessing of Raw Reads
- PhiX adapter trimming and filtering of reads using BBDuK (
BBDuK) - Read filtering, adapter trimming, quality profiling and base correction using fastp (
fastp) - Raw Reads QC Assessment specifics found in "QC Summary by Read Processing State" section
*T Denotes Trimmed Reads that Were Not Assembled
*A Denotes Reads that Proceed to Assembly
Analysis of Trimmed Reads
4T. Gene detection and allele calling for antibiotic resistance, virulence (adding hypervirulence genes beyond their mention in the alerts section?), and/or plasmids using srst2 AR (srst2 AR)
5T. Report sequence types based on MLST alleles and profile definitions using srst2 MLST (srst2 MLST)
6T. Kraken2 (Kraken2)
Assembly
- Assemby of trimmed reads using SPAdes (
SPAdes)
### Analysis of Assembled Reads >= 500bps -------------------------------------------------------------------------------------------------------------------------------------------------------- 4A. Assess assembly quality using QUAST (QUAST)
5A. Measure the nucleotide-level coding region similarity (between genomes) using fastANI (fastANI)
6A. Type multiple loci to characterize isolates of microbial species using MLST (MLST)
7A. Detect hypervirulence genes and find best matches to untranslated genes from a gene database using GAMMA (GAMMA)
9A. Rapid whole genome annotation using Prokka (PROKKA)
10A. Assess genome assembly for completeness using BUSCO (BUSCO)
11A. KRAKEN2 (Kraken2)
Format Results of Analysis
QC Summary by Read Processing State
Raw Reads
Troubleshooting
Users may encounter exended lag time and/or issues connecting to certain databases. This can commonly occur while running the pipeline on a cluster. Please confirm that your HPC environment is not experiencing technical issues, updates, and/or downtimes that impact functionality if these types of errors occur.
Specifically, this error indicates that the user will need additional memory in order to run Kraken2.
Documentation
The nf-core/quaisar pipeline comes with documentation about the pipeline usage, parameters and output.
Credits
nf-core/quaisar was originally written by Rich Stanton, Nick Vlachos, Alyssa Kent, Maria Diaz, and Jill Hagey.
We thank the following people for their extensive assistance in the development of this pipeline:
Contributions and Support
If you would like to contribute to this pipeline, please see the contributing guidelines.
For further information or help, don't hesitate to get in touch on the Slack #quaisar channel (you can join with this invite).
Citations
An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.
You can cite the nf-core publication as follows:
The nf-core framework for community-curated bioinformatics pipelines.
Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.
Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.
Introduction (Needs to be developed then will move to top of page)
nf-core/quaisar is a bioinformatics best-practice analysis pipeline for Quality, Assembly, Identification, Sequencing Typing, Annotation and Resistance mechanisms.
The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker/Singularity containers making installation trivial and results highly reproducible. The Nextflow DSL2 implementation of this pipeline uses one container per process which makes it much easier to maintain and update software dependencies. Where possible, these processes have been submitted to and installed from nf-core/modules in order to make them available to all nf-core pipelines, and to everyone within the Nextflow community!
On release, automated continuous integration tests run the pipeline on a full-sized dataset on the AWS cloud infrastructure. This ensures that the pipeline runs on AWS, has sensible resource allocation defaults set to run on real-world datasets, and permits the persistent storage of results to benchmark between pipeline releases and other analysis sources. The results obtained from the full-sized test can be viewed on the nf-core website
Owner
- Name: Maria Diaz
- Login: MzDiaz824
- Kind: user
- Location: Atlanta, GA
- Repositories: 3
- Profile: https://github.com/MzDiaz824
Bioinformatician focused on metagenomics and molecular biology 👾
Citation (CITATIONS.md)
# nf-core/quaisar: Citations
## [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/)
> Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031.
## [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/)
> Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311.
## Pipeline tools
* [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)
* [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/)
> Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924.
## Software packaging/containerisation tools
* [Anaconda](https://anaconda.com)
> Anaconda Software Distribution. Computer software. Vers. 2-2.4.0. Anaconda, Nov. 2016. Web.
* [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/)
> Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506.
* [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/)
> da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671.
* [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241)
* [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/)
> Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: 12 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0