wf-pocp
WARN: This pipeline has been merged with wf-aai. Compute Percentage of Conserved Proteins (POCP) between FastA and GenBank files
Science Score: 18.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.4%) to scientific vocabulary
Repository
WARN: This pipeline has been merged with wf-aai. Compute Percentage of Conserved Proteins (POCP) between FastA and GenBank files
Basic Info
- Host: GitHub
- Owner: gregorysprenger
- License: apache-2.0
- Language: Nextflow
- Default Branch: main
- Homepage: https://github.com/bacterial-genomics/wf-aai
- Size: 2.36 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md

General schematic of the steps in the workflow
Contents
- Quick Start
- Introduction
- Installation
- Usage
- Parameters
- Resource Managers
- Output
- Troubleshooting
- Contributions and Support
- Citations
Quick Start: Test
Run the built-in test set to confirm all parts are working as-expected. It will also download all dependencies to make subsequent runs much faster.
Pull workflow from GitHub
nextflow pull bacterial-genomics/wf-pocp
Run test workflow
nextflow run \
bacterial-genomics/wf-pocp \
-r main \
-profile <docker|singularity>,test \
--outdir results
Quick Start: Run
Example command on FastAs in "new-fasta-dir" data using BLAST with singularity:
Pull workflow from GitHub
nextflow pull bacterial-genomics/wf-pocp
Run workflow
nextflow run \
bacterial-genomics/wf-pocp \
-r main \
-profile singularity \
--input new-fasta-dir \
--outdir my-results \
--pocp blast
Introduction
This workflow performs average nucleotide identity on assembled and/or annotated files (FastA/Genbank).
Installation
- Nextflow
>=22.04.03 - Docker or Singularity
>=3.8.0 - Conda is currently unsupported
Usage
nextflow run main.nf -profile <docker|singularity> --input <input directory> --outdir <directory for results> --pocp <blast|diamond>
Please see the usage documentation for further information on using this workflow.
Parameters
Note the "--" long name arguments (e.g., --help, --input, --outdir) are generally specific to this workflow's options, whereas "-" long name options (e.g., -help, -latest, -profile) are general nextflow options.
These are the most pertinent options for this workflow:
Required parameters
``` ============================================ Input/Output ============================================ --input Path to input data directory containing FastA/Genbank assemblies or samplesheet. Recognized extensions are: {faa,gb,gbk,gbf,gbff} with optional gzip compression.
--query Path to input data FastA/Genbank file or samplesheet. Recognized extensions are: {faa,gb,gbk,gbf,gbff} with optional gzip compression.
--refdir Path to reference panel data directory containing FastA/Genbank assemblies or samplesheet. Recognized extensions are: {faa,gb,gbk,gbf,gbff} with optional gzip compression.
--outdir The output directory where the results will be saved.
============================================ Container platforms ============================================ -profile singularity Use Singularity images to run the workflow. Will pull and convert Docker images from Dockerhub if not locally available.
-profile docker Use Docker images to run the workflow. Will pull images from Dockerhub if not locally available.
============================================ Optional POCP tools ============================================ --pocp Specify what algorithm should be used to compare input files. Recognized arguments are: blast, diamond. [Default: blast] ```
Additional parameters
View help menu of all workflow options:
nextflow run \
bacterial-genomics/wf-pocp \
-r main \
--help \
--show_hidden_params
Resource Managers
The most well-tested and supported is a Univa Grid Engine (UGE) job scheduler with Singularity for dependency handling.
- UGE/SGE
- Additional tips for UGE processing are here.
- No Scheduler
- It has also been confirmed to work on desktop and laptop environments without a job scheduler using Docker with more tips here.
Output
Please see the output documentation for a table of all outputs created by this workflow.
Troubleshooting
Q: It failed, how do I find out what went wrong?
A: View file contents in the <outdir>/log directory.
Contributions and Support
If you would like to contribute to this pipeline, please see the contributing guidelines.
Citations
An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.
Owner
- Name: Gregory Sprenger
- Login: gregorysprenger
- Kind: user
- Repositories: 5
- Profile: https://github.com/gregorysprenger
Citation (CITATIONS.md)
# wf-pocp: Citations ## [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/) > Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031. ## [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/) > Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311. ## Pipeline tools - [BLAST+](https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-10-421) > Camacho C, Coulouris G, Avagyan V, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009/12/15 2009;10(1):421. doi:10.1186/1471-2105-10-421 - [BioPython](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2682512/) > Cock PJ, Antao T, Chang JT, et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. Jun 1 2009;25(11):1422-3. doi:10.1093/bioinformatics/btp163 - [DIAMOND](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8026399/) > Buchfink B, Reuter K, Drost HG. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat Methods. Apr 2021;18(4):366-368. doi:10.1038/s41592-021-01101-x ## Software packaging/containerisation tools - [Anaconda](https://anaconda.com) > Anaconda Software Distribution. Computer software. Vers. 2-2.4.0. Anaconda, Nov. 2016. Web. - [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/) > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506. - [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/) > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671. - [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241) - [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/) > Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675. ### Citation Information Citations were created with the help of EndNote and were exported in AMA format.
GitHub Events
Total
Last Year
Dependencies
- mshick/add-pr-comment v1 composite
- actions/checkout v3 composite
- nf-core/setup-nextflow v1 composite
- actions/checkout v3 composite
- actions/setup-node v3 composite
- actions/checkout v3 composite
- actions/setup-node v3 composite
- mshick/add-pr-comment v1 composite
- psf/black stable composite
- dawidd6/action-download-artifact v2 composite
- marocchino/sticky-pull-request-comment v2 composite
- actions/checkout v3 composite
- nf-core/setup-nextflow v1 composite