pipesnake
ausarg/pipesnake is a bioinformatics best-practice analysis pipeline for phylogenomic reconstruction starting from short-read 'second-generation' sequencing data.
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 12 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (17.2%) to scientific vocabulary
Keywords
Repository
ausarg/pipesnake is a bioinformatics best-practice analysis pipeline for phylogenomic reconstruction starting from short-read 'second-generation' sequencing data.
Basic Info
Statistics
- Stars: 8
- Watchers: 5
- Forks: 4
- Open Issues: 5
- Releases: 3
Topics
Metadata Files
README.md
Welcome to ausarg/pipesnake

ausarg/pipesnake is a bioinformatics best-practice analysis pipeline for phylogenomic reconstruction starting from short-read 'second-generation' sequencing data.
The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker/Singularity containers making installation trivial and results highly reproducible. The Nextflow DSL2 implementation of this pipeline uses one container per process which makes it much easier to maintain and update software dependencies.
Wiki + Quick Start
All of pipesnake's documentation is covered in this lovingly crafted Wiki.
Antsy? Follow our Quick Start guide to get up and running (slithering?).
Documentation + Detailed Summary
pipesnake comes with documentation about the pipeline usage, parameters and output.
A detailed summary of the workflow---including tools used---can be found in the Wiki.
Contributions + Support
If you would like to contribute to this pipeline, please see the contributing guidelines.
For further information or help, don't hesitate to get in touch. Having trouble? Open an issue or try contacting us directly.
Citation
If you use pipesnake, please consider citing it and its depenedencies:
Ian G Brennan, Sonal Singhal, Ziad Al Bkhetan, pipesnake: generalized software for the assembly and analysis of phylogenomic datasets from conserved genomic loci, Bioinformatics, Volume 40, Issue 5, May 2024, btae195, https://doi.org/10.1093/bioinformatics/btae195
An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.
Owner
- Name: AusARG
- Login: AusARG
- Kind: organization
- Repositories: 2
- Profile: https://github.com/AusARG
Citation (CITATIONS.md)
# ausarg/pipesnake: Citations ## Nextflow - [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/) > Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031. - [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/) > Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311. ## Pipeline Tools - [ASTER/ASTRAL]() > Zhang, C., & Mirarab, S. (2022). Weighting by gene tree uncertainty improves accuracy of quartet-based species trees. Molecular Biology and Evolution, 39(12), msac215. - [BBMAP](https://jgi.doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbmap-guide/) > Bushnell, B. (2014). BBMap: a fast, accurate, splice-aware aligner (No. LBNL-7065E). Lawrence Berkeley National Lab.(LBNL), Berkeley, CA (United States). - [BLAT](https://genome.ucsc.edu/FAQ/FAQblat.html) > Kent, W. J. (2002). BLAT—the BLAST-like alignment tool. Genome research, 12(4), 656-664. - [GBLOCKS](https://home.cc.umanitoba.ca/~psgendb/doc/Castresana/Gblocks_documentation.html) > Talavera, G., & Castresana, J. (2007). Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Systematic biology, 56(4), 564-577. - [IQTREE](http://www.iqtree.org/) > Minh, B. Q., Schmidt, H. A., Chernomor, O., Schrempf, D., Woodhams, M. D., Von Haeseler, A., & Lanfear, R. (2020). IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Molecular biology and evolution, 37(5), 1530-1534. - [MAFFT](https://mafft.cbrc.jp/alignment/software/) > Katoh, K., & Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular biology and evolution, 30(4), 772-780. - [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/) > Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924. - [RAxML](https://cme.h-its.org/exelixis/web/software/raxml/) > Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics, 30(9), 1312-1313. - [SqCL](https://onlinelibrary.wiley.com/doi/full/10.1111/1755-0998.12681) > Singhal, S., Grundler, M., Colli, G., & Rabosky, D. L. (2017). Squamate conserved loci (Sq CL): a unified set of conserved loci for phylogenomics and population genetics of squamate reptiles. Molecular ecology resources, 17(6), e12-e24. - [Trimmomatic](http://www.usadellab.org/cms/?page=trimmomatic) > Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 30(15), 2114-2120. - [Trinity](https://github.com/trinityrnaseq/trinityrnaseq) > Grabherr, M. G., Haas, B. J., Yassour, M., Levin, J. Z., Thompson, D. A., Amit, I., ... & Regev, A. (2011). Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nature biotechnology, 29(7), 644. - [PEAR](https://cme.h-its.org/exelixis/web/software/pear/) > Zhang, J., Kobert, K., Flouri, T., & Stamatakis, A. (2014). PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics, 30(5), 614-620. ## Software packaging/containerisation tools - [Anaconda](https://anaconda.com) > Anaconda Software Distribution. Computer software. Vers. 2-2.4.0. Anaconda, Nov. 2016. Web. - [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/) > Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506. - [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/) > da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671. - [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241) - [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/) > Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.
GitHub Events
Total
- Issues event: 4
- Member event: 2
- Issue comment event: 14
- Push event: 5
- Pull request review event: 4
- Pull request event: 8
- Fork event: 1
Last Year
- Issues event: 4
- Member event: 2
- Issue comment event: 14
- Push event: 5
- Pull request review event: 4
- Pull request event: 8
- Fork event: 1
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 3
- Total pull requests: 3
- Average time to close issues: 21 days
- Average time to close pull requests: 4 days
- Total issue authors: 2
- Total pull request authors: 1
- Average comments per issue: 3.67
- Average comments per pull request: 1.0
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 3
- Pull requests: 3
- Average time to close issues: 21 days
- Average time to close pull requests: 4 days
- Issue authors: 2
- Pull request authors: 1
- Average comments per issue: 3.67
- Average comments per pull request: 1.0
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- stiatragul (3)
- njdowdy (2)
- IanGBrennan (2)
- ziadbkh (2)
- Phismil (1)
- SaunakP25 (1)
Pull Request Authors
- SiddhantAttavar (3)
- IanGBrennan (1)
- ziadbkh (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- mshick/add-pr-comment v1 composite
- actions/checkout v3 composite
- nf-core/setup-nextflow v1 composite
- actions/checkout v3 composite
- actions/setup-node v3 composite
- actions/checkout v3 composite
- actions/setup-node v3 composite
- actions/setup-python v4 composite
- actions/upload-artifact v3 composite
- mshick/add-pr-comment v1 composite
- nf-core/setup-nextflow v1 composite
- psf/black stable composite
- dawidd6/action-download-artifact v2 composite
- marocchino/sticky-pull-request-comment v2 composite