https://github.com/aaronmussig/gtdb-phylogenetic-diversity

Snakemake pipeline to calculate the phylum phylogenetic diversity of GTDB R220.

https://github.com/aaronmussig/gtdb-phylogenetic-diversity

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.1%) to scientific vocabulary
Last synced: 9 months ago · JSON representation

Repository

Snakemake pipeline to calculate the phylum phylogenetic diversity of GTDB R220.

Basic Info
  • Host: GitHub
  • Owner: aaronmussig
  • License: gpl-3.0
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 10.2 MB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License

README.md

gtdb-phylogenetic-diversity

Snakemake pipeline to calculate the phylogenetic diversity of each phylum in GTDB R220.

Dependencies

The only dependency is Snakemake (version >= 7.32.4) which is assumed to be on the command line.

Installation

Clone the repository:

bash git clone git@github.com:aaronmussig/gtdb-phylogenetic-diversity.git cd gtdb-phylogenetic-diversity

Running the pipeline

To run the pipeline, simply execute the following command:

bash snakemake --use-conda --cores all

Pipeline steps

  1. Trees and taxonomy files for the Archaeal and Bacterial domains are downloaded.
  2. Both trees are scaled by their relative evolutionary divergence (RED) using PhyloRank.
  3. A script will then calculate the phylogenetic diversity of each phylum in GTDB R220 by the sum of their branch lengths.

Both the RED and non-scaled trees will produce values.

Output files are saved under: ./output/

Owner

  • Name: Aaron Mussig
  • Login: aaronmussig
  • Kind: user
  • Location: Brisbane, Australia
  • Company: Australian Centre for Ecogenomics

Bioinformatics PhD student at the University of Queensland. Python and Rust enthusiast.

GitHub Events

Total
Last Year

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels