gtdbtk

GTDB-Tk: a toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes.

https://github.com/ecogenomics/gtdbtk

Science Score: 59.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 12 DOI reference(s) in README
  • Academic publication links
    Links to: ncbi.nlm.nih.gov, nature.com
  • Committers with academic emails
    2 of 22 committers (9.1%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.8%) to scientific vocabulary

Keywords

archaea bacteria bioinformatics metagenomics nomenclature phylogenetics species-assignments taxonomy

Keywords from Contributors

microbiome
Last synced: 6 months ago · JSON representation

Repository

GTDB-Tk: a toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes.

Basic Info
Statistics
  • Stars: 541
  • Watchers: 20
  • Forks: 90
  • Open Issues: 25
  • Releases: 44
Topics
archaea bacteria bioinformatics metagenomics nomenclature phylogenetics species-assignments taxonomy
Created about 9 years ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License Code of conduct

README.md

GTDB-Tk

PyPI PyPI Downloads Bioconda BioConda Downloads Docker Image Version (latest by date) Docker Pulls

GTDB-Tk is a software toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes based on the Genome Database Taxonomy (GTDB). It is designed to work with recent advances that allow hundreds or thousands of metagenome-assembled genomes (MAGs) to be obtained directly from environmental samples. It can also be applied to isolate and single-cell genomes. The GTDB-Tk is open source and released under the GNU General Public License (Version 3).

Notifications about GTDB-Tk releases will be available through the GTDB Twitter account and the GTDB Announcements Forum.

Please post questions and issues related to GTDB-Tk on the Issues section of the GitHub repository. Questions related to the GTDB can be posted on the GTDB Forum or sent to the GTDB team.

🚀 Getting started

Be sure to check the hardware requirements, then choose your preferred method:

📖 Documentation

Documentation for GTDB-Tk can be found here.

✨ New Features

GTDB-Tk v2.5.0+ includes the following new features: - GTDB-Tk now uses skani exclusively for genome clustering, replacing the previous mash/skani hybrid approach.
As a result, the mutually exclusive required options --mash_db and --skip_ani_screen have been removed. The --skip_ani_screen flag has now been changed to an optional parameter.
⚠️This change is not backward-compatible and may break existing pipelines or scripts that rely on these options.

📈 Performance

Using ANI screen "can" reduce computation by >50%, although it depends on the set of input genomes. A set of input genomes consisting primarily of new species will not benefit from ANI screen as much as a set of genomes that are largely assigned to GTDB species clusters. In the latter case, the ANI screen will reduce the number of genomes that need to be classified by pplacer which reduces computation time substantially (between 25% and 60% in our testing).

📚 References

GTDB-Tk is described in:

The Genome Taxonomy Database (GTDB) is described in:

We strongly encourage you to cite the following 3rd party dependencies:

© Copyright

Copyright 2017 Pierre-Alain Chaumeil. See LICENSE for further details.

Owner

  • Name: Australian Centre for Ecogenomics
  • Login: Ecogenomics
  • Kind: organization
  • Location: University of Queensland, Australia

GitHub Events

Total
  • Create event: 3
  • Release event: 2
  • Issues event: 71
  • Watch event: 59
  • Issue comment event: 138
  • Push event: 36
  • Pull request event: 10
  • Gollum event: 3
  • Fork event: 6
Last Year
  • Create event: 3
  • Release event: 2
  • Issues event: 71
  • Watch event: 59
  • Issue comment event: 138
  • Push event: 36
  • Pull request event: 10
  • Gollum event: 3
  • Fork event: 6

Committers

Last synced: over 1 year ago

All Time
  • Total Commits: 759
  • Total Committers: 22
  • Avg Commits per committer: 34.5
  • Development Distribution Score (DDS): 0.613
Past Year
  • Commits: 27
  • Committers: 4
  • Avg Commits per committer: 6.75
  • Development Distribution Score (DDS): 0.185
Top Committers
Name Email Commits
Aaron Mussig a****g@g****m 294
pchaumeil u****m@u****u 213
Donovan Parks d****s@g****m 162
pchaumeil p****l@g****m 49
pchaumeil p****l@q****g 24
Cameron Hyde 4****t 1
Donovan 9****s 1
Florian Plaza Oñate f****a 1
Linda Fenske 1****3 1
Nicolai Søborg N****g 1
Samuel Aroney 4****S 1
Valentyn Bezshapkin 6****3 1
just.in.lee 3****6 1
tr11-sanger 9****r 1
Asaf Peer A****r@j****g 1
Ben Woodcroft b****t@g****m 1
Daniel McDonald d****d@u****u 1
Davide Albanese d****e@g****m 1
Florian Plaza Oñate f****e@i****r 1
Malte Rühlemann m****n@g****m 1
Mingye Wang a****6@g****m 1
Moritz m****k@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 204
  • Total pull requests: 46
  • Average time to close issues: 3 months
  • Average time to close pull requests: 9 days
  • Total issue authors: 161
  • Total pull request authors: 10
  • Average comments per issue: 3.03
  • Average comments per pull request: 0.28
  • Merged pull requests: 39
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 47
  • Pull requests: 9
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 1 minute
  • Issue authors: 41
  • Pull request authors: 3
  • Average comments per issue: 1.68
  • Average comments per pull request: 0.0
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • saras224 (5)
  • zwets (4)
  • donovan-h-parks (3)
  • lfenske-93 (3)
  • jolespin (3)
  • luwenyi111 (3)
  • wanxn518 (3)
  • pchaumeil (3)
  • nick-youngblut (3)
  • wwood (3)
  • lyisrae1 (2)
  • sarehaghababaee (2)
  • JiangweiPan1230 (2)
  • jianshu93 (2)
  • aaronmussig (2)
Pull Request Authors
  • pchaumeil (37)
  • MartinVad (2)
  • juanvillada (2)
  • donovan-h-parks (2)
  • aaronmussig (2)
  • Artoria2e5 (2)
  • wasade (2)
  • wwood (1)
  • AroneyS (1)
  • neoformit (1)
Top Labels
Issue Labels
error (100) next version (20) enhancement (9) stale (7) question (2) suggestion (2) bug (1)
Pull Request Labels
script (1)

Packages

  • Total packages: 2
  • Total downloads:
    • pypi 389 last-month
  • Total docker downloads: 475
  • Total dependent packages: 0
    (may contain duplicates)
  • Total dependent repositories: 2
    (may contain duplicates)
  • Total versions: 53
  • Total maintainers: 4
pypi.org: gtdbtk

A toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes.

  • Versions: 50
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Downloads: 389 Last month
  • Docker Downloads: 475
Rankings
Docker downloads count: 2.2%
Stargazers count: 3.3%
Forks count: 5.0%
Dependent packages count: 7.3%
Average: 7.4%
Dependent repos count: 11.8%
Downloads: 14.9%
Maintainers (3)
Last synced: 6 months ago
spack.io: py-gtdbtk

GTDB-Tk is a software toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes based on the Genome Database Taxonomy (GTDB).

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent repos count: 0.0%
Stargazers count: 12.8%
Forks count: 14.1%
Average: 21.0%
Dependent packages count: 57.3%
Maintainers (1)
Last synced: 6 months ago

Dependencies

.github/workflows/master-push.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
.github/workflows/release-publish.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
  • docker/build-push-action v2 composite
  • docker/login-action v1 composite
  • docker/setup-buildx-action v1 composite
  • docker/setup-qemu-action v1 composite
Dockerfile docker
  • python 3.8-slim-bullseye build
docs/requirements.txt pypi
  • jupyter *
  • linuxdoc ==20211220
  • matplotlib *
  • nbsphinx *
  • recommonmark *
  • sphinx *
  • sphinx-argparse *
  • sphinx-rtd-theme *
  • sphinx-sitemap *
setup.py pypi
  • dendropy >=4.1.0
  • pydantic >=1.9.2,<2.0a1
  • tqdm >=4.35.0