gappa

A toolkit for analyzing and visualizing phylogenetic placement data

https://github.com/lczech/gappa

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 9 DOI reference(s) in README
  • Academic publication links
    Links to: plos.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.8%) to scientific vocabulary

Keywords

c-plus-plus evolutionary-placement gappa genesis phylogenetic-data phylogenetic-placement phylogenetics placement
Last synced: 6 months ago · JSON representation ·

Repository

A toolkit for analyzing and visualizing phylogenetic placement data

Basic Info
  • Host: GitHub
  • Owner: lczech
  • License: gpl-3.0
  • Language: C++
  • Default Branch: master
  • Homepage:
  • Size: 11.3 MB
Statistics
  • Stars: 60
  • Watchers: 5
  • Forks: 7
  • Open Issues: 1
  • Releases: 17
Topics
c-plus-plus evolutionary-placement gappa genesis phylogenetic-data phylogenetic-placement phylogenetics placement
Created over 8 years ago · Last pushed about 1 year ago
Metadata Files
Readme License Citation

README.md

Conda install Downloads Release CI
License Softwipe Score DOI <!-- Platforms --> <!-- Language --> <!-- CI --> <!-- Build Status -->

gappa

Features

gappa is a collection of commands for working with phylogenetic data. Its main focus are evolutionary placements of short environmental sequences on a reference phylogenetic tree. Such data are typically produced by tools such as EPA-ng, RAxML-EPA or pplacer, and usually stored in jplace files. <!-- It however also offers some commands for working with data such as sequences or trees. --> See the Wiki pages for the full list of all subcommands and their documentation.

We recommend our article "Metagenomic Analysis Using Phylogenetic Placement — A Review of the First Decade" as an introduction to the topic of phylogenetic placement.

Setup

There are two ways to get gappa:

  1. Install it via conda.
  2. Build it from source.

While conda will only give you proper releases, building from source will give you the latest (development) version. For that, simply get it, and build it:

~~~.sh git clone --recursive https://github.com/lczech/gappa.git cd gappa make ~~~

You can also use the green "Code" button above or click here to download the source as a zip archive. Unpack, and call make in the main directory to build everything.

Requirements:

After building, the executable is stored in the bin directory, and used as follows.

Usage and Documentation

gappa is used via its command line interface, with subcommands for each task. The commands have the general structure:

gappa <module> <subcommand> <options>

See the Wiki pages for the full list of all subcommands and their documentation.

For bug reports and feature requests of gappa, please open an issue on GitHub.

For user support, please see our Phylogenetic Placement Google Group. It is intended for discussions about phylogenetic placement, and for user support for our software tools, such as EPA-ng, gappa, and genesis.

Citation

To generally cite gappa, please use

Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data.
Lucas Czech, Pierre Barbera, and Alexandros Stamatakis.
Bioinformatics, 2020. https://doi.org/10.1093/bioinformatics/btaa070

Each command also prints out the relevant references for that command. Then, the command gappa tools citation can be used to obtain details on those references. See also our Wiki page Citation and References for a list of all references.

Lastly, we recommend reading our comprehensive review of the topic

Metagenomic Analysis Using Phylogenetic Placement—A Review of the First Decade.
Lucas Czech, Alexandros Stamatakis, Micah Dunthorn, and Pierre Barbera.
Frontiers in Bioinformatics, 2022. https://doi.org/10.3389/fbinf.2022.871393

to get an overview of phylogenetic placement and its methods.

Behind the scenes

gappa is short for Genesis Applications for Phylogenetic Placement Analysis. This is because most of the work of gappa is actually performed by our genesis library. See there if you are interested in the implementation details.

Owner

  • Name: Lucas Czech
  • Login: lczech
  • Kind: user
  • Location: Stanford, USA
  • Company: Carnegie Institution for Science

Postdoc in bioinformatics :seedling: and computer scientist :octocat: working on inter-disciplinary ways to save the planet :earth_africa:

Citation (CITATION.cff)

cff-version: 1.2.0
authors:
- family-names: "Czech"
  given-names: "Lucas"
  orcid: "https://orcid.org/0000-0002-1340-9644"
- family-names: "Barbera"
  given-names: "Pierre"
  orcid: "https://orcid.org/0000-0002-3437-150X"
- family-names: "Stamatakis"
  given-names: "Alexandros"
  orcid: "https://orcid.org/0000-0003-0353-0691"
title: "Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data"
doi: 10.1093/bioinformatics/btaa070
url: "http://github.com/lczech/gappa"
preferred-citation:
  type: article
  authors:
  - family-names: "Czech"
    given-names: "Lucas"
    orcid: "https://orcid.org/0000-0002-1340-9644"
  - family-names: "Barbera"
    given-names: "Pierre"
    orcid: "https://orcid.org/0000-0002-3437-150X"
  - family-names: "Stamatakis"
    given-names: "Alexandros"
    orcid: "https://orcid.org/0000-0003-0353-0691"
  doi: "10.1093/bioinformatics/btaa070"
  journal: "Bioinformatics"
  start: 3263 # First page number
  end: 3265 # Last page number
  title: "Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data"
  volume: 36
  issue: 10
  year: 2020

GitHub Events

Total
  • Create event: 1
  • Release event: 1
  • Issues event: 2
  • Watch event: 6
  • Issue comment event: 2
  • Push event: 4
  • Gollum event: 3
Last Year
  • Create event: 1
  • Release event: 1
  • Issues event: 2
  • Watch event: 6
  • Issue comment event: 2
  • Push event: 4
  • Gollum event: 3

Committers

Last synced: 12 months ago

All Time
  • Total Commits: 444
  • Total Committers: 4
  • Avg Commits per committer: 111.0
  • Development Distribution Score (DDS): 0.059
Past Year
  • Commits: 11
  • Committers: 1
  • Avg Commits per committer: 11.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Lucas Czech l****c@s****h 418
Pierre Barbera p****s@g****m 24
Frédéric Mahé f****e@c****r 1
Pierre Barbera p****s@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 29
  • Total pull requests: 2
  • Average time to close issues: 3 months
  • Average time to close pull requests: almost 3 years
  • Total issue authors: 25
  • Total pull request authors: 2
  • Average comments per issue: 4.34
  • Average comments per pull request: 1.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: 5 days
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 1.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • gavinmdouglas (2)
  • erikrikarddaniel (2)
  • ewenewen (2)
  • adriaaula (2)
  • blancobercial (1)
  • andrew-neal (1)
  • vmaffei (1)
  • lczech (1)
  • linshizuowei (1)
  • FWittmers (1)
  • justinshaffer (1)
  • Jigyasa3 (1)
  • boopsboops (1)
  • NailouZhang (1)
  • EricDeveaud (1)
Pull Request Authors
  • frederic-mahe (1)
  • pierrebarbera (1)
Top Labels
Issue Labels
enhancement (1)
Pull Request Labels
enhancement (1)