sourmash v4
sourmash v4: A multitool to quickly search, compare, and analyze genomic and metagenomic data sets - Published in JOSS (2024)
anndata
anndata: Access and store annotated data matrices - Published in JOSS (2024)
khmer release v2.1
khmer release v2.1: software for biological sequence analysis - Published in JOSS (2017)
Pyrodigal
Pyrodigal: Python bindings and interface to Prodigal, an efficient method for gene prediction in prokaryotes - Published in JOSS (2022)
Rasusa
Rasusa: Randomly subsample sequencing reads to a specified coverage - Published in JOSS (2022)
VCFPy
VCFPy: a Python 3 library with good support for both reading and writing VCF - Published in JOSS (2016)
The UCSCXenaTools R package
The UCSCXenaTools R package: a toolkit for accessing genomics data from UCSC Xena platform, from cancer multi-omics to single-cell RNA-seq - Published in JOSS (2019)
ReferenceSeeker
ReferenceSeeker: rapid determination of appropriate reference genomes - Published in JOSS (2020)
Fasten
Fasten: a toolkit for streaming operations on fastq files - Published in JOSS (2024)
TipToft
TipToft: detecting plasmids contained in uncorrected long read sequencing data - Published in JOSS (2019)
Multiblock PLS
Multiblock PLS: Block dependent prediction modeling for Python - Published in JOSS (2019)
Fastsubtrees
Fastsubtrees: simple and efficient subtrees extractions in Python with applications to NCBI taxonomy - Published in JOSS (2022)
TAXPASTA
TAXPASTA: TAXonomic Profile Aggregation and STAndardisation - Published in JOSS (2023)
Pycashier
Pycashier: cash in on DNA barcode tags - Published in JOSS (2024)
Bacting
Bacting: a next generation, command line version of Bioclipse - Published in JOSS (2021)
Contextualized
Contextualized: Heterogeneous Modeling Toolbox - Published in JOSS (2024)
Multilocus sequence typing by blast from de novo assemblies against PubMLST
Multilocus sequence typing by blast from de novo assemblies against PubMLST - Published in JOSS (2016)
AltamISA
AltamISA: a Python API for ISA-Tab files - Published in JOSS (2019)
MicrobiomeR
MicrobiomeR: An R Package for Simplified and Standardized Microbiome Analysis Workflows - Published in JOSS (2019)
Bedparse
Bedparse: feature extraction from BED files - Published in JOSS (2019)
Powering single-cell analyses in the browser with WebAssembly
Powering single-cell analyses in the browser with WebAssembly - Published in JOSS (2023)
Pynteny
Pynteny: a Python package to perform synteny-aware, profile HMM-based searches in sequence databases - Published in JOSS (2023)
cstag and cstag-cli
cstag and cstag-cli: tools for manipulating and visualizing cs tags - Published in JOSS (2024)
Metagenomic classification with KrakenUniq on low-memory computers
Metagenomic classification with KrakenUniq on low-memory computers - Published in JOSS (2022)
Python Implementation of Codon Adaptation Index
Python Implementation of Codon Adaptation Index - Published in JOSS (2018)
SaffronTree
SaffronTree: Fast, reference-free pseudo-phylogenomic trees from reads or contigs. - Published in JOSS (2017)
Lexicon-Mono-Seq, DOM Text Based Async MSA Viewer
Lexicon-Mono-Seq, DOM Text Based Async MSA Viewer - Published in JOSS (2019)
multiqc
Aggregate results from bioinformatics analyses across many samples into a single report.
Rainbow
Rainbow: Automated Air-Liquid Interface Cell Culture Analysis Using Deep Optical Flow - Published in JOSS (2022)
org.biojava
:book::microscope::coffee: BioJava is an open-source project dedicated to providing a Java library for processing biological data.
Recan
Recan: Python tool for analysis of recombination events in viral genomes - Published in JOSS (2020)
Taxonomy Resolver
Taxonomy Resolver: A Python package for building and filtering taxonomy trees - Published in JOSS (2025)
WGS2NCBI - Toolkit for preparing genomes for submission to NCBI
WGS2NCBI - Toolkit for preparing genomes for submission to NCBI - Published in JOSS (2019)
Genotify
Genotify: Fast, lightweight gene lookup and summarization - Published in JOSS (2018)
covtobed
covtobed: a simple and fast tool to extract coverage tracks from BAM files - Published in JOSS (2020)
Phylen
Phylen: automatic phylogenetic reconstruction using the EggNOG database - Published in JOSS (2018)
biotmle
biotmle: Targeted Learning for Biomarker Discovery - Published in JOSS (2017)
ORIS
ORIS: An interactive software tool for prediction of replication origin in prokaryotic genomes - Published in JOSS (2019)
chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R
chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R - Published in JOSS (2026)
HARE
HARE: A Python workflow for analyzing genomic feature enrichment in GWAS datasets - Published in JOSS (2024)
chise
A web application to visualize and edit the pathway models represented by SBGN Process Description Notation
qmflows
This library tackles the construction and efficient execution of computational chemistry workflows
integron_finder
Bioinformatics tool to find integrons in bacterial genomes
rna-tools
🔧rna-tools: a toolbox to analyze sequences, structures and simulations of RNA (and more) used by RNA CASP, RNA PUZZLES, and me ;-) docs @ http://rna-tools.rtfd.io web @ http://rna-tools.online
atropos
An NGS read trimming tool that is specific, sensitive, and speedy. (production)
pathway-mapper
PathwayMapper: An interactive and collaborative graphical curation tool for cancer pathways
pyskani
PyO3 bindings and Python interface to skani, a method for fast genomic identity calculation using sparse chaining.
dockstore
An app store for scientific workflows, tools, notebooks, and services
pyfastani
Cython bindings and Python interface to FastANI, a method for fast whole-genome similarity estimation.
cazy-webscraper
Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases.
gnparser
GNparser normalises scientific names and extracts their semantic elements.
biocommons.seqrepo
non-redundant, compressed, journalled, file-based storage for biological sequences
adapt-diagnostics
A package for designing activity-informed nucleic acid diagnostics for viruses.
https://github.com/theislab/cellrank
CellRank: dynamics from multi-view single-cell data
jcvi
Python library to facilitate genome assembly, annotation, and comparative genomics
chip-atlas
ChIP-Atlas: Browse and analyze all public ChIP/DNase-seq data on your browser
dbdipy
A Python library for the inspection, curation and interpretation of DBDI-MS data.
sigprofilermatrixgenerator
SigProfilerMatrixGenerator creates mutational matrices for all types of somatic mutations. It allows downsizing the generated mutations only to parts for the genome (e.g., exome or a custom BED file). The tool seamlessly integrates with other SigProfiler tools.
biopython
Official git repository for Biopython (originally converted from CVS)
maftools
Summarize, Analyze and Visualize MAF files from TCGA or in-house studies.
cobra
COBRApy is a package for constraint-based modeling of metabolic networks.
minimap2
A versatile pairwise aligner for genomic and spliced nucleotide sequences
glycowork
Package for processing and analyzing glycans and their role in biology.
dbcanlight
A lightweight rewrite of run_dbcan for better multithread performance
efp-seq_browser
An RNA-Seq data exploration tool that shows read map coverage of a gene of interest along with a coloured "electronic fluorescent pictographic" (eFP) based on its RPKM expression level.