graphsim
graphsim: An R package for simulating gene expression data from graph structures of biological pathways - Published in JOSS (2020)
TipToft
TipToft: detecting plasmids contained in uncorrected long read sequencing data - Published in JOSS (2019)
Multilocus sequence typing by blast from de novo assemblies against PubMLST
Multilocus sequence typing by blast from de novo assemblies against PubMLST - Published in JOSS (2016)
Pynteny
Pynteny: a Python package to perform synteny-aware, profile HMM-based searches in sequence databases - Published in JOSS (2023)
org.biojava
:book::microscope::coffee: BioJava is an open-source project dedicated to providing a Java library for processing biological data.
cazy-webscraper
Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases.
biocommons.seqrepo
non-redundant, compressed, journalled, file-based storage for biological sequences
adapt-diagnostics
A package for designing activity-informed nucleic acid diagnostics for viruses.
jcvi
Python library to facilitate genome assembly, annotation, and comparative genomics
OpenOmics
OpenOmics: A bioinformatics API to integrate multi-omics datasets and interface with public databases. - Published in JOSS (2021)
sarek
Analysis pipeline to detect germline or somatic variants (pre-processing, variant calling and annotation) from WGS / targeted sequencing
nplinker
A python framework for microbial natural products data mining by integrating genomics and metabolomics data
msprime
Simulate genealogical trees and genomic sequence data using population genetic models
filtersam
Tools to filter SAM/BAM files by percent identity and percent of matched sequence
circdna
Pipeline for the identification of extrachromosomal circular DNA (ecDNA) from Circle-seq, WGS, and ATAC-seq data that were generated from cancer and other eukaryotic cells.
zol
zol (& fai): large-scale targeted detection and evolutionary investigation of gene clusters (i.e. BGCs, phages, etc.)
smoove
structural variant calling and genotyping with existing tools, but, smoothly.
staramr
Scans genome contigs against the ResFinder, PlasmidFinder, and PointFinder databases.
cnr-flow
CUT&RUN-Flow, A Nextflow pipeline for QC, tag trimming, normalization, and peak calling for data from CUT&RUN experiments.
divbrowse
A web application for interactive visualization and exploratory data analysis of variant call matrices
circrna
circRNA quantification, differential expression analysis and miRNA target prediction of RNA-Seq data
pycirclize
Circular visualization in Python (Circos Plot, Chord Diagram, Radar Chart)
data-management-resources
Repo supporting the manuscript 'Journeying towards best practice data management in biodiversity genomics'.
pymsaviz
MSA(Multiple Sequence Alignment) visualization python package for sequence analysis
recognizer
A tool for domain based annotation with databases from the Conserved Domains Database
cogclassifier
A tool for classifying prokaryote protein sequences into COG(Cluster of Orthologous Genes) functional category
amr
AMRFinderPlus - Identify AMR genes and point mutations, and virulence and stress resistance genes in assembled bacterial nucleotide and protein sequence.
ensemblgenedownload
Nextflow DSL2 pipeline to download gene annotations from Ensembl.
hmnfusion
A tool to aggregate results of fusion produced by Genefuse and Lumpy and calculate allelic frequency
assemblebac
avantonder/assembleBAC is a bioinformatics best-practise analysis pipeline for assembling and annotating bacterial genomes.
readmapping
Nextflow DSL2 pipeline to align short and long reads to genome assembly. This workflow is part of the Tree of Life production suite.
genealignment
An NF-core pipeline to alignment genesets against a given reference genome.
snakemake_rnaseq
A Snakemake pipeline to go from fastq mRNA sequencing files to raw and normalised counts (usable for downstream EDA and differential analysis)
veba
A modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes
hipiler
Visual exploration of large genome interaction matrices with interactive small multiples.
rich_directrna
Nanopore directRNA (hopefully cDNA too ;)) workflow with many transcript reconstruction alternatives
hictkpy
Python bindings for hictk: read and write .cool and .hic files directly from Python
scgdb-data
SCGDB®-Data: A manually curated dedicated schizosaccharomyces comparative genome database.
recimap
Recimap (a reciprocal mapping tool) was developed as a bioinformatics command-line tool/pipeline to find rearrangements breakpoints between two closely related genomes.
llfsrnaseq
An R package which stores the code used to process Long Life Family Study RNA sequencing from fastq to analysis releases
chemcpa
Code for "Predicting Cellular Responses to Novel Drug Perturbations at a Single-Cell Resolution", NeurIPS 2022.
mdeepfri
Pipeline for searching and aligning contact maps for proteins, then running DeepFri's GCN.
nplinker-webapp
The NPLinker web app, built with Plotly Dash, for visualizing NPLinker predictions
fetch_ngs
Workflow to Fetch Public Sequencing Data and Metadata Using iSeq and MrBiomics Module.
computationalgenomicsmanual
Robs manual for the computational genomics and bioinformatics class.
rseb
An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.
ensemblrepeatdownload
Nextflow DSL2 pipeline to download repeat annotations from Ensembl.
somalier
fast sample-swap and relatedness checks on BAMs/CRAMs/VCFs/GVCFs... "like damn that is one smart wine guy"
2024-04-25-rutgers
Genomics Carpentries Workshop at Rutgers University, April 25-26, 2024
islandevolution.github.io
Source code for the Island biology, diversification, genome evolution lab