Rasusa
Rasusa: Randomly subsample sequencing reads to a specified coverage - Published in JOSS (2022)
mrivis
mrivis: Medical image visualization library for neuroscience in python - Published in JOSS (2018)
Lexicon-Mono-Seq, DOM Text Based Async MSA Viewer
Lexicon-Mono-Seq, DOM Text Based Async MSA Viewer - Published in JOSS (2019)
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Recan
Recan: Python tool for analysis of recombination events in viral genomes - Published in JOSS (2020)
filtersam
Tools to filter SAM/BAM files by percent identity and percent of matched sequence
align-cli
A CLI for pairwise alignment of sequences, using both normal and mass based alignment.
callsync
R package to align recordings, detect, assign, trace and analyse vocalisations
astred
An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.
https://github.com/bertsky/nmalign
forced alignment of lists of string by fuzzy string matching
visualqc
VisualQC : assistive tool to ease the quality control workflow of neuroimaging data.
https://github.com/astrazeneca/onto_merger
OntoMerger is an ontology alignment library for deduplicating knowledge graph nodes that represent the same domain.
nrex
nRex: Germline and somatic single-nucleotide, short indel and structural variant calling
https://github.com/astorfi/llm-alignment-project
A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learning, and more. Build your own customizable LLM alignment solution with ease.
fluentdna
FluentDNA allows you to browse sequence data of any size using a zooming visualization similar to Google Maps. You can use FluentDNA as a standalone program or as a python module for your own bioinformatics projects.
nf-fastq2bam
Nextflow pipeline to convert FASTQ files to BAM for WGS and RNA-seq data
microaligner
Image registration (alignment) software for large microscopy images
goombay
Python implementation of several sequence alignment algorithms such as Waterman-Smith-Beyer, Gotoh, and Needleman-Wunsch intended to calculate distance, show alignment, and display the underlying matrices.
relationship-between-auditory-and-semantic-entrainment
Relationship between auditory and semantic entrainmnet
https://github.com/cyberagentailab/annotation-efficient-po
Code of "Annotation-Efficient Preference Optimization for Language Model Alignment"
https://github.com/cyberagentailab/filtered-dpo
Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lower-quality samples compared to those generated by the learning model
https://github.com/bencardoen/smlmtools.jl
Computational tools for single molecule localization / superresolution microscopy (point clouds).
tomobear
TomoBEAR is a configurable and customizable modular pipeline for streamlined processing of cryo-electron tomographic data for subtomogram averaging.
evolution-by-emergence
LaTeX source of the open‑access book “Evolution by Emergence” on networks, complexity & universal evolution (CC‑BY).
https://github.com/bimberlab/nimble
nimble — execute lightweight, flexible alignments on custom reference libraries
https://github.com/bimberlab/nimble-aligner
nimble-aligner is the backend for nimble, a tool that executes lightweight, flexible alignments to generate supplemental alignment data
dpo-rlhf-paraphrase-types
Enhancing paraphrase-type generation using Direct Preference Optimization (DPO) and Reinforcement Learning from Human Feedback (RLHF), with large-scale HPC support. This project aligns model outputs to human-ranked data for robust, safety-focused NLP.
subaligner
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
https://github.com/broadinstitute/celligner2
A new version of the celligner package using VAEs. Inspired from the Theis lab's scArches.
symbolic-governed-mistral-artifact
Tier-10 sealed governance artifact for Mistral-7B with exact-match benchmarks and symbolic verifier.