GitHub
tyche
Randomly sub-sample sequencing reads to a specified coverage or number of bases.
dynamic-recommendation
Explore methods for dynamic recommendation systems for CSE543. Authored by David Wang, Shirley Li, Kathleen Weng, Donghong Cai, Xinhang Yuan.
ml4bio-workshop
Materials for a workshop introducing machine learning to biologists
msf-practica3
Modelado de Sistemas Fisiológicos. Práctica 3: Sistema Cardiovascular
iot-processing-architecture
Architecture for processing and storing Internet of Things (IoT) data in a scalable way.
segmentation-guided-diffusion
[MICCAI 2024] Easy diffusion models (optionally with segmentation guidance) for medical images and beyond.
wwtp-energy-tariffs
Electricity and natural gas rates of the 100 largest wastewater treatment plants in the United States
pfcampari
Analysis of CaMPARI measurements during the 75th ESA Parabolic Flight Campaign 2021
scaffold_ai
Scaffold AI: Curriculum Recommendation Tool for Sustainability and Climate Resilience
atomgs
[BMVC'24] Official implementation for the paper "AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field"
trends-cd4-vl
Trends in CD4 and viral load testing practices in Southern Africa.
treatall-cd4-vl
Impact of Treat-All policy on CD4 and viral load testing practices in Southern Africa.
llammlein
Repo used to train our German, decoder-only LLäMmlein model family from scratch. We currently provide three model sizes — 120M, 1B, and 7B — all trained on the same curated dataset for consistent scaling comparisons. All components of the training process, including code, datasets, and intermediate checkpoints are openly released.
bartabsa-plusplus
Code for the BARTABSA++ Paper presented at the XLLM@ACL 2025 workshop.
swell-ml-reproducibility
Reproducibility package for domain-informed machine learning model predicting swell potential of expansive soils.
https://github.com/google-research/s4l
Tensorflow implementation of S4L: Self-Supervised Semi-Supervised Learning
actnow.jl
Files and data to replicate the results of the Act Now series of papers.
rnaseq
RNA sequencing analysis pipeline using STAR, RSEM, HISAT2 or Salmon with gene/isoform counts and extensive quality control.
https://github.com/google-research/weatherbenchx
A modular framework for evaluating weather forecasts
time-series-forcasting-benchmark-dataset-preprocessing
Benchmark Datasets for Time Series Forecasting Preprocessing - NASA HTTP Dataset, WorldCup98 Dataset
frisbee
Kubernetes-native framework for declarative testing of distributed systems.
pandas-fnma
Cleaner version of pandas v1.5.2 for python3 users who are unable to upgrade to python4
phoenix
🔥🐦🔥PHoeNIx: a platform agnostic pipeline for healthcare-associated and antimicrobial resistant pathogens
org.hipparchus:hipparchus
An efficient, general-purpose mathematics components library in the Java programming language
wprautoencoders
This is one of Petrobras' open repositories on GitHub. It contains the WPRAutoencoders project which encompasses a wellbore pressure response generator, a dataset of 20.000 synthetic pressure responses and an autoencoder neural network capable of clustering this data based on transmissibility and reservoir geometry.
solcore5
A multi-scale, python-based library for the modelling of solar cells and semiconductor materials
@stdlib/assert-is-complex64vector-like
Test if a value is a 1-dimensional ndarray-like object containing single-precision complex floating-point numbers.
awebox
Modelling and optimal control of single- and multiple-kite systems for airborne wind energy
eo-datascience-cookbook
Earth Observation Data Science Cookbook provides training material centered around Earth Observation data while honoring the Pangeo Philosophy. The examples used in the notebooks represent some of the main research lines of the Remote Sensing Unit at the Department of Geodesy and Geoinformation at the TU Wien (Austria).
hegsrr-template
Template documents and packages for study pre-registration, reproductions, and replications
jailbreakbench
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]