GitHub
cosima-recipes
A cookbook 📒 of recipes (i.e., examples) for analysing ocean and sea ice model output. 👩🏽🍳🌊👨🏻🍳
bluemira
Bluemira is an integrated inter-disciplinary design tool for future fusion reactors. It incorporates several modules, some of which rely on other codes, to carry out a range of typical conceptual fusion reactor design activities.
torchsim-hepba
High-entropy Prussian blue analogues atomistic modeling with ML potentials.
afh-model
ECLIPSE: Systematic falsification framework for consciousness science. Provides rigorous validation protocols preventing post-hoc rationalization in consciousness research.
tyche
Randomly sub-sample sequencing reads to a specified coverage or number of bases.
dynamic-recommendation
Explore methods for dynamic recommendation systems for CSE543. Authored by David Wang, Shirley Li, Kathleen Weng, Donghong Cai, Xinhang Yuan.
ml4bio-workshop
Materials for a workshop introducing machine learning to biologists
msf-practica3
Modelado de Sistemas Fisiológicos. Práctica 3: Sistema Cardiovascular
iot-processing-architecture
Architecture for processing and storing Internet of Things (IoT) data in a scalable way.
segmentation-guided-diffusion
[MICCAI 2024] Easy diffusion models (optionally with segmentation guidance) for medical images and beyond.
wwtp-energy-tariffs
Electricity and natural gas rates of the 100 largest wastewater treatment plants in the United States
pfcampari
Analysis of CaMPARI measurements during the 75th ESA Parabolic Flight Campaign 2021
scaffold_ai
Scaffold AI: Curriculum Recommendation Tool for Sustainability and Climate Resilience
atomgs
[BMVC'24] Official implementation for the paper "AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field"
trends-cd4-vl
Trends in CD4 and viral load testing practices in Southern Africa.
treatall-cd4-vl
Impact of Treat-All policy on CD4 and viral load testing practices in Southern Africa.
llammlein
Repo used to train our German, decoder-only LLäMmlein model family from scratch. We currently provide three model sizes — 120M, 1B, and 7B — all trained on the same curated dataset for consistent scaling comparisons. All components of the training process, including code, datasets, and intermediate checkpoints are openly released.
bartabsa-plusplus
Code for the BARTABSA++ Paper presented at the XLLM@ACL 2025 workshop.
swell-ml-reproducibility
Reproducibility package for domain-informed machine learning model predicting swell potential of expansive soils.
https://github.com/google-research/s4l
Tensorflow implementation of S4L: Self-Supervised Semi-Supervised Learning
actnow.jl
Files and data to replicate the results of the Act Now series of papers.
rnaseq
RNA sequencing analysis pipeline using STAR, RSEM, HISAT2 or Salmon with gene/isoform counts and extensive quality control.
https://github.com/google-research/weatherbenchx
A modular framework for evaluating weather forecasts
time-series-forcasting-benchmark-dataset-preprocessing
Benchmark Datasets for Time Series Forecasting Preprocessing - NASA HTTP Dataset, WorldCup98 Dataset
frisbee
Kubernetes-native framework for declarative testing of distributed systems.
pandas-fnma
Cleaner version of pandas v1.5.2 for python3 users who are unable to upgrade to python4
phoenix
🔥🐦🔥PHoeNIx: a platform agnostic pipeline for healthcare-associated and antimicrobial resistant pathogens
org.hipparchus:hipparchus
An efficient, general-purpose mathematics components library in the Java programming language