https://github.com/hpcaitech/colossalai
Making large AI models cheaper, faster and more accessible
torchxrayvision
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
hls-foundation-os
This repository contains examples of fine-tuning Harmonized Landsat and Sentinel-2 (HLS) Prithvi foundation model.
chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
mattersim
MatterSim: A deep learning atomistic model across elements, temperatures and pressures.
https://github.com/priorlabs/tabpfn-client
⚡ Easy API access to the tabular foundation model TabPFN ⚡
https://github.com/modelscope/data-juicer
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
autodistill
Images to inference with no labeling (use foundation models to train supervised models).
lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
py-alpaca-eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
medfm_submission_only
Solution for NeurIPS 2023 - MedFM Challenge
lrv-instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
https://github.com/bowang-lab/bioreason
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model
https://github.com/bowang-lab/ecg-fm
An electrocardiogram analysis foundation model.
https://github.com/biomedsciai/biomed-multi-omic
Build foundation model on RNA or DNA data
awesome-robotics-3d
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
text2earth
[IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model"
https://github.com/amazon-science/lc-plm
LC-PLM: long-context protein language model based on BiMamba-S architecture
https://github.com/amazon-science/fmcore
Running GenAI models at every scale, on every modality
https://github.com/chen-yang-liu/awesome-rs-spatiotemporal-vlms
🔥Remote Sensing Spatio-Temporal Vision-Language Models: A Comprehensive Survey
awesome-llm-papers.github.io
A curated collection of the most impactful papers, tools, and resources on Large Language Models (LLMs). Continuously updated to help researchers, developers, and enthusiasts stay on top of LLM advancements.
https://github.com/biomedsciai/gene-benchmark
Benchmark gene representations from different model families
https://github.com/biodt/bfm-model
Multi-modal Foundation Model for Biodiversity dynamics forecasting
git
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
https://github.com/ai4co/awesome-fm4co
Recent research papers about Foundation Models for Combinatorial Optimization
https://github.com/ai4co/routefinder
[ICML'24 FM-Wild Oral] RouteFinder: Towards Foundation Models for Vehicle Routing Problems
spatialfusion-lm
SpatialFusion-LM is a real-time spatial reasoning framework that combines neural depth, 3D reconstruction, and language-driven scene understanding.
https://github.com/alleninstitute/biomolvec
Notebooks and scripts used for the Nautilex Hackathon
foundation-model-benchmarking-tool
Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.
https://github.com/dc-research/tempo
The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.
helical
A framework for state-of-the-art pre-trained bio foundation models on genomics and transcriptomics modalities.