Projects | Open Source Science

Updated 11 months ago

https://github.com/hpcaitech/colossalai • Rank 30.1 • Science 59%

Making large AI models cheaper, faster and more accessible

ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism

Updated 11 months ago

torchxrayvision • Rank 18.5 • Science 67%

TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.

chest-radiographs chest-xray chest-xray-images cxr cxr-images dataset deep-learning foundation-models image-classification machine-learning medical medical-ai medical-application medical-image-analysis medical-image-processing medical-imaging pytorch torchxrayvision transfer-learning

Updated 11 months ago

hls-foundation-os • Rank 8.2 • Science 77%

This repository contains examples of fine-tuning Harmonized Landsat and Sentinel-2 (HLS) Prithvi foundation model.

foundation-models remote-sensing

Updated 11 months ago

chronos-forecasting • Rank 20.5 • Science 54%

Chronos: Pretrained Models for Probabilistic Time Series Forecasting

artificial-intelligence forecasting foundation-models huggingface huggingface-transformers large-language-models llm machine-learning pretrained-models time-series time-series-forecasting timeseries transformers

Updated 11 months ago

mattersim • Rank 21.5 • Science 46%

MatterSim: A deep learning atomistic model across elements, temperatures and pressures.

ai4materials ai4science computational-materials-science foundation-models machine-learning machine-learning-force-field materials-science mlff

Updated 11 months ago

aurora • Rank 17.9 • Science 49%

Implementation of the Aurora model for Earth system forecasting

atmospheric-chemistry aurora-model deep-learning foundation-models ocean-waves tropical-cyclone-tracking weather-prediction

Updated 11 months ago

https://github.com/priorlabs/tabpfn-client • Rank 17.9 • Science 49%

⚡ Easy API access to the tabular foundation model TabPFN ⚡

data-science foundation-models machine-learning tabpfn tabular-data

Updated 11 months ago

https://github.com/modelscope/data-juicer • Rank 19.2 • Science 46%

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

data data-analysis data-pipeline data-processing data-science data-visualization foundation-models instruction-tuning large-language-models llm llms multi-modal pre-training synthetic-data

Updated 11 months ago

autodistill • Rank 19.6 • Science 44%

Images to inference with no labeling (use foundation models to train supervised models).

auto-labeling computer-vision deep-learning foundation-models grounding-dino image-annotation image-classification instance-segmentation labeling-tool machine-learning model-distillation multimodal object-detection pytorch segment-anything yolov5 yolov8

Updated 11 months ago

lmms-finetune • Rank 7.1 • Science 54%

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

finetuning foundation-models instruction-tuning large-language-model large-multimodal-models llava llava-next multimodal multimodal-large-language-models qwen-vl vision-language visual-instruction-tuning

Updated 11 months ago

py-alpaca-eval • Rank 12.6 • Science 46%

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

deep-learning evaluation foundation-models instruction-following large-language-models leaderboard nlp rlhf

Updated 11 months ago

medfm_submission_only • Rank 0.7 • Science 54%

Solution for NeurIPS 2023 - MedFM Challenge

artificial-intelligence fine-tuning finetuning foundation-model foundation-models image-classification image-classifier machine-learning machine-learning-algorithms medfm medfm-competition medical-image-classification neurips-2023 python pytorch transfer-learning vision-transformer vision-transformer-image-classification vision-transformer-models

Updated 11 months ago

lrv-instruction • Rank 5.7 • Science 41%

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

chatgpt evaluation evaluation-metrics foundation-models gpt gpt-4 hallucination iclr iclr2024 llama llava multimodal object-detection prompt-engineering vicuna vision vision-and-language vqa

Updated 11 months ago

https://github.com/bowang-lab/bioreason • Rank 5.8 • Science 36%

BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model

bioinformatics computational-biology dna foundation-models grpo large-language-models reasoning

Updated 11 months ago

https://github.com/bowang-lab/ecg-fm • Rank 5.0 • Science 36%

An electrocardiogram analysis foundation model.

electrocardiogram foundation-models healthcare machine-learning mimic-iv-ecg physionet2021 transformer

Updated 11 months ago

https://github.com/biomedsciai/biomed-multi-omic • Rank 4.4 • Science 36%

Build foundation model on RNA or DNA data

foundation-models genomics transcriptomics transformers

Updated 11 months ago

spatialfusion-lm • Science 26%

SpatialFusion-LM is a real-time spatial reasoning framework that combines neural depth, 3D reconstruction, and language-driven scene understanding.

3d-estimation computer-vision depth-estimation foundation-models mllm multimodal-llm point-clouds robotics scene-understanding spatial-intelligence stereo-vision transformer vision-language-model vision-transformer zero-shot-learning

Updated 11 months ago

https://github.com/dc-research/tempo • Science 36%

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.

forecasting forecasting-models forecasting-time-series foundation-models gpt pretrained-language-model pretrained-models time-series time-series-analysis transformer transformers transformers-models

Updated 11 months ago

https://github.com/ai4co/awesome-fm4co • Science 36%

Recent research papers about Foundation Models for Combinatorial Optimization

awesome awesome-list combinatorial-optimization foundation-models lists machine-learning neural-combinatorial-optimization

Updated 11 months ago

https://github.com/ai4co/routefinder • Science 36%

[ICML'24 FM-Wild Oral] RouteFinder: Towards Foundation Models for Vehicle Routing Problems

ai4co foundation-models neural-combinatorial-optimization rl4co transformers vehicle-routing-problem

Updated 11 months ago

https://github.com/biodt/bfm-model • Science 36%

Multi-modal Foundation Model for Biodiversity dynamics forecasting

artificial-intelligence biodiversity ecology foundation-models modelling multi-modal representation-learning

Updated 11 months ago

git • Science 54%

[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

foundation-models perception transformer unified vision-and-language vision-transformer

Updated 11 months ago

foundation-model-benchmarking-tool • Science 54%

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.

bedrock benchmark benchmarking deepseek deepseek-r1 evaluation-metrics foundation-models g5 g6 g6e generative-ai inferentia llama2 llama3 p4d p5 sagemaker trainium

Updated 11 months ago

spectrae • Science 57%

SPECTRA: Spectral framework for evaluation of biomedical AI models

benchmark-framework benchmarking foundation-models geometric-deep-learning large-language-models protein-language-model protein-sequences protein-structure

Updated 11 months ago

text2earth • Science 67%

[IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model"

foundation-models image-generation remote-sensing vision-language

Updated 11 months ago

https://github.com/amazon-science/lc-plm • Science 23%

LC-PLM: long-context protein language model based on BiMamba-S architecture

biology foundation-models foundation-models-for-biology huggingface mamba-state-space-models pretrained-models protein protein-sequences

Updated 11 months ago

https://github.com/alleninstitute/biomolvec • Science 13%

Notebooks and scripts used for the Nautilex Hackathon

foundation-models generative-model genes multimodal

Updated 11 months ago

https://github.com/chen-yang-liu/awesome-rs-spatiotemporal-vlms • Science 49%

🔥Remote Sensing Spatio-Temporal Vision-Language Models: A Comprehensive Survey

change-detetion foundation-models large-language-models remote-sensing spatio-temporal-analysis vision-language

Updated 11 months ago

awesome-llm-papers.github.io • Science 26%

A curated collection of the most impactful papers, tools, and resources on Large Language Models (LLMs). Continuously updated to help researchers, developers, and enthusiasts stay on top of LLM advancements.

academic-resources ai-research curated-papers deep-learning foundation-models generative-ai language-models large-language-models llm-tools llms machine-learning nlp open-source transformer-models

Updated 11 months ago

awesome-japanese-llm • Science 75%

日本語LLMまとめ - Overview of Japanese LLMs

foundation-models generative-ai generative-model generative-models japanese japanese-language japanese-language-model japanese-llm language-model language-models large-language-model large-language-models llm llm-japanese llms multimodal vision-and-language vision-language vision-language-model

Updated 11 months ago

uni-vision • Science 44%

Python library for handling most vision tasks .\

3d-reconstruction classification depth-estimation detection foundation-models gan instance-segmentation ocr pose-estimation segmentation superres tracking vision-rag vlm

Updated 11 months ago

awesome-robotics-3d • Science 36%

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

3d benchmarks computer-vision diffusion-models foundation-models gaussian-splatting grasping llm manipulation navigation nerf pointclouds policy-learning pretraining robotics scene-graph simulations vision-language-model vlm

Updated 11 months ago

glip • Science 54%

Centered Masking for Language-Image Pre-training

deep-learning foundation-models image-masking multimodal-learning representation-learning zero-shot-classification

Updated 11 months ago

https://github.com/amazon-science/fmcore • Science 26%

Running GenAI models at every scale, on every modality

ai-research artificial-intelligence computer-vision foundation-models genai machine-learning multi-modal natural-language-processing scientific-computing

Updated 11 months ago

helical • Science 57%

A framework for state-of-the-art pre-trained bio foundation models on genomics and transcriptomics modalities.

artificial-intelligence bioinformatics biology deep-learning dna-sequences evo2 foundation-models gene-expression geneformer helixmrna pre-trained-model pre-training rna rna-seq rnaseq scgpt transcriptformer transformer uce vcf

Updated 11 months ago

https://github.com/biomedsciai/gene-benchmark • Science 36%

Benchmark gene representations from different model families

benchmarking foundation-models foundation-models-for-biology representation-learning