Scientific Software
Updated 9 months ago

imodels — Peer-reviewed • Rank 21.0 • Science 100%

imodels: a python package for fitting interpretable models - Published in JOSS (2021)

Scientific Software
Updated 9 months ago

Machine Learning Validation via Rational Dataset Sampling with astartes — Peer-reviewed • Rank 15.6 • Science 100%

Machine Learning Validation via Rational Dataset Sampling with astartes - Published in JOSS (2023)

Engineering (40%)
Scientific Software · Peer-reviewed
Scientific Software
Updated 9 months ago

DSSE — Peer-reviewed • Rank 11.6 • Science 100%

DSSE: An environment for simulation of reinforcement learning-empowered drone swarm maritime search and rescue missions - Published in JOSS (2024)

Scientific Software
Updated 9 months ago

VeridicalFlow — Peer-reviewed • Rank 10.4 • Science 100%

VeridicalFlow: a Python package for building trustworthy data science pipelines with PCS - Published in JOSS (2022)

Scientific Software
Updated 9 months ago

LangFair — Peer-reviewed • Rank 14.9 • Science 95%

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases - Published in JOSS (2025)

Scientific Software
Updated 9 months ago

modelStudio — Peer-reviewed • Rank 13.5 • Science 95%

modelStudio: Interactive Studio with Explanations for ML Predictive Models - Published in JOSS (2019)

Scientific Software
Updated 9 months ago

FuseMedML — Peer-reviewed • Rank 14.8 • Science 93%

FuseMedML: a framework for accelerated discovery in machine learning based biomedicine - Published in JOSS (2023)

Scientific Software
Updated 9 months ago

ollamar — Peer-reviewed • Rank 12.7 • Science 93%

ollamar: An R package for running large language models - Published in JOSS (2025)

Scientific Software · Peer-reviewed
Scientific Software
Updated 9 months ago

BetaML — Peer-reviewed • Rank 10.5 • Science 95%

BetaML: The Beta Machine Learning Toolkit, a self-contained repository of Machine Learning algorithms in Julia - Published in JOSS (2021)

Scientific Software
Updated 9 months ago

RT-utils — Peer-reviewed • Rank 8.5 • Science 95%

RT-utils: A Minimal Python Library for RT-struct Manipulation - Published in JOSS (2025)

Updated 9 months ago

paper-qa • Rank 22.0 • Science 77%

High accuracy RAG for answering questions from scientific documents with citations

Scientific Software
Updated 9 months ago

LGP — Peer-reviewed • Rank 5.8 • Science 93%

LGP: A robust Linear Genetic Programming implementation on the JVM using Kotlin. - Published in JOSS (2019)

Updated 9 months ago

datasets • Rank 34.4 • Science 64%

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Updated 2 months ago

chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R • Rank 8.4 • Science 87%

chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R - Published in JOSS (2026)

Updated 9 months ago

pytorch-lightning • Rank 34.8 • Science 54%

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Updated 9 months ago

terragpu • Rank 3.3 • Science 85%

Python library to process and classify remote sensing imagery by means of GPUs and ML.

Updated 9 months ago

litgpt • Rank 23.6 • Science 64%

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Updated 9 months ago

matsciml • Rank 7.8 • Science 77%

Open MatSci ML Toolkit is a framework for prototyping and scaling out deep learning models for materials discovery supporting widely used materials science datasets, and built on top of PyTorch Lightning, the Deep Graph Library, and PyTorch Geometric.

Updated 9 months ago

farm-haystack • Rank 28.7 • Science 54%

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Updated 9 months ago

letta_api • Rank 18.4 • Science 64%

Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

Updated 9 months ago

gpt-researcher • Rank 15.2 • Science 64%

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Updated 9 months ago

mmengine • Rank 25.2 • Science 54%

OpenMMLab Foundational Library for Training Deep Learning Models

Updated 9 months ago

pandas-ai • Rank 24.5 • Science 54%

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Updated 9 months ago

SciMLBenchmarks • Rank 11.5 • Science 67%

Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, Jax), MATLAB, R

Updated 9 months ago

tensorzero • Rank 23.4 • Science 54%

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Updated 9 months ago

langchain • Rank 39.0 • Science 36%

🦜🔗 Build context-aware reasoning applications 🦜🔗

Updated 9 months ago

grand-challenge.org • Rank 9.2 • Science 64%

A platform for end-to-end development of machine learning solutions in biomedical imaging

Updated 9 months ago

tango • Rank 17.4 • Science 54%

Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.

Updated 9 months ago

mlflow • Rank 35.0 • Science 36%

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Updated 9 months ago

reductstore • Rank 17.6 • Science 52%

High Performance Storage and Streaming Solution for Data Acquisition Systems

Updated 9 months ago

snorkel • Rank 23.6 • Science 46%

A system for quickly generating training data with weak supervision

Updated 9 months ago

aif360 • Rank 22.2 • Science 46%

A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.

Updated 9 months ago

optimate • Rank 12.8 • Science 54%

A collection of libraries to optimise AI model performances

Updated 9 months ago

https://github.com/facebookresearch/habitat-lab • Rank 20.0 • Science 46%

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Updated 9 months ago

distilabel • Rank 11.6 • Science 54%

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Updated 9 months ago

bugbug • Rank 11.6 • Science 54%

Platform for Machine Learning projects on Software Engineering

Updated 9 months ago

beeai-framework • Rank 21.3 • Science 44%

Build production-ready AI agents in both Python and Typescript.

Updated 9 months ago

hash • Rank 11.2 • Science 54%

🚀 The open-source, multi-tenant, self-building knowledge graph

Updated 9 months ago

oss-fuzz-gen • Rank 10.6 • Science 54%

LLM powered fuzzing via OSS-Fuzz.

Updated 9 months ago

solana-agent-kit • Rank 20.5 • Science 44%

connect any ai agents to solana protocols

Updated 9 months ago

basic-memory • Rank 18.8 • Science 44%

AI conversations that actually remember. Never re-explain your project to Claude again. Local-first, integrates with Obsidian. Join our Discord: https://discord.gg/tyvKNccgqN

Updated 9 months ago

eco2ai • Rank 13.1 • Science 49%

eco2AI is a python library which accumulates statistics about power consumption and CO2 emission during running code.

Updated 9 months ago

linear-relational • Rank 7.8 • Science 54%

Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch

Updated 9 months ago

deeplake • Rank 25.6 • Science 36%

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Updated 9 months ago

tested • Rank 2.1 • Science 59%

The best way to test "it" is to watch someone else try to break "it".

Updated 9 months ago

monitors4codegen • Rank 7.0 • Science 54%

Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multispy` is a lsp client library in Python intended to be used to build applications around language servers.

Updated 9 months ago

https://github.com/khoj-ai/khoj • Rank 24.1 • Science 36%

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Updated 9 months ago

ai-bestpapers • Rank 4.9 • Science 54%

:trophy: AI Best Paper Awards

Updated 8 months ago

https://github.com/deepset-ai/haystack-core-integrations • Rank 22.9 • Science 36%

Additional packages (components, document stores and the likes) to extend the capabilities of Haystack

Updated 9 months ago

iamai • Rank 13.6 • Science 44%

A rule-driven comprehensive AI toolkit emphasizing simultaneous support for multimodal machine learning and the ability to construct cross-platform robots using logic.(规则驱动式的综合性人工智能工具库,强调同时支持多模态机器学习和利用逻辑构建跨平台机器人的能力)

Updated 9 months ago

grammarflow • Rank 5.5 • Science 52%

Powering Agent Chains by Constraining LLM Outputs

Updated 9 months ago

docling4j • Rank 3.2 • Science 54%

Docling4j brings the functionalities of Docling in document understanding to Java® projects