Scientific Software
Updated 9 months ago

imodels — Peer-reviewed • Rank 21.0 • Science 100%

imodels: a python package for fitting interpretable models - Published in JOSS (2021)

Scientific Software
Updated 9 months ago

Machine Learning Validation via Rational Dataset Sampling with astartes — Peer-reviewed • Rank 15.6 • Science 100%

Machine Learning Validation via Rational Dataset Sampling with astartes - Published in JOSS (2023)

Engineering (40%)
Scientific Software · Peer-reviewed
Scientific Software
Updated 9 months ago

VeridicalFlow — Peer-reviewed • Rank 10.4 • Science 100%

VeridicalFlow: a Python package for building trustworthy data science pipelines with PCS - Published in JOSS (2022)

Scientific Software
Updated 9 months ago

Choice-Learn — Peer-reviewed • Rank 12.0 • Science 98%

Choice-Learn: Large-scale choice modeling for operational contexts through the lens of machine learning - Published in JOSS (2024)

Scientific Software
Updated 9 months ago

BetaML — Peer-reviewed • Rank 10.5 • Science 95%

BetaML: The Beta Machine Learning Toolkit, a self-contained repository of Machine Learning algorithms in Julia - Published in JOSS (2021)

Updated 9 months ago

terragpu • Rank 3.3 • Science 85%

Python library to process and classify remote sensing imagery by means of GPUs and ML.

Updated 9 months ago

irl-imitation • Rank 7.1 • Science 77%

Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL

Updated 9 months ago

spdl • Rank 15.0 • Science 64%

Scalable and Performant Data Loading

dl ml
Updated 9 months ago

tensorzero • Rank 23.4 • Science 54%

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Updated 9 months ago

deepchecks • Rank 22.8 • Science 54%

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

Updated 9 months ago

glaucus • Rank 7.0 • Science 67%

Glaucus is a PyTorch complex-valued ML autoencoder & RF estimation python module.

Updated 9 months ago

mlflow • Rank 35.0 • Science 36%

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Updated 9 months ago

yggdrasil-decision-forests • Rank 21.5 • Science 49%

A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.

Updated 9 months ago

bugbug • Rank 11.6 • Science 54%

Platform for Machine Learning projects on Software Engineering

Updated 9 months ago

tensorflow-mri • Rank 5.1 • Science 59%

A Library of TensorFlow Operators for Computational MRI

Updated 9 months ago

https://github.com/biomedsciai/causallib • Rank 16.5 • Science 46%

A Python package for modular causal inference analysis and model evaluations

Updated 9 months ago

deeplake • Rank 25.6 • Science 36%

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Updated 9 months ago

k-means-constrained • Rank 17.3 • Science 44%

K-Means clustering - constrained with minimum and maximum cluster size. Documentation: https://joshlk.github.io/k-means-constrained

Updated 9 months ago

structure-seer • Rank 0.7 • Science 57%

The implementation, training and evaluation of a Structure Seer machine learning model designed for reconstruction of adjacency of a molecular graph from the labelling of its nodes.

Updated 9 months ago

iamai • Rank 13.6 • Science 44%

A rule-driven comprehensive AI toolkit emphasizing simultaneous support for multimodal machine learning and the ability to construct cross-platform robots using logic.(规则驱动式的综合性人工智能工具库,强调同时支持多模态机器学习和利用逻辑构建跨平台机器人的能力)

Updated 9 months ago

functionary • Rank 10.6 • Science 44%

Chat language model that can use tools and interpret the results

Updated 9 months ago

state-of-open-source-ai • Rank 10.3 • Science 44%

:closed_book: Clarity in the current fast-paced mess of Open Source innovation

Updated 9 months ago

https://github.com/google/dopamine • Rank 29.1 • Science 23%

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Updated 9 months ago

@stdlib/datasets-suthaharan-single-hop-sensor-network • Rank 4.0 • Science 44%

Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.

Updated 9 months ago

@stdlib/datasets-suthaharan-multi-hop-sensor-network • Rank 3.7 • Science 44%

Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.

Updated 9 months ago

rgf • Rank 19.0 • Science 23%

Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one written in C++, along with various language-specific wrappers.

Updated 9 months ago

https://github.com/csinva/matching-with-gans • Rank 4.9 • Science 33%

Matching in GAN latent space for better bias benchmarking and semantic image editing. 👶🏻🧒🏾👩🏼‍🦰👱🏽‍♂️👴🏾

Updated 9 months ago

ai • Rank 10.9 • Science 26%

AI ——人工智能工具集,包含机器学习,深度学习,自然语言处理

Updated 9 months ago

quickai • Rank 10.5 • Science 26%

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

Updated 9 months ago

https://github.com/kyegomez/gemini • Rank 7.2 • Science 26%

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Updated 9 months ago

https://github.com/thebabylonai/babylog • Rank 9.1 • Science 23%

A lightweight logger for machine learning teams to log images and predictions in production.

Updated 9 months ago

biowatch • Rank 3.6 • Science 26%

Analyze, visualize, and explore camera trap datasets with ease

Updated 9 months ago

https://github.com/neptune-ai/neptune-notebooks • Rank 14.9 • Science 13%

📚 Jupyter Notebooks extension for versioning, managing and sharing notebook checkpoints in your machine learning and data science projects.

Updated 9 months ago

https://github.com/csinva/dnn-ensemble • Rank 2.1 • Science 23%

Testing the properties of ensembled neural networks.

Updated 9 months ago

https://github.com/dair-ai/ml-nlp-paper-discussions • Rank 5.0 • Science 10%

📄 A repo containing notes and discussions for our weekly NLP/ML paper discussions.

Updated 9 months ago

osdg-tool • Rank 4.9 • Science 10%

OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant content in any text. The tool is available online at www.osdg.ai. API access available for research purposes.

Updated 9 months ago

https://github.com/csbiology/chlamyatlas • Rank 0.7 • Science 13%

Chlamy Atlas is a AI-powered web application which predicts the localizations of proteins from the Green Algae Chlamydomonas reinhardtii.

Updated 9 months ago

insect-detect • Science 67%

Detection models and Python scripts for automated insect monitoring with the Insect Detect DIY camera trap.

Updated 9 months ago

https://github.com/csinva/nano-descriptions • Science 13%

Algorithms for machine-learning analysis and descriptions of nanomaterials

Updated 9 months ago

machine-learning-responsible-python • Science 18%

Introduction to responsible machine learning with Python

Updated 9 months ago

ocaml-semsearch-jsoo • Science 44%

OCaml + js_of_ocaml + SBERT + TensorFlow.js

Updated 9 months ago

wnb • Science 44%

General (mixed) and weighted naive Bayes classifiers.

Updated 9 months ago

openspeaks-before-ai • Science 67%

A set of frameworks for creating the AI/ML building blocks for low-resource languages.

Updated 9 months ago

tikz-machinelearning • Science 41%

Machine Learning wih TikZ

Updated 9 months ago

lecture-ai-basics • Science 44%

Course content for the elective Artificial Intelligence I, covering foundational AI concepts and applied exercises.

Updated 9 months ago

speech_data_ghana_ug • Science 57%

The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani, Daagare, and Ikposo. Each language includes 1000 hours of audio speech from indigenous speakers of the language. Of which 100 hours is transcribed.

Updated 9 months ago

gnn_tracking • Science 54%

Reconstruct billions of particle trajectories with graph neural networks

Updated 9 months ago

https://github.com/agnostiqhq/tutorials_covalent_pydata_2023 • Science 13%

Covalent tutorial notebooks and slides for PyData 2023, NYC