imodels
imodels: a python package for fitting interpretable models - Published in JOSS (2021)
Yellowbrick
Yellowbrick: Visualizing the Scikit-Learn Model Selection Process - Published in JOSS (2019)
Feature-engine
Feature-engine: A Python package for feature engineering for machine learning - Published in JOSS (2021)
Osprey
Osprey: Hyperparameter Optimization for Machine Learning - Published in JOSS (2016)
fseval
fseval: A Benchmarking Framework for Feature Selection and Feature Ranking Algorithms - Published in JOSS (2022)
RSMTool
RSMTool: a collection of tools for building and evaluating automated scoring models - Published in JOSS (2016)
chemotools
chemotools: A Python Package that Integrates Chemometrics and scikit-learn - Published in JOSS (2024)
pykoop
Koopman operator identification library in Python, compatible with `scikit-learn`
Scikit-Longitudinal
Scikit-Longitudinal: A Machine Learning Library for Longitudinal Classification in Python - Published in JOSS (2025)
cblearn
cblearn: Comparison-based Machine Learning in Python - Published in JOSS (2024)
flaml
A fast library for AutoML and tuning.
scikit-activeml
scikit-activeml: Python library for active learning on top of scikit-learn
skforecast
Time series forecasting with machine learning models
pydata-wrangler
Wrangle messy numerical, image, and text data into consistent well-organized formats
reflame
reflame: Revolutionizing Functional Link Neural Network by Metaheuristic Optimization
emlearn
Machine Learning inference engine for Microcontrollers and Embedded devices
tpot
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
treeple
Scikit-learn compatible decision trees beyond those offered in scikit-learn
prince
Multivariate exploratory data analysis in Python: PCA, CA, MCA, MFA, FAMD, GPA
eckity
EC-KitY: A scikit-learn-compatible Python tool kit for doing evolutionary computation.
scikit-matter
A collection of scikit-learn compatible utilities that implement methods born out of the materials science and chemistry communities
https://github.com/arogozhnikov/hep_ml
Machine Learning for High Energy Physics.
https://github.com/epistasislab/tpot2
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
tclf
A scikit-learn compatible classifier to perform trade classification in Python.
mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
https://github.com/uxlfoundation/scikit-learn-intelex
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
https://github.com/mikekeith52/scalecast
The practitioner's forecasting library
metasklearn
MetaSklearn: A Metaheuristic-Powered Hyperparameter Optimization Framework for Scikit-Learn Models.
dtreeviz
A Python library for decision tree visualization and model interpretation.
https://github.com/reiinakano/scikit-plot
An intuitive library to add plotting functionality to scikit-learn objects.
jetson-containers
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
m5py
An implementation of M5 and model trees in Python, compliant with scikit-learn.
zoish
Zoish is a Python package that streamlines machine learning by leveraging SHAP values for feature selection and interpretability, making model development more efficient and user-friendly
https://github.com/csinva/tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
stormtrooper
Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.
hybrid-vocal-classifier
A Python machine learning library for animal vocalizations and bioacoustics
vulpes
Vulpes: Test many classification and regression models and clustering algorithms to see which is most suitable for your dataset
https://github.com/time-series-machine-learning/tsml-py
A toolkit for time series machine learning algorithms.
bibmon
Python package that provides predictive models for fault detection, soft sensing, and process condition monitoring.
credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
kmodes
Python implementations of the k-modes and k-prototypes clustering algorithms, for clustering categorical data
https://github.com/polyaxon/hypertune
A library for performing hyperparameter optimization
https://github.com/zama-ai/concrete-ml
Concrete ML: Privacy Preserving ML framework using Fully Homomorphic Encryption (FHE), built on top of Concrete, with bindings to traditional ML frameworks.
upgini
Data search and enrichment library for machine learning. Easily find and add relevant features to your ML and AI pipeline from hundreds of public and premium external data sources, including open and commercial LLMs
https://github.com/alleninstitute/mouse_connectivity_models
Python package providing mesoscale connectivity models for mouse.
https://github.com/comic/evalutils
evalutils helps users create extensions for grand-challenge.org
https://github.com/csinva/disentangled-attribution-curves
Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees"
gap-stat
Dynamically determine the suggested number of clusters in the data for unsupervised learning.
projection-pursuit
An implementation of multivariate projection pursuit regression and univariate classification
mantis-ml
mantis-ml: Stochastic semi-supervised learning to prioritise genes from high throughput genomic screens
seqArchR
seqArchR: Identifying (promoter) sequence architectures de novo using NMF
sktime-dl
DEPRECATED, now in sktime - companion package for deep learning based on TensorFlow
nolearn
Combines the ease of use of scikit-learn with the power of Theano/Lasagne
https://github.com/gperdrizet/ensembleset
Ensemble dataset generator for tabular data prediction and modeling projects.
tinyautoml
TinyAutoML is a comprehensive pipeline classifier project designed as a scikit-learn plugin
https://github.com/alleninstitute/neuroglia
A Python machine learning library for neurophysiology data
sklearn-porter
Transpile trained scikit-learn estimators to C, Java, JavaScript and others.
practicalstats-pucsp-2024
Statistical Measures in Python - Age and Salary Analysis
https://github.com/chris-santiago/steps
A scikit-learn-style feature selector using best subsets and stepwise regression.
https://github.com/autoviml/pandas_dq
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
https://github.com/cerlymarco/tspiral
A Python package for time series forecasting with scikit-learn estimators.
https://github.com/gsganden/model_inspector
A uniform interface to a curated set of methods for inspecting machine learning models
https://github.com/bpesquet/mlkatas
(Phased out) A series of challenges for practicing your Machine Learning and Deep Learning skills
https://github.com/janjoswig/commonnnclustering
A Python package for common-nearest-neighbours clustering
https://github.com/ahmedshahriar/sentiment-analysis
Sentiment Analysis Projects