PyCM
PyCM: Multiclass confusion matrix library in Python - Published in JOSS (2018)
Learning from Crowds with Crowd-Kit
Learning from Crowds with Crowd-Kit - Published in JOSS (2024)
NiaARM
NiaARM: A minimalistic framework for Numerical Association Rule Mining - Published in JOSS (2022)
pypots
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/classification/clustering/forecasting/anomaly detection/cleaning on incomplete industrial (irregularly-sampled) multivariate TS with NaN missing values
lexicalrichness
:smile_cat: :speech_balloon: A module to compute textual lexical richness (aka lexical diversity).
pyprobables
Probabilistic data structures in python http://pyprobables.readthedocs.io/en/latest/index.html
sport-activities-features
A minimalistic toolbox for extracting features from sports activity files written in Python
smart-agriculture-datasets
lasio
Python library for reading and writing well data using Log ASCII Standard (LAS) files
automlpipeline.jl
A package that makes it trivial to create and evaluate machine learning pipeline architectures.
@stdlib/datasets-fivethirtyeight-ffq
FiveThirtyEight reader responses to a food frequency questionnaire (FFQ).
arctic3d
Automatic Retrieval and ClusTering of Interfaces in Complexes from 3D structural information
pygrinder
PyGrinder: a Python toolkit for grinding data beans into the incomplete for real-world data simulation by introducing missing values with different missingness patterns, including MCAR (complete at random), MAR (at random), MNAR (not at random), sub sequence missing, and block missing
tsdb
a Python toolbox loads 172 public time series datasets for machine/deep learning with a single line of code. Datasets from multiple domains including healthcare, financial, power, traffic, weather, and etc.
awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
core-periphery-hypergraphs
[KDD 2022] Official Code Release for "Core-periphery Models for Hypergraphs"
nuggets
R package for searching of patterns in subspaces described with elementary conjunctions
awesome-arm-in-smart-agriculture
A collection of literature on the use of association rule mining methods in smart agriculture
https://github.com/centrefordigitalhumanities/gabber
A project for the Data School.