Nominally
Nominally: A Name Parser for Record Linkage - Published in JOSS (2021)
dedupe
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
https://github.com/moj-analytical-services/splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
ER-Evaluation
ER-Evaluation: End-to-End Evaluation of Entity Resolution Systems - Published in JOSS (2023)
deduplipy
Python package for deduplication/entity resolution using active learning
pyjedai
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
https://github.com/amazon-science/refined
ReFinED is an efficient and accurate entity linking (EL) system.
oasis
A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).
linktransformer
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
califair-em
Official implementation of GUIDE-AI @ SIGMOD paper "Threshold-Independent Fair Matching through Score Calibration"
ember
Code and data for the paper "Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction" (IJCAI 2022)
https://github.com/ai-team-uoa/autoer
Code & Experiments for IEEE Access paper "Auto-Configuring Entity Resolution Pipelines" by K.Nikoletos, V.Efthymiou, G.Papadakis and K.Stafanidis
pre-em-bias
Official implementation of the IEEE Big Data 2024 paper "Evaluating Blocking Biases in Entity Matching"