Nominally
Nominally: A Name Parser for Record Linkage - Published in JOSS (2021)
dedupe
A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
https://github.com/moj-analytical-services/splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
ER-Evaluation
ER-Evaluation: End-to-End Evaluation of Entity Resolution Systems - Published in JOSS (2023)
deduplipy
Python package for deduplication/entity resolution using active learning
pyjedai
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
https://github.com/amazon-science/refined
ReFinED is an efficient and accurate entity linking (EL) system.
oasis
A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).
califair-em
Official implementation of GUIDE-AI @ SIGMOD paper "Threshold-Independent Fair Matching through Score Calibration"
https://github.com/ai-team-uoa/autoer
Code & Experiments for IEEE Access paper "Auto-Configuring Entity Resolution Pipelines" by K. Nikoletos, V. Efthymiou, G. Papadakis and K. Stefanidis
pre-em-bias
Official implementation of the IEEE Big Data 2024 paper "Evaluating Blocking Biases in Entity Matching"
ember
Code and data for the paper "Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction" (IJCAI 2022)
linktransformer
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning