Updated 5 months ago

dedupe • Rank 23.6 • Science 62%

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Updated 6 months ago

@stdlib/iter-unique-by • Rank 3.1 • Science 44%

Create an iterator which returns unique values according to a predicate function.

Updated 6 months ago

@stdlib/iter-intersection-by-hash • Rank 2.3 • Science 44%

Create an iterator which returns the intersection of two or more iterators according to a hash function.

Updated 6 months ago

@stdlib/iter-dedupe-by • Rank 2.3 • Science 44%

Create an iterator which removes consecutive values that resolve to the same value according to a provided function.