Scientific Software
Updated 6 months ago

cuallee — Peer-reviewed • Rank 18.9 • Science 98%

cuallee: A Python package for data quality checks across multiple DataFrame APIs - Published in JOSS (2024)

Updated 6 months ago

https://github.com/awslabs/deequ • Rank 16.5 • Science 36%

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Updated 6 months ago

https://github.com/autoviml/pandas_dq • Rank 14.7 • Science 13%

Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.

Updated 6 months ago

rqssframework • Science 44%

The main code repository for the Referencing Quality Scoring System (RQSS) metrics. Paper: https://www.semantic-web-journal.net/system/files/swj3593.pdf