Updated 6 months ago
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Updated 6 months ago
https://github.com/desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
anomaly-detection
correlations
data-analytics
data-cleaning
data-cleansing
data-engineering
data-exploration
data-mining
data-mining-algorithms
data-preprocessing
data-profiling
data-science
data-wrangling
exploratory-data-analysis
feature-engineering
feature-extraction
feature-selection
knowledge-discovery
spreadsheets
tabular-data
Updated 6 months ago
https://github.com/nagapv/edexplore
A simple widget for interactive EDA / QA. Works on top of Pandas [in Jupyter Notebook] using IPyWidgets with a sprinkle of Regex.
Updated 6 months ago
https://github.com/sidkris/megaprofiler
A Python library for automatic data profiling and validation