Recent Releases of ydata-profiling
ydata-profiling - v4.16.0
4.16.0 (2025-03-24)
Features
- Python
Published by azory-ydata about 1 year ago
ydata-profiling - v4.15.1
What's Changed
- fix: add a new env variable for banner by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1735
- chore(actions): revert to python 3.11 by @portellaa in https://github.com/ydataai/ydata-profiling/pull/1736
Full Changelog: https://github.com/ydataai/ydata-profiling/compare/v4.15.0...v4.15.1
- Python
Published by portellaa about 1 year ago
ydata-profiling - v4.15.0
What's Changed
- fix: improve profiling code logic by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1728
- chore(setup): comply build process with latest changes to pypi by @portellaa in https://github.com/ydataai/ydata-profiling/pull/1731
- feat: update spark version for ydata-profiling by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1730
- chore: fix linting errors by @portellaa in https://github.com/ydataai/ydata-profiling/pull/1732
- chore: fix release action by @portellaa in https://github.com/ydataai/ydata-profiling/pull/1733
- chore: add permissions to the release file by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1734
Full Changelog: https://github.com/ydataai/ydata-profiling/compare/v4.14.0...v4.15.0
- Python
Published by fabclmnt about 1 year ago
ydata-profiling - v4.13.0
4.13.0 (2025-03-05)
Bug Fixes
- add information to ydataprofiling (#1716) (b74a5de)
- avoid failure when index level shares name with a column (#1673) (cf853bd)
- display: Update display import (#1709) (f593bb7)
- fix CI pipeline (df66320)
- pandas version (53c8d36)
- pandas version (7ebb7c6)
Features
- move setup.py to pyproject.toml and update makefile (6a57afc)
- update contribution guide dev install (0100cab)
- update github actions to support new requirements (88c6778)
- Python
Published by azory-ydata about 1 year ago
ydata-profiling - v4.12.0
4.12.0 (2024-10-29)
Bug Fixes
- actions: add permissions object (#1668) (d6df865)
- is_categorical_dtype is deprecated and will be removed in a future version. Use isinstance(dtype, pd.CategoricalDtype) (5bbd589)
- linting: code formatting (d3b58e7)
- linting: code formatting (886be38)
- Literal not supported in python 3.7 (7ae5e16)
- linting issue (7758dba)
- mypy does not support dynamic types (227b917)
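The deprecation fix listed above replaces a helper call with a plain isinstance check. A minimal sketch of that pattern, assuming pandas is installed; the helper name is_categorical and the sample series are hypothetical and not taken from the ydata-profiling code base:

```python
import pandas as pd


def is_categorical(series: pd.Series) -> bool:
    """Return True when the series uses a categorical dtype.

    Uses the isinstance check recommended by the deprecation message
    instead of the deprecated is_categorical_dtype helper.
    """
    return isinstance(series.dtype, pd.CategoricalDtype)


# Hypothetical sample data for illustration only.
colors = pd.Series(["red", "green", "red"], dtype="category")
counts = pd.Series([1, 2, 3])

print(is_categorical(colors), is_categorical(counts))
```

The check works on the dtype object itself, so it can also be applied to a DataFrame column's dtype without touching the data.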
Features
- add adfuller stationary test parameters to config (3642b01)
- Python
Published by azory-ydata over 1 year ago
ydata-profiling - v4.11.0
4.11.0 (2024-10-16)
Bug Fixes
- alert style name in widget (e4814fd)
- Correction for numpy.NaN deprecating to numpy.nan (#1649) (06524cc)
- linting: code formatting (938a601)
- variables: select column dropdown placement changes (#1629) (acd2189)
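The numpy.NaN fix above reflects that NumPy 2.0 removed the numpy.NaN alias, leaving the lowercase numpy.nan spelling. A small sketch of code that uses only the lowercase name, assuming NumPy is installed; the array contents are made up for illustration:

```python
import numpy as np

# np.nan is the supported spelling; the np.NaN alias no longer exists in NumPy 2.x.
values = np.array([1.0, np.nan, 3.0])

# Boolean mask marking the missing entries.
missing_mask = np.isnan(values)

# Mean computed over the non-missing entries only.
mean_without_missing = float(np.nanmean(values))

print(missing_mask.tolist(), mean_without_missing)
```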
Features
- remove dark and orange model from inputs (712ee67)
- Python
Published by azory-ydata over 1 year ago
ydata-profiling - v4.10.0
4.10.0 (2024-09-07)
Bug Fixes
- linting: code formatting (e94e500)
- remove compare report type enforcement (c2b81e5)
- suppress pandas tick resolution adjustment (#1635) (ca43fc8)
Features
- bump numpy version to support numpy v.2.2 (5679b16)
- Python
Published by azory-ydata over 1 year ago
ydata-profiling - v4.9.0
4.9.0 (2024-07-15)
Bug Fixes
- 1540 add_expectation_suite doesn't have key overwrite_existing (#1541) (159fe0d)
- bug fixed (#1601) (9946037)
- fix correlations issues whenever strings are present in the dataset. (1c94d33)
- jquery & bootstrap versions upgrade to fix vulnerables (jquery vulns related to xss and bootstrap vulns related to CVE & Sonatype) (7032adb)
- jquery: jquery update import changes (93fc6a0)
- linting: code formatting (9c85bc7)
- linting: code formatting (3eeab5e)
- linting: code formatting (0ec2085)
- logger flagged errors (488341c)
- remove analytics from the tests (a4afa34)
- update html (733d334)
- update matplotlib and tests.yaml (7176590)
- update matplotlib and tests.yaml for python3.7 (1c42264)
- update setup file for notebook installation (2d2ccbc)
Features
Reverts
- Revert "fix: jquery & bootstrap versions upgrade to fix vulnerables (jquery v…" (5164e0b)
- Python
Published by azory-ydata almost 2 years ago
ydata-profiling - 4.8.3
Full Changelog: https://github.com/ydataai/ydata-profiling/compare/v4.8.2...4.8.3
What's Changed
- fix: comparison to_json pd.Series encoding error by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1538
- fix: switch create_expectation_suite with add by @nadavshatz in https://github.com/ydataai/ydata-profiling/pull/1540
- fix: timeseries gap comparison failing with no gaps for date index by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1551
- fix: great expectations unit tests referring to deprecated method by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1558
- fix: zero division for categorical columns with 100% missing data by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1569
- chore: update html and readme by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1571
- fix: downgrade commitlint action version by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1572
- chore(deps): remove seaborn upper limit by @ggydush in https://github.com/ydataai/ydata-profiling/pull/1570
- chore(deps): update dependency mike to >=2.1.1,<2.2.0 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1580
- chore: update python3statement url by @tqa236 in https://github.com/ydataai/ydata-profiling/pull/1573
- fix: Use correct value in ConstantAlert template by @AlpAribal in https://github.com/ydataai/ydata-profiling/pull/1574
- fix: Resolve the issue of conflicts between columns added during the analysis process and the original data columns in the Spark version. by @frelion in https://github.com/ydataai/ydata-profiling/pull/1518
- chore(deps): update typeguard version by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1581
- chore(deps): update dependency mkdocs-table-reader-plugin to <=2.2.0 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1478
- chore(deps): update dependency coverage to v7 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1403
- chore(deps): update dependency mkdocs to >=1.6.0,<1.7.0 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1438
- feat: add analytics by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1582
- fix: class exception by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1584
- fix: code exception by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1585
- chore: remove version by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1586
- docs: update links in the documentation by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1587
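The zero-division fix for categorical columns with 100% missing data points at a common pitfall: ratios computed over the non-missing values have an empty denominator when nothing is present. The following is a minimal sketch of the guard, not the project's actual implementation; the function name top_category_share and the choice to return 0.0 for an all-missing column are assumptions:

```python
from collections import Counter
from typing import Optional, Sequence


def top_category_share(values: Sequence[Optional[str]]) -> float:
    """Share of the most frequent category among the non-missing values."""
    present = [value for value in values if value is not None]
    if not present:
        # Every value is missing: return a neutral result instead of dividing by zero.
        return 0.0
    top_count = Counter(present).most_common(1)[0][1]
    return top_count / len(present)


print(top_category_share([None, None, None]))
print(top_category_share(["a", "a", "b", None]))
```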
New Contributors
- @nadavshatz made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1540
- @ggydush made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1570
- @tqa236 made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1573
- @AlpAribal made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1574
- @frelion made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1518
Full Changelog: https://github.com/ydataai/ydata-profiling/compare/v4.7.0...4.8.3
- Python
Published by fabclmnt about 2 years ago
ydata-profiling - v4.8.1
4.8.1 (2024-05-07)
Bug Fixes
- Python
Published by azory-ydata about 2 years ago
ydata-profiling - v4.8.0
What's Changed
- fix: comparison to_json pd.Series encoding error by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1538
- fix: switch create_expectation_suite with add by @nadavshatz in https://github.com/ydataai/ydata-profiling/pull/1540
- fix: timeseries gap comparison failing with no gaps for date index by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1551
- fix: great expectations unit tests referring to deprecated method by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1558
- feat: support for python 3.12 by @aquemy in https://github.com/ydataai/ydata-profiling/pull/1521
- chore(actions): remove sonarqube by @andre-lx in https://github.com/ydataai/ydata-profiling/pull/1561
- chore(actions): remove sonarqube by @andre-lx in https://github.com/ydataai/ydata-profiling/pull/1563
- fix: zero division for categorical columns with 100% missing data by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1569
- chore: update html and readme by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1571
- fix: downgrade commitlint action version by @alexbarros in https://github.com/ydataai/ydata-profiling/pull/1572
- chore(deps): remove seaborn upper limit by @ggydush in https://github.com/ydataai/ydata-profiling/pull/1570
- chore(deps): update dependency mike to >=2.1.1,<2.2.0 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1580
- chore: update python3statement url by @tqa236 in https://github.com/ydataai/ydata-profiling/pull/1573
- fix: Use correct value in ConstantAlert template by @AlpAribal in https://github.com/ydataai/ydata-profiling/pull/1574
- fix: Resolve the issue of conflicts between columns added during the analysis process and the original data columns in the Spark version. by @frelion in https://github.com/ydataai/ydata-profiling/pull/1518
- chore(deps): update typeguard version by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1581
- chore(deps): update dependency mkdocs-table-reader-plugin to <=2.2.0 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1478
- chore(deps): update dependency coverage to v7 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1403
- chore(deps): update dependency mkdocs to >=1.6.0,<1.7.0 by @renovate in https://github.com/ydataai/ydata-profiling/pull/1438
- feat: add analytics by @fabclmnt in https://github.com/ydataai/ydata-profiling/pull/1582
New Contributors
- @nadavshatz made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1540
- @ggydush made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1570
- @tqa236 made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1573
- @AlpAribal made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1574
- @frelion made their first contribution in https://github.com/ydataai/ydata-profiling/pull/1518
Full Changelog: https://github.com/ydataai/ydata-profiling/compare/v4.7.0...v4.8.0
- Python
Published by azory-ydata about 2 years ago
ydata-profiling - v4.6.5
4.6.5 (2024-02-26)
Bug Fixes
- comparison to_json pd.Series encoding error (#1538) (84af0d0)
- switch create_expectation_suite with add_expectation_suite (#1540) (e0c770e)
- timeseries gap comparison failing with no gaps for date index (#1551) (bd5782b)
- update describe_supported_spark.py included fix for DivideByZero (#1545) (129bb98)
- Python
Published by azory-ydata over 2 years ago
ydata-profiling - v4.6.3
4.6.3 (2023-12-05)
Bug Fixes
This release also changes the upper bound for the pandas version.
- Python
Published by azory-ydata over 2 years ago
ydata-profiling - v4.6.1
What's Changed
- chore(deps): bump to pydantic v2 by @portellaa in https://github.com/ydataai/ydata-profiling/pull/1483
This release only bumps the pydantic requirement to v2. The code itself continues to use the v1 API, which is bundled inside v2. We will proceed with the migration as soon as possible.
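The approach described above, depending on pydantic v2 while still using the v1 API, is usually done through the pydantic.v1 namespace. A minimal sketch, assuming pydantic is installed; the ReportSettings model and its fields are hypothetical and do not mirror the package's real settings classes:

```python
try:
    # pydantic v2 bundles the legacy v1 API under this namespace.
    from pydantic.v1 import BaseModel
except ImportError:
    # Fallback for environments that still have an older pydantic v1 installed.
    from pydantic import BaseModel


class ReportSettings(BaseModel):
    """Hypothetical settings model written against the v1 API."""

    title: str = "Profiling Report"
    minimal: bool = False


settings = ReportSettings(minimal=True)

# .dict() is the v1 serialisation method (v2 models use model_dump instead).
print(settings.dict())
```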
- Python
Published by azory-ydata over 2 years ago
ydata-profiling - v4.6.0
4.6.0 (2023-10-10)
Bug Fixes
- bug when creating a new report (#1440) (9a5b4fe)
- fixate matplotlib upper version (3ded762)
- gen wordcloud only for non-empty cols (#1459) (f04fb61)
- remove the duplicated cardinality threshold under categorical and text settings (80a1024)
- table template ignoring text format (#1462) (472c216)
- template {{ file_name }} error in HTML wrapper (#1380) (a5a227f)
- to_category mishandling pd.NA (#1464) (d5bbe7d)
- update duplicates_pandas.py (#1427) (56a6641), closes #1384
Features
- Python
Published by azory-ydata over 2 years ago
ydata-profiling - v4.3.2
4.3.2 (2023-07-19)
Bug Fixes
- Python
Published by azory-ydata almost 3 years ago
ydata-profiling - v4.2.0
4.2.0 (2023-05-24)
Bug Fixes
- add punctuation marks in all alerts (#1315) (acb9f21)
- complete alerts information in json (#1339) (4108709)
- error for large integers in Series (#1308) (50e44dd)
- infer all Numeric vars as TimeSeries when tsmode=True (#1343) (d750e00)
- json ignoring redact (#1341) (9f5d4ba)
- lint complaints on list comprehension (#1327) (621450d)
- old references links (#1328) (7db5694)
- requirements upper bounds (#1326) (e6b3a49)
- revert "infer all Numeric vars as TimeSeries when tsmode=True (#1343)" (#1346) (40fb0c2)
Features
- Python
Published by azory-ydata about 3 years ago
ydata-profiling - v4.1.1
4.1.1 (2023-03-17)
Bug Fixes
- Python
Published by azory-ydata about 3 years ago
ydata-profiling - v4.1.0
4.1.0 (2023-03-08)
Bug Fixes
- add imagehash requirements (0a8c5de)
- correlation between columns w/o nonnull row (cfda278)
- removed irregular spacing (#1252) (8aff455)
- update links and alignment in README (efd633a)
- update pypi links (1c84e89)
- update Stack Overflow to link to use both tags (79202f7)
Features
- Python
Published by azory-ydata about 3 years ago
ydata-profiling - v4.0.0
4.0.0 (2023-01-30)
Bug Fixes
- version bug and incorporate feedback (3fe3aee)
- add css to cope with large tables (7e51e77)
- add ProfileReport validations and configuration file (175798e)
- add Report configurations for correlations (9339a2d)
- add union to typecheck (351b17f)
- change dataframe empty check method (0f474ae)
- df validation (b834214)
- fix date and datetime histogram. Fix Spearman and Pearson's correlations (0c77695)
- fix merge issues (3cf3cc4)
- improve description and correct plot for ‘auto’ correlation (#1119) (a105d32)
- isinstance (ce81102)
- isinstance (c6a6daa)
- move file example. remove unused prints (4f794cd)
- re-introduce to prevent cases where the profilereport is init with no DF (b12f45f)
- remove get_default_settings (9f4f8f9)
- solve mypy error (35d9827)
- uses colors from the specified config (74041c7)
Features
- add heatmap values as a table under correlations (5268544)
- design improvements on the correlations section (5facb7b)
- minimal spark deployment (#1132) (4113389)
- setup: change package naming (#1254) (99e2e9f)
- update variables layout (#1207) (34e19a7)
- Python
Published by azory-ydata over 3 years ago
ydata-profiling - v3.6.1
3.6.1 (2022-12-23)
Bug Fixes
- Python
Published by azory-ydata over 3 years ago
ydata-profiling - v3.6.0
3.6.0 (2022-12-21)
Bug Fixes
- add css to cope with large tables (7f42f87)
- adjust categoricals layout (f0bb45a)
- categorical data not being obscured in the common values plot (40236bc)
- compare report ignoring config parameter (3d60556)
- compare report warnings always showing the last alert type (6b3c13d)
- comparison fails when duplicates are disabled (#1208) (6d19620)
- do not raise exception for percentage formatter (3ea626d)
- enforce recomputation of description sets (a9fd1c8)
- error comparing only one precomputed profile (00646cd)
- html: sensible cloud-platform notebook html rendering (b22ece2)
- ignoring config of precomputed reports (6478c40)
- only compute auto correlation when no config is specified (d5d4f58)
- remove malfunctioning hook (e2593f5)
- remove unused test (2170338)
- return the proper type for widgets (4c0b358)
- set compute default to false (c70e491)
- solve mypy error (9c4266e)
- solve mypy issue (e3e7788)
- uses colors from the specified config (c0c556d)
- utils: use 'urllib.request' instead of 'requests' (#1177) (e4d020b), closes #1168
Features
- add heatmap values as a table under correlations (fc5da9e)
- allow to specify the configuration for the comparison report (ad725b0)
- design improvements on the correlations section (e5cd8cf)
- implement imbalanced warning (ce84c81)
- update variables layout (#1207) (cf0e0a7)
- Python
Published by azory-ydata over 3 years ago
ydata-profiling - v3.5.0
3.5.0 (2022-11-22)
Bug Fixes
- change context managed backend (#1149) (11e1a8a)
- dataset names on comparison report (#1159) (3c14d43)
- duplicate key in test dict (#1126) (d19affe)
- improve description and correct plot for ‘auto’ correlation (#1119) (2617b92)
- remove correlation calculation for constants (#1152) (1ed2bc0)
- time series render format (#1157) (39ca8ce)
- update config files to only calculate 'auto' correlation (#1158) (34cf73d)
- update repository links (#1141) (c742c5d)
Features
- add typechecking to profile report (#1139) (ec8ece0)
- report comparison example (#1160) (5e75fd2)
- report comparisons (#1069) (70ee5c7), closes #1137 #1136 #1143 #1148 #1150
- Python
Published by azory-ydata over 3 years ago
ydata-profiling - v3.4.0
3.4.0 (2022-10-20)
Bug Fixes
- correlation auto passing extra parameters (#1114) (21f4fe6)
- cramer's correlation fails with missings vals (#1109) (8e7f8b2)
- drop joblib dependency (#1090) (586cef3), closes #1056
- fix linter errors (#1117) (5f17cfd)
- make tangled-up-in-unicode an optional dependency (#1070) (e6b2a00)
- remove unused imports (56beed4)
- remove unused imports (66864c1)
- Remove unused imports. (985fbd1)
Features
- add support for Pandas 1.5 (#1076) (5c5a710)
- added filter to locate columns (#1115) (c2f817d)
- introduce auto parameter for correlations (#1095) (4d2e415)
- Python
Published by azory-ydata over 3 years ago
ydata-profiling - pandas-profiling v3.3.0
The full changelog is available here: https://pandas-profiling.ydata.ai/docs/master/pages/reference/changelog.html?highlight=change+log
- Python
Published by fabclmnt over 3 years ago
ydata-profiling - pandas-profiling v3.2.0
The full changelog is available here: https://pandas-profiling.ydata.ai/docs/master/pages/reference/changelog.html?highlight=change+log
- Python
Published by sbrugman about 4 years ago
ydata-profiling - pandas-profiling v3.1.0
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman over 4 years ago
ydata-profiling - pandas-profiling v3.0.0
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman about 5 years ago
ydata-profiling - pandas-profiling v2.13.0
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman about 5 years ago
ydata-profiling - pandas-profiling v2.12.0
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman about 5 years ago
ydata-profiling - pandas-profiling v2.11.0
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman over 5 years ago
ydata-profiling - pandas-profiling v2.10.1
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman over 5 years ago
ydata-profiling - pandas-profiling v2.10.0rc1
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman over 5 years ago
ydata-profiling - pandas-profiling v2.9.0
The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
- Python
Published by sbrugman over 5 years ago
ydata-profiling - pandas-profiling v2.9.0rc1
This release candidate improves handling of sensitive data and furthermore reduces technical debt with various fixes. The full changelog is available here: https://pandas-profiling.github.io/pandas-profiling/docs/master/rtd/pages/changelog.html.
A warm thank you to everyone who has contributed to this release: @gauravkumar37 @Jooong @smaranjitghose @XavierBanos Tam Nguyen @andycraig @mgorsk1 @mbh86 @MHUNCHO @GaelVaroquaux @AmauryLepicard @baluyotraf @pvojnisek @abegong
- Python
Published by sbrugman almost 6 years ago
ydata-profiling - pandas-profiling v2.8.0
pandas-profiling now has built-in support for Files and Images, such as extracting file sizes, creation dates and dimensions, and scanning for truncated images or those containing EXIF information. Moreover, the text analysis features have been reworked, providing more informative statistics.
Read the changelog v2.8.0 for more details.
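The kind of file metadata mentioned above (sizes and dates) can be read with the standard library alone. This is an illustrative sketch rather than the package's implementation; the function name file_summary and the returned keys are assumptions. It reports the modification time because a true creation time is not available on every platform:

```python
from datetime import datetime, timezone
from pathlib import Path
from typing import Any, Dict


def file_summary(path: str) -> Dict[str, Any]:
    """Collect basic metadata for a single file using os-level stat data."""
    file_path = Path(path)
    info = file_path.stat()
    return {
        "name": file_path.name,
        "suffix": file_path.suffix,
        "size_bytes": info.st_size,
        # Modification time as a timezone-aware UTC datetime.
        "modified": datetime.fromtimestamp(info.st_mtime, tz=timezone.utc),
    }
```

Applied to a column of file paths, such a function yields one record per file that can then be aggregated like any other variable.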
Contributors: @loopyme @Bradley-Butcher @willemhendriks, @IscaAy, @frellnick, @dataverz @ieaves
- Python
Published by sbrugman about 6 years ago
ydata-profiling - pandas-profiling v2.7.1
Fix #468 by pinning visions to 0.4.1
- Python
Published by sbrugman about 6 years ago
ydata-profiling - pandas-profiling v2.7.0
Announcement and changelog are available in the documentation.
We are grateful to @loopyme and @kyleYang for creating parts of the features in this release.
Thanks to all contributors who made this release possible: @1313e @dataprofessor @neomatrix369 @jiangfangfangxm @WesleyTheGeolien @NickYi1990 @ricgu8086.
- Python
Published by sbrugman about 6 years ago
ydata-profiling - pandas-profiling v2.6.0
Dependency policy
The current dependency policy is suboptimal. Pinning dependencies is great for reproducibility (a high guarantee that things work), but it requires frequent maintenance and introduces compatibility issues with other packages. Therefore, we are moving away from pinning dependencies and will instead specify a minimum version.
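The difference between a pinned and a minimum-version requirement can be illustrated with a toy checker. This is a simplified sketch that only understands "==" and ">=" with purely numeric versions; real installers follow the full Python version-specifier rules, and the version numbers below are made up:

```python
from typing import Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '1.0.3' into (1, 0, 3) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def satisfies(installed: str, requirement: str) -> bool:
    """Check an installed version against an '==X.Y.Z' or '>=X.Y.Z' requirement."""
    if requirement.startswith("=="):
        return parse_version(installed) == parse_version(requirement[2:])
    if requirement.startswith(">="):
        return parse_version(installed) >= parse_version(requirement[2:])
    raise ValueError(f"unsupported requirement: {requirement}")


# A pin rejects any other version, including newer ones.
pinned_ok = satisfies("1.0.3", "==0.25.3")
# A minimum version accepts newer releases.
minimum_ok = satisfies("1.0.3", ">=0.25.3")

print(pinned_ok, minimum_ok)
```

With a pin, another package that needs a newer release of the same dependency cannot be installed alongside; a lower bound avoids that conflict at the cost of weaker reproducibility.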
Pandas v1
Early releases of pandas v1 had many regressions that broke functionality (as acknowledged by the pandas authors). pandas is now more stable and we see high demand for compatibility, so we now support the latest pandas versions. To ensure compatibility with both major versions, we have extended the test matrix to cover both pandas 0.x.y and 1.x.y.
Python 3.6+ features
Python 3.6 introduces ordered dicts and f-strings, which we now rely on. This means that from pandas-profiling 2.6 onwards you need at least Python 3.6. Users who cannot upgrade can stay on pandas-profiling 2.5.0, but unfortunately will not benefit from updates or maintenance.
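A short illustration of the two language features named above. Note that dict insertion order is an implementation detail of CPython 3.6 and a language guarantee from Python 3.7 onwards. The function name and statistics are hypothetical:

```python
def summarize(stats: dict) -> str:
    """Format statistics in the order in which they were inserted."""
    # f-strings embed expressions and format specs directly in the literal.
    return ", ".join(f"{name}={value:.2f}" for name, value in stats.items())


# Insertion order is preserved, so the output order matches the literal below.
stats = {"mean": 2.5, "std": 1.118, "min": 1.0}
summary = summarize(stats)
print(summary)
```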
Extended continuous integration
Starting from this release, we use GitHub Actions and Travis CI combined to increase maintainability. Travis CI handles the testing, while GitHub Actions automates part of the development process by running black and building the docs.
- Python
Published by sbrugman about 6 years ago
ydata-profiling - pandas-profiling v2.5.0
- Progress bar added (#224)
- Character analysis for Text/NLP (#278)
- Themes: configuration and demos (Orange, Dark)
- Tutorial on modifying the report's structure (#362; #281, #259, #253, #234). This Jupyter notebook also demonstrates how to use the Kaggle API together with pandas-profiling.
- Toggle descriptions at correlations.
Deprecation:
- This is the last version to support Python 3.5.
Stability:
- The order of columns changed when sort="None" (#377, fixed).
- Pandas v1.0.X is not yet supported (#367, #366, #363, #353, pinned pandas to < 1)
- Improved mixed type detection (#351)
- Refactor of report structures.
- Correlations are more stable (e.g. Phi_k color scale now from 0-1, rows and columns with NaN values are dropped, #329).
- Distinct counts exclude NaNs.
- Fixed alerts in notebooks.
Other improvements:
- Warnings are now sorted.
- Links to Binder and Google Colab are added for notebooks (#349)
- The overview section is tabbed.
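The "Distinct counts exclude NaNs" item above is worth a small illustration, because NaN values do not compare equal to each other and can therefore inflate a naive distinct count. A minimal standard-library sketch; the function name distinct_count is hypothetical and this is not the package's implementation:

```python
import math
from typing import Iterable


def distinct_count(values: Iterable) -> int:
    """Count distinct values, skipping None and float NaN."""
    seen = set()
    for value in values:
        if value is None:
            continue
        if isinstance(value, float) and math.isnan(value):
            continue
        seen.add(value)
    return len(seen)


sample = [1.0, 2.0, float("nan"), 2.0, None, float("nan")]
print(distinct_count(sample))
```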
- Python
Published by sbrugman over 6 years ago
ydata-profiling - pandas-profiling v2.4.0
The v2.4.0 release decouples the data structure of reports from the actual rendering. It's now much simpler to change the user interface, whether the user is in a Jupyter notebook, a webpage, a native application or just wants a JSON view of the data.
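The idea of separating a report's structure from its rendering can be sketched as a plain data model plus independent renderer functions. All class and function names here are hypothetical and far simpler than the package's real report structure:

```python
import json
from dataclasses import asdict, dataclass, field
from html import escape
from typing import Dict, List


@dataclass
class Section:
    """One block of the report: a title and a mapping of statistics."""
    title: str
    items: Dict[str, object] = field(default_factory=dict)


@dataclass
class Report:
    """Renderer-independent description of a report."""
    title: str
    sections: List[Section] = field(default_factory=list)


def render_json(report: Report) -> str:
    """Serialise the structure as JSON."""
    return json.dumps(asdict(report), indent=2)


def render_html(report: Report) -> str:
    """Render the same structure as a minimal HTML fragment."""
    parts = [f"<h1>{escape(report.title)}</h1>"]
    for section in report.sections:
        parts.append(f"<h2>{escape(section.title)}</h2>")
        rows = "".join(
            f"<tr><th>{escape(str(key))}</th><td>{escape(str(value))}</td></tr>"
            for key, value in section.items.items()
        )
        parts.append(f"<table>{rows}</table>")
    return "\n".join(parts)


report = Report("Demo", [Section("Overview", {"rows": 3, "columns": 2})])
print(render_json(report))
print(render_html(report))
```

Adding another output target, such as notebook widgets, then means writing one more renderer function without touching how the structure is computed.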
We are also proud to announce that we have been accepted into the GitHub Sponsors programme. If you want to see me continue working on this project, you are cordially invited to support me through this programme. To boost community funding, GitHub will match your contribution!
Other improvements:
- extended configuration with better defaults, including minimal mode for big data (#258, #310)
- more example datasets
- rejection of highly correlated variables is generalized (#284, #299)
- many structural and stability improvements (#254, #274, #239)
Special thanks to @marco-cardoso @ajupton @lvwerra @gliptak @neomatrix369 for their contributions.
- Python
Published by sbrugman over 6 years ago
ydata-profiling - pandas-profiling v2.3.0
- (Experimental) Support for "path" type
- Fix numeric precision (#225)
- Force labels in missing values diagram for large number of columns (#222)
- Add pull request template
- Add Census Dataset from the UCI ML Repository
Thanks @bensdm and @huaiweicheng for your valuable contributions to this version!
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.2.0
New release introducing variable size binning (via astropy), PyCharm integration and various fixes and optimizations.
- Added variable bin sizing via Bayesian Blocks (feature request [#216])
- PyCharm integration, console attempts to detect file type.
- Fixed bug [#215].
- Updated the missingno package to 0.4.2, fixing the font size in the bar diagram.
- Various optimizations
Thanks to: @Utsav37 @mansenfranzen @jakevdp
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.1.2
Fix [#211] and README
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.1.1
- Fix of [#206]
- Improve code maintainability of the view (HTML templates, notebook)
- Fix bug in dendrogram sizing
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.1.0
The pandas-profiling release version 2.1.0 includes:
- Correlations: correlation calculations are now more fault tolerant ([#51] and [#197]), correlation names in the report are clarified.
- Jupyter Notebook: rendering a profiling report is done inside the srcdoc attribute (which fixes [#199]), a full-width option is added and the column layout is improved.
- User experience: The table styling and sample section formatting is improved.
- Warnings: detection added for categorical variable that is suspected to be of the datetime type.
- Documentation and community:
  - The Contribution page helps users that want to contribute.
  - Typos fixed [#195], thank you @abhilashshakti
  - Added more examples.
- Other bugfixes and improvements:
  - Add version information to console interface.
  - Fix: Remove one-time used logger [#202]
  - Fix: Dealing with string indices [#200]
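The warning for categorical variables that are suspected to be datetimes can be approximated by trying to parse the string values. This is a rough sketch under stated assumptions, not the package's detection logic: the function name, the single date format and the 90% threshold are all hypothetical:

```python
from datetime import datetime
from typing import Optional, Sequence


def looks_like_datetime(
    values: Sequence[Optional[str]],
    date_format: str = "%Y-%m-%d",
    threshold: float = 0.9,
) -> bool:
    """Return True when most non-missing strings parse with the given format."""
    present = [value for value in values if value]
    if not present:
        return False
    parsed = 0
    for value in present:
        try:
            datetime.strptime(value, date_format)
            parsed += 1
        except ValueError:
            # Not a date in the expected format; counts against the ratio.
            pass
    return parsed / len(present) >= threshold


print(looks_like_datetime(["2019-01-01", "2019-02-15", None, "2019-03-31"]))
print(looks_like_datetime(["red", "green", "2019-01-01"]))
```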
Contributors: @abhilashshakti @adamrossnelson @manycoding @InsciteAnalytics
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.0.3
Bugfix on version structure for 2.0.2.
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.0.2
Revised version structure, fixed recursion preventing installation of dependencies ([#184]).
The setup.py file used to import utils from the package prior to installation. This caused errors when the dependencies were not yet present.
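A common way to avoid this class of problem is to read the version string from a file as text instead of importing the package during setup. The following is a minimal sketch of that technique, not the project's actual setup.py; the file layout and the __version__ variable name are assumptions:

```python
import re
from pathlib import Path

# Matches a line such as: __version__ = "2.0.3"
VERSION_PATTERN = re.compile(
    r"""^__version__\s*=\s*["']([^"']+)["']""", re.MULTILINE
)


def read_version(version_file: str) -> str:
    """Extract the version string without importing the package.

    The file is read as plain text, so none of the package's
    dependencies need to be installed at build time.
    """
    text = Path(version_file).read_text(encoding="utf-8")
    match = VERSION_PATTERN.search(text)
    if match is None:
        raise RuntimeError(f"no __version__ found in {version_file}")
    return match.group(1)
```

In a setup script the result would be passed on as the version argument, for example version=read_version("mypackage/version.py"), where the path is a placeholder.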
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.0.1
- Add offline support [#177], [#179] and [#180]
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v2.0.0
With 23 commits, 123 files changed and 20+ issues resolved, Pandas Profiling v2.0.0 is a big leap forward.
Thanks to the great contributions from everyone involved! Special thanks to @JosPolfliet @conradoqg @eyaltra.
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v1.4.3
- Fix the correlation images (#160).
Contributors: @kazetof
- Python
Published by sbrugman almost 7 years ago
ydata-profiling - pandas-profiling v1.4.2
- Multiple Bugfixes
- Enable Travis CI builds
Contributors: @Aylr @LeonardAukea @kevanshea @endremborza @romainx @drkarthi
- Python
Published by JosPolfliet about 7 years ago
ydata-profiling - pandas-profiling v1.4.1
Enhancements
- Performance enhancement. It is now possible to disable some resource-heavy operations and achieve better performance (see also #76):
  - Correlation checking by setting check_correlation to False (#43)
  - Recoded checking by setting check_recoded to False.
- Possibility to install using conda
- Implementation of a new Boolean variable type (#25)
- Add new badges for zeros and highly skewed (#63)
- Code refactoring (internal improvement) to split the main module into 4 modules (#65)
- Improve types handling
  - types like list, tuple and dict are now officially unsupported until we improve them
  - mixed columns are also correctly handled
  - New Binary variable type supporting native boolean type and also binary numeric values (#77)
- Warnings column names have link to corresponding detail in variables section in order to ease the navigation (#66)
- Spearman and Pearson Correlation matrix diagrams added in the report (#83)
Bug fixes
- #56 Incorrect calculation of % unique for variables with missing values
- #11 Avoid throwing an error when calling get_rejected_variables while correlation has not been computed
- #68 Avoid setting the matplotlib backend if not necessary
- Python
Published by romainx over 8 years ago
ydata-profiling - Version 1.4
Bug fixes and new check for recoded categorical variables. Thanks to all who contributed!
- Python
Published by JosPolfliet over 9 years ago
ydata-profiling - pandas-profiling v1.3.0
New additions include frequency counts and extreme values for numeric variables. Pandas-profiling now does all 1d-calculations in a multiprocessing fashion, vastly speeding up runtime.
- Python
Published by JosPolfliet over 9 years ago
ydata-profiling - v1.2.0
What's new:
- histograms for date variables
- bug fixes
- Python
Published by JosPolfliet almost 10 years ago
ydata-profiling - pandas-profiling 1.0.0a1
Initial release.
- Python
Published by JosPolfliet over 10 years ago