https://github.com/danielpuentee/outdpik

The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    2 of 3 committers (66.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (19.4%) to scientific vocabulary

Keywords

data-analysis matplotlib numpy python
Last synced: 6 months ago

Repository

Basic Info
Statistics
  • Stars: 3
  • Watchers: 2
  • Forks: 3
  • Open Issues: 0
  • Releases: 0
Topics
data-analysis matplotlib numpy python
Created over 3 years ago · Last pushed over 3 years ago
Metadata Files
Readme License

README.md


outdpik: Fundamental toolkit for outlier analysis and visualization

What is it?

Outdpik is an open-source Python package that provides several methods for outlier detection. It aims to be the fundamental high-level package for this purpose. It also offers visualization methods for outlier analysis.

Main Features

Here are just a few of the things that outdpik does well:

  • Support for NumPy arrays and pandas DataFrames
  • Multiple outlier detection techniques that can be combined
  • Powerful visualizations
  • Flexible selection of one or more columns for the analysis
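To make the idea of combining detection techniques concrete, here is a minimal sketch using only NumPy. It does not use outdpik's API: the helpers `zscore_mask` and `iqr_mask`, the thresholds, and the sample data are all assumptions chosen for illustration, and the methods outdpik actually ships may differ.

```python
import numpy as np


def zscore_mask(values, threshold=3.0):
    """Flag points whose absolute z-score exceeds `threshold` (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    std = values.std()  # population standard deviation (ddof=0)
    if std == 0:
        return np.zeros(values.shape, dtype=bool)
    return np.abs(values - values.mean()) / std > threshold


def iqr_mask(values, k=1.5):
    """Flag points outside Tukey's fences Q1 - k*IQR and Q3 + k*IQR (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])
    spread = q3 - q1
    return (values < q1 - k * spread) | (values > q3 + k * spread)


# Made-up data: a mild outlier (11.0) and an extreme one (25.0).
data = np.array([9.8, 10.1, 10.0, 9.9, 10.2, 11.0, 9.7, 10.3, 10.1, 9.9, 10.0, 25.0])

# Combine the two techniques: intersection is strict, union is lenient.
strict = zscore_mask(data) & iqr_mask(data)
lenient = zscore_mask(data) | iqr_mask(data)

print(np.flatnonzero(strict).tolist())   # [11]
print(np.flatnonzero(lenient).tolist())  # [5, 11]
```

Intersecting the masks keeps only points that every technique agrees on, while the union reports anything flagged by at least one of them.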

Where to get it

The source code is currently hosted on GitHub at: https://github.com/DanielPuentee/outdpik

The installer for the latest released version is available on the Python Package Index (PyPI):

```sh
# PyPI
pip install outdpik
```

How to use outdpik

Examples of configuring and running outdpik:

```python
import outdpik as outdp
outdp = outdp()
```

Next, we detect outliers. The call returns a dictionary that maps each numeric feature to its outlier instances:

```python
outliers_dict = outdp.outliers(df=df, cols="all")
```

Plotting advantages:

```python
outdp.plot_outliers(df=df, col="x")
```

[Figure: strip plot of the detected outliers]
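The usage snippets assume an existing DataFrame `df` and a result shaped as a dictionary of numeric features and their outlier rows. The sketch below builds a small made-up `df` and produces a dictionary of that shape with Tukey's IQR rule. It is an illustration only: `iqr_outliers` is a hypothetical helper, not outdpik's implementation, and the exact structure returned by `outdp.outliers` may differ.

```python
import numpy as np
import pandas as pd


def iqr_outliers(df, cols="all", k=1.5):
    """Return {column: rows of df flagged as outliers by Tukey's IQR rule}.

    Hypothetical helper for illustration; not part of outdpik.
    """
    numeric = df.select_dtypes(include="number")  # ignore non-numeric columns
    if cols != "all":
        numeric = numeric[[cols] if isinstance(cols, str) else list(cols)]
    result = {}
    for name in numeric.columns:
        q1, q3 = numeric[name].quantile([0.25, 0.75])
        spread = q3 - q1
        mask = (numeric[name] < q1 - k * spread) | (numeric[name] > q3 + k * spread)
        result[name] = df.loc[mask]
    return result


# Made-up data: one obvious outlier in "x", none in "y".
df = pd.DataFrame({
    "x": [1.0, 2.0, 3.0, 4.0, 5.0, 100.0],
    "y": [10.0, 11.0, 12.0, 13.0, 14.0, 15.0],
    "label": list("abcdef"),
})

outliers_dict = iqr_outliers(df, cols="all")
print({name: rows.index.tolist() for name, rows in outliers_dict.items()})
# {'x': [5], 'y': []}
```

A DataFrame like this one can also be passed as `df` to the calls shown above, with `col="x"` selecting the column that contains the outlier.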


Dependencies

License

This project is licensed under the terms of a GNU license; see the LICENSE file for details.

Documentation

The official documentation is hosted on: https://outdpik.readthedocs.io/en/latest/

Development

Want to contribute? Great! Open a discussion in this repository on GitHub and we will answer as soon as possible.

Owner

  • Name: Daniel Puente Viejo
  • Login: DanielPuentee
  • Kind: user
  • Location: Bilbao
  • Company: NTT Data

Generative AI Engineer

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 31
  • Total Committers: 3
  • Avg Commits per committer: 10.333
  • Development Distribution Score (DDS): 0.161
Top Committers
| Name | Email | Commits |
|---|---|---|
| DanielPuentee | d****e@a****u | 26 |
| ikumpli | i****o@a****u | 4 |
| Daniel Puente Viejo | 6****e@u****m | 1 |
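The committer statistics above can be reproduced from the per-committer commit counts. The sketch below assumes that the Development Distribution Score is defined as one minus the top committer's share of all commits; that definition is an assumption here, although it does reproduce the reported value.

```python
# Commit counts per committer, taken from the Top Committers list.
commits = {"DanielPuentee": 26, "ikumpli": 4, "Daniel Puente Viejo": 1}

total = sum(commits.values())
avg = total / len(commits)

# Assumed definition: DDS = 1 - (commits by top committer / total commits).
dds = 1 - max(commits.values()) / total

print(total, round(avg, 3), round(dds, 3))  # 31 10.333 0.161
```

A score near 0 means that a single committer wrote almost all commits; here the top committer accounts for 26 of 31.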
Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 12 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 6
  • Total maintainers: 2
pypi.org: outdpik

The fundamental toolkit for outliers search and visualization

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 12 Last month
Rankings
  • Dependent packages count: 6.6%
  • Forks count: 17.3%
  • Stargazers count: 25.5%
  • Average: 29.5%
  • Dependent repos count: 30.6%
  • Downloads: 67.8%
Maintainers (2)
Last synced: 7 months ago

Dependencies

setup.py pypi
  • get *
  • matplotlib *
  • numpy *
  • pandas *
  • scipy *
  • seaborn *