https://github.com/danielpuentee/outdpik
The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
2 of 3 committers (66.7%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (19.4%) to scientific vocabulary
Keywords
Repository
The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.
Basic Info
- Host: GitHub
- Owner: DanielPuentee
- License: gpl-3.0
- Language: Python
- Default Branch: main
- Homepage: https://pypi.org/project/outdpik/
- Size: 649 KB
Statistics
- Stars: 3
- Watchers: 2
- Forks: 3
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
outdpik: Fundamental toolkit for outlier analysis and visualization
What is it?
Outdpik is an open source Python package that provides different methods for outlier detection. It aims to be the fundamental high-level package for this purpose. Additionally, it offers visualization methods for the outlier analysis.
Main Features
Here are just a few of the things that outdpik does well:
- It supports numpy arrays and pandas dataframes
- Multiple outlier detection techniques that can be combined
- Powerful visualizations
- Flexible at including one or more columns for the analysis
Where to get it
The source code is currently hosted on GitHub at: https://github.com/DanielPuentee/outdpik
Installer for the latest released version is available at the Python Package Index (PyPI)
```sh
PyPI
pip install outdpik ```
How to use outdpik
Examples of configuring and running outpdik:
python
import outpdik as outdp
outdp = outdp()
We proceed to detect outliers returning a dictionary of numeric features and the outliers instances:
python
outliers_dict = outdp.outliers(df = df, cols = "all")
Plotting advantages:
python
outdp.plot_outliers(df = df, col = "x")
Dependencies
- pandas - Provides fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive
- NumPy - Adds support for large, multi-dimensional arrays, matrices and high-level mathematical functions to operate on these arrays
- SciPy - Includes modules for statistics, optimization, integration, linear algebra, Fourier transforms, signal and image processing, ODE solvers, and more
- matplotlib - Comprehensive library for creating static, animated, and interactive visualizations in Python
- seaborn - Provides a high-level interface for drawing attractive statistical graphics
License
This project is licensed under the terms of the GNU - see the LICENSE file for details.
Documentation
The official documentation is hosted on: https://outdpik.readthedocs.io/en/latest/
Development
Want to contribute? Great! Open a discussion in Github in this repo and we will answer as soon as possible.
Owner
- Name: Daniel Puente Viejo
- Login: DanielPuentee
- Kind: user
- Location: Bilbao
- Company: NTT Data
- Website: https://linktr.ee/Danielpv
- Repositories: 7
- Profile: https://github.com/DanielPuentee
Generative AI Engineer
GitHub Events
Total
Last Year
Committers
Last synced: almost 3 years ago
All Time
- Total Commits: 31
- Total Committers: 3
- Avg Commits per committer: 10.333
- Development Distribution Score (DDS): 0.161
Top Committers
| Name | Commits | |
|---|---|---|
| DanielPuentee | d****e@a****u | 26 |
| ikumpli | i****o@a****u | 4 |
| Daniel Puente Viejo | 6****e@u****m | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 12 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 6
- Total maintainers: 2
pypi.org: outdpik
The fundamental toolkit for outliers search and visualization
- Homepage: https://github.com/DanielPuentee/outdpik
- Documentation: https://outdpik.readthedocs.io/
- License: GNU
-
Latest release: 1.5
published over 3 years ago
Rankings
Maintainers (2)
Dependencies
- get *
- matplotlib *
- numpy *
- pandas *
- scipy *
- seaborn *