https://github.com/georgedouzas/imbalanced-learn-extra
Implementation of novel oversampling algorithms.
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 6 DOI reference(s) in README -
✓Academic publication links
Links to: sciencedirect.com -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.0%) to scientific vocabulary
Keywords
Repository
Implementation of novel oversampling algorithms.
Basic Info
- Host: GitHub
- Owner: georgedouzas
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://georgedouzas.github.io/imbalanced-learn-extra/
- Size: 1.29 MB
Statistics
- Stars: 33
- Watchers: 2
- Forks: 16
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
imbalanced-learn-extra
| Category | Tools |
| ------------------| -------- |
| Development |
|
| Package |
|
| Documentation |
|
| Communication |
|
Introduction
imbalanced-learn-extra is a Python package that extends imbalanced-learn. It implements algorithms that are not included in
imbalanced-learn due to their novelty or lower citation number. The current version includes the following:
A general interface for clustering-based oversampling algorithms.
The Geometric SMOTE algorithm. It is a geometrically enhanced drop-in replacement for SMOTE, that handles numerical as well as categorical features.
Installation
For user installation, imbalanced-learn-extra is currently available on the PyPi's repository, and you can
install it via pip:
bash
pip install imbalanced-learn-extra
Development installation requires cloning the repository and then using PDM to install the project as well as the main and development dependencies:
bash
git clone https://github.com/georgedouzas/imbalanced-learn-extra.git
cd imbalanced-learn-extra
pdm install
SOM clusterer requires optional dependencies:
bash
pip install imbalanced-learn-extra[som]
Usage
All the classes included in imbalanced-learn-extra follow the imbalanced-learn API using the functionality of the base
oversampler. Using scikit-learn convention, the data are represented as follows:
- Input data
X: 2D array-like or sparse matrices. - Targets
y: 1D array-like.
The oversamplers implement a fit method to learn from X and y:
python
oversampler.fit(X, y)
They also implement a fit_resample method to resample X and y:
python
X_resampled, y_resampled = clustering_based_oversampler.fit_resample(X, y)
Citing imbalanced-learn-extra
Publications using clustering-based oversampling:
- G. Douzas, F. Bacao, "Self-Organizing Map Oversampling (SOMO) for imbalanced data set learning", Expert Systems with Applications, vol. 82, pp. 40-52, 2017.
- G. Douzas, F. Bacao, F. Last, "Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE", Information Sciences, vol. 465, pp. 1-20, 2018.
- G. Douzas, F. Bacao, F. Last, "G-SOMO: An oversampling approach based on self-organized maps and geometric SMOTE", Expert Systems with Applications, vol. 183,115230, 2021.
Publications using Geometric-SMOTE:
Douzas, G., Bacao, B. (2019). Geometric SMOTE: a geometrically enhanced drop-in replacement for SMOTE. Information Sciences, 501, 118-135. https://doi.org/10.1016/j.ins.2019.06.007
Fonseca, J., Douzas, G., Bacao, F. (2021). Increasing the Effectiveness of Active Learning: Introducing Artificial Data Generation in Active Learning for Land Use/Land Cover Classification. Remote Sensing, 13(13), 2619. https://doi.org/10.3390/rs13132619
Douzas, G., Bacao, F., Fonseca, J., Khudinyan, M. (2019). Imbalanced Learning in Land Cover Classification: Improving Minority Classes’ Prediction Accuracy Using the Geometric SMOTE Algorithm. Remote Sensing, 11(24), 3040. https://doi.org/10.3390/rs11243040
Owner
- Name: George Douzas
- Login: georgedouzas
- Kind: user
- Location: Athens
- Website: https://aiandfriends.blog
- Repositories: 6
- Profile: https://github.com/georgedouzas
Physicist
GitHub Events
Total
- Issues event: 2
- Watch event: 3
- Delete event: 6
- Issue comment event: 9
- Push event: 32
- Pull request event: 17
- Fork event: 1
- Create event: 13
Last Year
- Issues event: 2
- Watch event: 3
- Delete event: 6
- Issue comment event: 9
- Push event: 32
- Pull request event: 17
- Fork event: 1
- Create event: 13
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 1
- Total pull requests: 11
- Average time to close issues: 3 days
- Average time to close pull requests: less than a minute
- Total issue authors: 1
- Total pull request authors: 1
- Average comments per issue: 11.0
- Average comments per pull request: 0.0
- Merged pull requests: 10
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 9
- Average time to close issues: 3 days
- Average time to close pull requests: less than a minute
- Issue authors: 1
- Pull request authors: 1
- Average comments per issue: 11.0
- Average comments per pull request: 0.0
- Merged pull requests: 9
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- AkinaMH (1)
Pull Request Authors
- georgedouzas (17)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 64 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 7
- Total maintainers: 1
pypi.org: imbalanced-learn-extra
An implementation of novel oversampling algorithms.
- Homepage: https://georgedouzas.github.io/imbalanced-learn-extra
- Documentation: https://georgedouzas.github.io/imbalanced-learn-extra
- License: MIT
-
Latest release: 0.7.0
published over 1 year ago
Rankings
Maintainers (1)
Funding
- https://github.com/sponsors/georgedouzas
Dependencies
- actions/checkout v3 composite
- actions/setup-python v4 composite
- pdm-project/setup-pdm v3 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- pdm-project/setup-pdm v3 composite
- actions/checkout v3 composite
- actions/configure-pages v3 composite
- actions/deploy-pages v1 composite
- actions/upload-pages-artifact v1 composite
- pdm-project/setup-pdm v3 composite
- imbalanced-learn >=0.11.0
- scikit-learn >=1.3.2
- typing-extensions >=4.8.0