https://github.com/georgedouzas/imbalanced-learn-extra

Implementation of novel oversampling algorithms.

https://github.com/georgedouzas/imbalanced-learn-extra

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 6 DOI reference(s) in README
  • Academic publication links
    Links to: sciencedirect.com
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.0%) to scientific vocabulary

Keywords

clustering-based-oversampling data-science g-somo geometric-smote imbalanced-learning kmeans-smote machine-learning oversampling python scikit-learn smote
Last synced: 5 months ago · JSON representation

Repository

Implementation of novel oversampling algorithms.

Basic Info
Statistics
  • Stars: 33
  • Watchers: 2
  • Forks: 16
  • Open Issues: 0
  • Releases: 0
Topics
clustering-based-oversampling data-science g-somo geometric-smote imbalanced-learning kmeans-smote machine-learning oversampling python scikit-learn smote
Created over 6 years ago · Last pushed about 1 year ago
Metadata Files
Readme Changelog Contributing Funding License Code of conduct

README.md

imbalanced-learn-extra

ci doc

| Category | Tools | | ------------------| -------- | | Development | black ruff mypy docformatter | | Package | version pythonversion downloads | | Documentation | mkdocs| | Communication | gitter discussions |

Introduction

imbalanced-learn-extra is a Python package that extends imbalanced-learn. It implements algorithms that are not included in imbalanced-learn due to their novelty or lower citation number. The current version includes the following:

  • A general interface for clustering-based oversampling algorithms.

  • The Geometric SMOTE algorithm. It is a geometrically enhanced drop-in replacement for SMOTE, that handles numerical as well as categorical features.

Installation

For user installation, imbalanced-learn-extra is currently available on the PyPi's repository, and you can install it via pip:

bash pip install imbalanced-learn-extra

Development installation requires cloning the repository and then using PDM to install the project as well as the main and development dependencies:

bash git clone https://github.com/georgedouzas/imbalanced-learn-extra.git cd imbalanced-learn-extra pdm install

SOM clusterer requires optional dependencies:

bash pip install imbalanced-learn-extra[som]

Usage

All the classes included in imbalanced-learn-extra follow the imbalanced-learn API using the functionality of the base oversampler. Using scikit-learn convention, the data are represented as follows:

  • Input data X: 2D array-like or sparse matrices.
  • Targets y: 1D array-like.

The oversamplers implement a fit method to learn from X and y:

python oversampler.fit(X, y)

They also implement a fit_resample method to resample X and y:

python X_resampled, y_resampled = clustering_based_oversampler.fit_resample(X, y)

Citing imbalanced-learn-extra

Publications using clustering-based oversampling:

Publications using Geometric-SMOTE:

  • Douzas, G., Bacao, B. (2019). Geometric SMOTE: a geometrically enhanced drop-in replacement for SMOTE. Information Sciences, 501, 118-135. https://doi.org/10.1016/j.ins.2019.06.007

  • Fonseca, J., Douzas, G., Bacao, F. (2021). Increasing the Effectiveness of Active Learning: Introducing Artificial Data Generation in Active Learning for Land Use/Land Cover Classification. Remote Sensing, 13(13), 2619. https://doi.org/10.3390/rs13132619

  • Douzas, G., Bacao, F., Fonseca, J., Khudinyan, M. (2019). Imbalanced Learning in Land Cover Classification: Improving Minority Classes’ Prediction Accuracy Using the Geometric SMOTE Algorithm. Remote Sensing, 11(24), 3040. https://doi.org/10.3390/rs11243040

Owner

  • Name: George Douzas
  • Login: georgedouzas
  • Kind: user
  • Location: Athens

Physicist

GitHub Events

Total
  • Issues event: 2
  • Watch event: 3
  • Delete event: 6
  • Issue comment event: 9
  • Push event: 32
  • Pull request event: 17
  • Fork event: 1
  • Create event: 13
Last Year
  • Issues event: 2
  • Watch event: 3
  • Delete event: 6
  • Issue comment event: 9
  • Push event: 32
  • Pull request event: 17
  • Fork event: 1
  • Create event: 13

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 1
  • Total pull requests: 11
  • Average time to close issues: 3 days
  • Average time to close pull requests: less than a minute
  • Total issue authors: 1
  • Total pull request authors: 1
  • Average comments per issue: 11.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 10
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 9
  • Average time to close issues: 3 days
  • Average time to close pull requests: less than a minute
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 11.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 9
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • AkinaMH (1)
Pull Request Authors
  • georgedouzas (17)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 64 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 7
  • Total maintainers: 1
pypi.org: imbalanced-learn-extra

An implementation of novel oversampling algorithms.

  • Versions: 7
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 64 Last month
Rankings
Dependent packages count: 10.7%
Average: 35.5%
Dependent repos count: 60.4%
Maintainers (1)
Funding
  • https://github.com/sponsors/georgedouzas
Last synced: 6 months ago

Dependencies

.github/workflows/ci-docs.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • pdm-project/setup-pdm v3 composite
.github/workflows/ci.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • pdm-project/setup-pdm v3 composite
.github/workflows/doc.yml actions
  • actions/checkout v3 composite
  • actions/configure-pages v3 composite
  • actions/deploy-pages v1 composite
  • actions/upload-pages-artifact v1 composite
  • pdm-project/setup-pdm v3 composite
pyproject.toml pypi
  • imbalanced-learn >=0.11.0
  • scikit-learn >=1.3.2
  • typing-extensions >=4.8.0