https://github.com/georgedouzas/imbalanced-learn-extra

Implementation of novel oversampling algorithms.

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 6 DOI reference(s) in README
✓
Academic publication links
Links to: sciencedirect.com
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (10.0%) to scientific vocabulary

Keywords

clustering-based-oversampling data-science g-somo geometric-smote imbalanced-learning kmeans-smote machine-learning oversampling python scikit-learn smote

Last synced: 5 months ago · JSON representation

Repository

Implementation of novel oversampling algorithms.

Basic Info

Host: GitHub
Owner: georgedouzas
License: mit
Language: Python
Default Branch: main
Homepage: https://georgedouzas.github.io/imbalanced-learn-extra/
Size: 1.29 MB

Statistics

Stars: 33
Watchers: 2
Forks: 16
Open Issues: 0
Releases: 0

Topics

clustering-based-oversampling data-science g-somo geometric-smote imbalanced-learning kmeans-smote machine-learning oversampling python scikit-learn smote

Created over 6 years ago · Last pushed about 1 year ago

Metadata Files

Readme Changelog Contributing Funding License Code of conduct

imbalanced-learn-extra

| Category | Tools | | ------------------| -------- | | Development | | | Package | version pythonversion downloads | | Documentation | | | Communication | |

Introduction

imbalanced-learn-extra is a Python package that extends imbalanced-learn. It implements algorithms that are not included in imbalanced-learn due to their novelty or lower citation number. The current version includes the following:

A general interface for clustering-based oversampling algorithms.
The Geometric SMOTE algorithm. It is a geometrically enhanced drop-in replacement for SMOTE, that handles numerical as well as categorical features.

Installation

For user installation, imbalanced-learn-extra is currently available on the PyPi's repository, and you can install it via pip:

bash pip install imbalanced-learn-extra

Development installation requires cloning the repository and then using PDM to install the project as well as the main and development dependencies:

bash git clone https://github.com/georgedouzas/imbalanced-learn-extra.git cd imbalanced-learn-extra pdm install

SOM clusterer requires optional dependencies:

bash pip install imbalanced-learn-extra[som]

Usage

All the classes included in imbalanced-learn-extra follow the imbalanced-learn API using the functionality of the base oversampler. Using scikit-learn convention, the data are represented as follows:

Input data X: 2D array-like or sparse matrices.
Targets y: 1D array-like.

The oversamplers implement a fit method to learn from X and y:

python oversampler.fit(X, y)

They also implement a fit_resample method to resample X and y:

python X_resampled, y_resampled = clustering_based_oversampler.fit_resample(X, y)

Citing `imbalanced-learn-extra`

Publications using clustering-based oversampling:

Publications using Geometric-SMOTE:

Douzas, G., Bacao, B. (2019). Geometric SMOTE: a geometrically enhanced drop-in replacement for SMOTE. Information Sciences, 501, 118-135. https://doi.org/10.1016/j.ins.2019.06.007
Fonseca, J., Douzas, G., Bacao, F. (2021). Increasing the Effectiveness of Active Learning: Introducing Artificial Data Generation in Active Learning for Land Use/Land Cover Classification. Remote Sensing, 13(13), 2619. https://doi.org/10.3390/rs13132619
Douzas, G., Bacao, F., Fonseca, J., Khudinyan, M. (2019). Imbalanced Learning in Land Cover Classification: Improving Minority Classes’ Prediction Accuracy Using the Geometric SMOTE Algorithm. Remote Sensing, 11(24), 3040. https://doi.org/10.3390/rs11243040

Owner

Name: George Douzas
Login: georgedouzas
Kind: user
Location: Athens

Website: https://aiandfriends.blog
Repositories: 6
Profile: https://github.com/georgedouzas

Physicist

GitHub Events

Total

Issues event: 2
Watch event: 3
Delete event: 6
Issue comment event: 9
Push event: 32
Pull request event: 17
Fork event: 1
Create event: 13

Last Year

Issues event: 2
Watch event: 3
Delete event: 6
Issue comment event: 9
Push event: 32
Pull request event: 17
Fork event: 1
Create event: 13

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 1
Total pull requests: 11
Average time to close issues: 3 days
Average time to close pull requests: less than a minute
Total issue authors: 1
Total pull request authors: 1
Average comments per issue: 11.0
Average comments per pull request: 0.0
Merged pull requests: 10
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 1
Pull requests: 9
Average time to close issues: 3 days
Average time to close pull requests: less than a minute
Issue authors: 1
Pull request authors: 1
Average comments per issue: 11.0
Average comments per pull request: 0.0
Merged pull requests: 9
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

AkinaMH (1)

Pull Request Authors

georgedouzas (17)

Top Labels

Issue Labels

Pull Request Labels

Packages

Total packages: 1
Total downloads:
- pypi 64 last-month

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 7
Total maintainers: 1

pypi.org: imbalanced-learn-extra

An implementation of novel oversampling algorithms.

Homepage: https://georgedouzas.github.io/imbalanced-learn-extra
Documentation: https://georgedouzas.github.io/imbalanced-learn-extra
License: MIT
Latest release: 0.7.0
published over 1 year ago

Versions: 7
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 64 Last month

Rankings

Dependent packages count: 10.7%

Average: 35.5%

Dependent repos count: 60.4%

Maintainers (1)

gdouzas

Funding

https://github.com/sponsors/georgedouzas

Last synced: 6 months ago

Dependencies

.github/workflows/ci-docs.yml actions

actions/checkout v3 composite
actions/setup-python v4 composite
pdm-project/setup-pdm v3 composite

.github/workflows/ci.yml actions

actions/checkout v3 composite
actions/setup-python v4 composite
pdm-project/setup-pdm v3 composite

.github/workflows/doc.yml actions

actions/checkout v3 composite
actions/configure-pages v3 composite
actions/deploy-pages v1 composite
actions/upload-pages-artifact v1 composite
pdm-project/setup-pdm v3 composite

pyproject.toml pypi

imbalanced-learn >=0.11.0
scikit-learn >=1.3.2
typing-extensions >=4.8.0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/georgedouzas/imbalanced-learn-extra

Science Score: 49.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

imbalanced-learn-extra

Introduction

Installation

Usage

Citing `imbalanced-learn-extra`

Owner

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

pypi.org: imbalanced-learn-extra

Rankings

Maintainers (1)

Funding

Dependencies

https://github.com/georgedouzas/imbalanced-learn-extra

Science Score: 49.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

imbalanced-learn-extra

Introduction

Installation

Usage

Citing imbalanced-learn-extra

Owner

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

pypi.org: imbalanced-learn-extra

Rankings

Maintainers (1)

Funding

Dependencies

Citing `imbalanced-learn-extra`