Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: sciencedirect.com
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.3%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: iiconocimiento
  • License: mit
  • Language: Python
  • Default Branch: main
  • Size: 528 KB
Statistics
  • Stars: 7
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme License Citation

README.md

RLBoost

Data quality or data evaluation is sometimes a task as important as collecting a large volume of data when it comes to generating accurate artificial intelligence models. In fact, being able to evaluate the data can lead to a larger database that is better suited to a particular problem, because we gain the ability to filter out automatically collected data of dubious quality.

RLBoost is an algorithm that uses deep reinforcement learning strategies to evaluate a particular dataset and obtain a model capable of estimating the quality of any new data, in order to improve the final predictive quality of a supervised learning model. This solution has the advantage of being agnostic regarding the supervised model used and, through multi-attention strategies, it takes the data into account in its context and not only individually.

Installation

To install rlboost, simply run: `pip install git+https://github.com/iiconocimiento/rlboost`

Standalone usage example

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from stable_baselines3 import PPO

from rlboost import RLBoostOutlierDetector

# Getting classification data
data = load_breast_cancer(as_frame=True)
df = data["data"]
df["target"] = data["target"]

# Building datasets
df_train, df_test = train_test_split(df, test_size=0.3, train_size=0.7)
df_train, df_valid = train_test_split(df_train, test_size=0.5, train_size=0.5)

X_train, y_train = df_train.drop(columns=["target"]).values, df_train["target"].values
X_valid, y_valid = df_valid.drop(columns=["target"]).values, df_valid["target"].values
X_test, y_test = df_test.drop(columns=["target"]).values, df_test["target"].values

# RLBoost fitting
# Estimator to refit iteratively (override score function as needed)
estimator = LogisticRegression()

rlboost = RLBoostOutlierDetector(X_valid, y_valid,
                                 agent_class=PPO,
                                 steps=10,
                                 estimator=estimator,
                                 agent_batch_size=64,
                                 use_vf_encoding=True,
                                 use_score_base=True,
                                 device="cpu")
rlboost.fit(X_train, y_train)

# RLBoost evaluating data
# (usually only train, but you can evaluate any data you want)
data_values_train = rlboost.get_data_values(X_train, y_train)
data_values_valid = rlboost.get_data_values(X_valid, y_valid)
data_values_test = rlboost.get_data_values(X_test, y_test)
print("Done")
```

Explanation

RLBoost is a data evaluation algorithm based on deep reinforcement learning. The algorithm tries to maximize the difference between the score of an estimator trained on a batch of N samples drawn from the dataset and the score of the same estimator trained on that batch after filtering.

This score is calculated with the default score method of the proposed estimator (usually accuracy for classification problems).
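The reward signal described above can be sketched with a toy estimator. Note that `selection_reward` and `MajorityClassifier` below are illustrative stand-ins, not part of the rlboost API; in the real algorithm the mask is the agent's action and the estimator is whatever you pass as the `estimator` parameter:

```python
class MajorityClassifier:
    """Minimal stand-in estimator: predicts the most common training label."""
    def fit(self, X, y):
        self.label_ = max(set(y), key=list(y).count)
        return self

    def score(self, X, y):
        # Fraction of validation labels matching the predicted label.
        return sum(1 for label in y if label == self.label_) / len(y)


def selection_reward(make_estimator, X_train, y_train, mask, X_valid, y_valid):
    """Score gain from training only on samples selected by `mask` (one
    Boolean per sample) versus training on the full batch."""
    base = make_estimator().fit(X_train, y_train).score(X_valid, y_valid)
    X_sel = [x for x, keep in zip(X_train, mask) if keep]
    y_sel = [label for label, keep in zip(y_train, mask) if keep]
    filtered = make_estimator().fit(X_sel, y_sel).score(X_valid, y_valid)
    return filtered - base
```

A positive reward means the agent's selection improved validation performance over using the whole batch; keeping everything yields a reward of zero.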

Parameters

  • agent_class: stable_baselines3.common.on_policy_algorithm.OnPolicyAlgorithm
    • Reinforcement learning algorithm to use. It has to be an actor-critic algorithm that accepts Boolean actions.
  • steps: int
    • Number of steps to run the algorithm. 1e4 is recommended.
  • estimator: sklearn.base.BaseEstimator
    • Estimator to wrap and use to evaluate the data.
  • agent_batch_size: int
    • Number of samples used in each iteration. Usually 200 should be fine.
  • use_vf_encoding: bool
    • Whether to use a TransformerEncoder for the value function estimation. False for speedup, but true is recommended.
  • use_score_base: bool
    • Whether to use the baseline score for the value function estimation. True is recommended.
  • device: str
    • Device to train the model. You can choose from cpu, cuda:0, cuda:1, etc.
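Once `get_data_values` has produced per-sample scores, a common follow-up is to keep only the high-value samples before refitting the estimator. A minimal sketch, assuming values can be compared against a cut-off (the `filter_by_value` helper and the 0.5 threshold are illustrative, not part of the rlboost API):

```python
def filter_by_value(X, y, data_values, threshold=0.5):
    """Keep only the samples whose estimated data value exceeds `threshold`.

    Illustrative helper: the threshold and this function are assumptions,
    not part of the rlboost API.
    """
    kept = [(x, label) for x, label, v in zip(X, y, data_values) if v > threshold]
    if not kept:
        return [], []
    X_filtered, y_filtered = map(list, zip(*kept))
    return X_filtered, y_filtered
```

The filtered arrays can then be passed to the wrapped estimator's `fit`, e.g. `estimator.fit(*filter_by_value(X_train, y_train, data_values_train))`.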

Use Cases

  • Data quality checker.
  • Automatic data augmentation.

Limitations

  • Only classification problems (for now).
  • The validation set has to be highly reliable.

Financial support

The authors acknowledge financial support from project PID2022-139856NB-I00 funded by MCIN/ AEI / 10.13039/501100011033 / FEDER UE, the Instituto de Ingeniería del Conocimiento (IIC), and the project IA4TES - Inteligencia Artificial para la Transición Energética Sostenible funded by Ministry of Economic Affairs and Digital Transformation (MIA.2021.M04.0008).

Citation

@article{BATANERO2025128815,
  title    = {RLBoost: Boosting supervised models using deep reinforcement learning},
  journal  = {Neurocomputing},
  volume   = {618},
  pages    = {128815},
  year     = {2025},
  issn     = {0925-2312},
  doi      = {10.1016/j.neucom.2024.128815},
  url      = {https://www.sciencedirect.com/science/article/pii/S0925231224015868},
  author   = {Eloy Anguiano Batanero and Ángela Fernández Pascual and Álvaro Barbero Jiménez},
  keywords = {Data quality, Data valuation, Deep learning, Reinforcement learning, Supervised learning, Multi-attention},
  abstract = {Data quality or evaluation can sometimes be a task as important as collecting a large volume of data when it comes to generating accurate artificial intelligence models. Being able to evaluate the data can lead to a larger database that is better suited to a particular problem because we have the ability to filter out data of dubious quality obtained automatically. In this paper, we present RLBoost (Anguiano et al., 2024) [1], an algorithm that uses deep reinforcement learning strategies to evaluate a particular dataset and obtain a model capable of estimating the quality of any new data in order to improve the final predictive quality of a supervised learning model. This solution has the advantage of being agnostic regarding the supervised model used and, through multi-attention strategies, takes into account the data in its context and not only individually. The results of the article show that this model obtains better and more stable results than other state-of-the-art algorithms such as Leave One Out, Shapley or Data Valuation using Reinforcement Learning.}
}

Owner

  • Name: Instituto de Ingeniería del Conocimiento
  • Login: iiconocimiento
  • Kind: organization
  • Email: iic@iic.uam.es
  • Location: https://goo.gl/maps/zjR8CQ8fkzT2

A private R&D&I center pioneering Artificial Intelligence, with 30 years of experience in data analysis, Big Data, NLP, and Machine Learning.

Citation (CITATION.cff)

cff-version: 1.2.0
title: >-
  RLBoost: Boosting Supervised Models using Deep
  Reinforcement Learning
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: Eloy
    family-names: Anguiano Batanero
    email: eloy.anguiano@iic.uam.es
    affiliation: Instituto de Ingeniería del Conocimiento
    orcid: 'https://orcid.org/0000-0001-8516-5629'
  - given-names: Ángela
    family-names: Fernández Pascual
    email: a.fernandez@uam.es
    affiliation: Universidad Autónoma de Madrid
  - given-names: Álvaro
    family-names: Barbero Jiménez
    email: alvaro.barbero@iic.uam.es
    affiliation: Instituto de Ingeniería del Conocimiento
repository-code: 'https://github.com/iiconocimiento/rlboost'
license: MIT
commit: 1.0.0
version: 1.0.0
date-released: '2024-09-10'

GitHub Events

Total
  • Watch event: 3
  • Push event: 1
  • Fork event: 1
Last Year
  • Watch event: 3
  • Push event: 1
  • Fork event: 1

Dependencies

pyproject.toml pypi
  • coverage 6.5.0 develop
  • pytest 7.3.1 develop
  • pytest-cov 3.0.0 develop
  • pytest-ruff 0.0.5 develop
  • gym 0.21.0
  • python >=3.9, <3.12
  • scikit-learn >= 1
  • stable-baselines3 <=1.1.0
setup.py pypi