Recent Releases of empulse
empulse - 0.9.0
Added
optimal_thresholdandoptimal_ratemethods to calculate the optimal threshold(s) and optimal predicted positive rate for a given metric. This is useful for determining the best decision threshold and predicted positive rate for a cost-sensitive or value-driven model.CSTreeClassifier,CSForestClassifier, andCSBaggingClassifiercan now take aMetricinstance as their criterion to optimize.CSThresholdClassifiercan now take aMetricinstance to choose the optimal decision threshold.RobustCSClassifiercan now take estimators with aMetricinstance as the loss function or criterion.RobustCSClassifierwill treat any cost marked as outlier sensitive. This can be done by using themark_outlier_sensitivemethod.Allow savings metrics to be used in
CSBoostClassifierandCSLogitClassifieras the objective function. Internally, the expected cost loss is used to train the model, since the expected savings score is just a transformation of the expected cost loss.kindargument toMetrichas been replaced bystrategy. TheMetricclass now takes aMetricStrategyinstance. This change allows for more flexibility in defining the metric strategy. The currently available strategies are:Fix error when importing Empulse without any optional dependencies installed.
Fix
CSLogitClassifiernot properly using the gradient when using a custom loss function fromMetric.Fix models throwing errors when differently shaped costs are passed to the fit or predict method.
Fix sympy distribution parameters not being properly translated to scipy distribution parameters when using the
MaxProfitstrategy (formerlykind='max profit') with the quasi monte-carlo integration method.
- Python
Published by github-actions[bot] 12 months ago
empulse - 0.8.0
- Feature:
CSBoostClassifier,CSLogitClassifier, andProfLogitClassifiercan now take a ,Metric, instance as their loss function. Internally, the metric instance is converted to the appropriate loss function for the model. For more information, read the User Guide. - Feature: Type hints are now available for all functions and classes.
- Enhancement: Add support for more than one stochastic variable when building maximum profit metrics with
Metric - Enhancement: Allow
Metricto be used as a context manager. This ensures the metric is always built after defining the cost-benefit elements. - Fix: Fix datasets not properly being packaged together with the package
- Fix: Fix
RobustCSClassifierwhen array-like parameters are passed to fit method. - Fix: Fix boosting models being biased towards the positive class.
Full Changelog: https://github.com/ShimantoRahman/empulse/compare/0.7.0...0.8.0
- Python
Published by github-actions[bot] about 1 year ago
empulse - 0.7.0
Full Changelog: https://github.com/ShimantoRahman/empulse/compare/0.6.0...0.7.0
- MajorFeature: Add
empulse.models.CSTreeClassifier,empulse.models.CSForestClassifier, andempulse.models.CSBaggingClassifierto support cost-sensitive decision tree and ensemble models - Enhancement: Add support for scikit-learn 1.5.2 (previously Empulse only supported scikit-learn 1.6.0 and above).
- API: Removed the
emp_scoreandempfunctions from theempulse.metricsmodule. Use theempulse.metrics.Metricclass instead to define custom expected maximum profit measures. For more information, read the User Guide. - API: Removed numba as a dependency for Empulse. This will reduce the installation time and the size of the package.
- Fix: Fix
empulse.metrics.Metricwhen defining stochastic variable with fixed values. - Fix: Fix
empulse.metrics.Metricwhen stochastic variable has infinite bounds. - Fix: Fix
empulse.models.CSThresholdClassifierwhen costs of predicting positive and negative classes are equal. - Fix: Fix documentation linking issues to sklearn
- Python
Published by github-actions[bot] over 1 year ago
empulse - 0.6.0
- Add
Metricclass to easily build your own value-driven and cost-sensitive metrics - Add support for LightGBM and Catboost models in
CSBoostClassifierandB2BoostClassifier make_objective_churnandmake_objective_acquisitionnow take amodelargument to calculate the objective for either XGBoost, LightGBM or Catboost models.- XGBoost is now an optional dependency together with LightGBM and Catboost. To install the package with
XGBoost, LightGBM and Catboost support, use the following command:
pip install empulse[optional] Renamed
y_pred_baselineandy_proba_baselinetobaselineinsavings_scoreandexpected_savings_score. It now accepts the following arguments:- If
'zero_one', the baseline model is a naive model that predicts all zeros or all ones depending on which is better. - If
'prior', the baseline model is a model that predicts the prior probability of the majority or minority class depending on which is better (not available for savings score). - If array-like, target probabilities of the baseline model.
- If
Add parameter validation for all models and samplers
Make all arguments of dataset loaders keyword-only
Update the descriptions attached to each dataset to match information found in the user guide
Improve type hints for functions and classes
Full Changelog: https://github.com/ShimantoRahman/empulse/compare/0.5.2...0.6.0
- Python
Published by github-actions[bot] over 1 year ago
empulse - 0.5.2
- Allow savingsscore and expectedsavingsscore to calculate the savings score over the baseline model instead of a naive model, by setting the ypredbaseline and yproba_baseline parameters, respectively.
- Reworked the user guide documentation to better explain the usage of value-driven and cost-sensitive models, samplers and metrics
- CSLogitClassifier and ProfLogitClassifier by default do not perform soft-thresholding on the regression coefficients. This can be enabled by setting the soft_threshold parameter to True.
- Prevent division by zero errors in expectedcostloss
Full Changelog: https://github.com/ShimantoRahman/empulse/compare/0.5.1...0.5.2
- Python
Published by github-actions[bot] over 1 year ago
empulse - 0.5.0
- Added supported for python 3.13
- Added cost-sensitive models
- CSLogitClassifier
- CSBoostClassifier
- RobustCSClassifier
- CSThresholdClassifier
- Added cost-sensitive metrics
- cost_loss
- expectedcostloss
- expectedlogcost_loss
- savings_score
- expectedsavingsscore
- Added cost-sensitive sampler
- CostSensitiveSampler
- Added datasets module
- rename metric arguments which expect target score from ypred to yscore and target probabilities from ypred to yproba.
- Allow all cost-sensitive models and samplers to accept cost parameters during initialization
Full Changelog: https://github.com/ShimantoRahman/empulse/compare/0.4.6...0.5.0
- Python
Published by github-actions[bot] over 1 year ago
empulse - Empulse 0.4.0
Added the Area Under the Expected Profit Curve metric.
- Python
Published by ShimantoRahman over 1 year ago