Recent Releases of mixgb
mixgb - Github version v1.5.3
mixgb 1.5.3
Bug fix
- Fix an error in
cbind2(Mis.m, Obs.m)that occurred when the imputed dataset had only a single incomplete variable.- Cause of error: In this scenario,
Mis.m(a matrix of all other incomplete variables except the currently imputed one) was a0x0matrix, which caused error during binding.
- Cause of error: In this scenario,
- R
Published by agnesdeng about 1 year ago
mixgb - CRAN Version 1.5.2
For CRAN Submission
- Significantly faster imputation by optimising data preprocessing and the use of RcppArmadillo
- Visual diagnostic functions have been moved to the
vismipackage - Update Documentation
- R
Published by agnesdeng about 1 year ago
mixgb - Github Version 1.4.0
Optimized mixgb() for large datasets:
- Significantly faster imputation by optimizing data preprocessing and the use of RcppArmadillo
- Enhanced memory efficiency with in-place modifications using data.table
- Bootstrapping option removed from
mixgb(). Users can still use bootstrap in the archived functionmixgb0(). PMMis now set toNULLby default.
- R
Published by agnesdeng over 2 years ago
mixgb - GitHub version 1.3.2
Note: We chose to directly release v1.3.2, skipping the release of v1.3.1. The primary changes for both v1.3.2 and v1.3.1 are detailed below. Please be aware that starting from mixgb 1.3.1, the package requires XGBoost version 2.0.0 or higher.
mixgb 1.3.2
Miscellaneous
- Improves package documentation regarding the import of
xgb.save()andxgb.load()from XGBoost.
mixgb 1.3.1
Compatibility
- Makes the package compatible with XGBoost 2.0.0 with GPU support:
- Introduces a new parameter
device. - Deprecates parameters
gpu_idandpredictor. - Sets
tree_method = "hist"by default, aligning with XGBoost 2.0.0.
- Introduces a new parameter
New Features
- Introduces support for saving imputation models to a local directory through the parameter
save.models.folderinmixgb().- Models save in JSON format using
xgb.save(), a method recommended by XGBoost for future compatibility. - When
save.models.folderis specified, the return object ofmixgb()includes the current imputed datasets, directories for imputation models, and relevant parameters. This object can save usingsaveRDS()as it doesn't directly contain the models. Users can later load this object into R and employimpute_new(object, newdata, ...)for new data imputation.
- Models save in JSON format using
- R
Published by agnesdeng over 2 years ago
mixgb - Github version 1.3.1
## Now compatible with XGBoost 2.0.0
- Added new parameter device
- Removed depreciated parameters gpu_id and predictor
- Set tree_method = "hist" by default, match with XGBoost 2.0.0
## Added feature
- Support saving imputation models in a local directory. Users can specify the directory in the parameter save.models.folder in the main
function mixgb().
- Models will be save as JSON format using xgb.save() internally. Saving XGBoost models in this way instead of using saveRDS in R is recommended by XGBoost. This can ensure that the imputation models can still be used in later release of XGBoost.
- If users specify the save.models.folder, the return object of the main function mixgb() will include the current m sets of imputed datasets, a list of directories of the imputation models, and relevant parameters. Users can save this object using saveRDS() without any concern, as it does not contain the models themselves. For later use, one can load the object into R and impute new data using impute_new(object,newdata,...)
- R
Published by agnesdeng over 2 years ago
mixgb - GitHub version 1.1.0
## Added feature:
- Added an auxiliary function default_params() for the main function mixgb().
This function is designed to validate the list of XGBoost hyperparameters that are provided by the user. If users only want to modify a subset of hyperparameter, they can simply pass in those specific hyperparameters as a list in the xgb_params argument, without having to listing all other hyperparameters along with their default values.
## Bug fixes:
- Bugs related to PMM for multiclass variables.
- Updates plot_hist() and plot_bar() because the dot-dot notation (..prop..) was deprecated in ggplot2 3.4.0.
- use after_stat(density) instead of ..density.. in the function plot_hist()
- use after_stat(prop) instead of ..prop.. in the function plot_bar()
- R
Published by agnesdeng about 3 years ago
mixgb - GitHub version 1.0.3
- Minor change:
- Added an auxiliary function
default_params()for the main functionmixgb(). This function is designed to validate the list of XGBoost hyperparameters that are provided by the user. If users only want to modify a subset of hyperparameter, they can simply pass in those specific hyperparameters as a list in thexgb_paramsargument, without having to listing all other hyperparameters along with their default values.
- Added an auxiliary function
- R
Published by agnesdeng about 3 years ago
mixgb - CRAN version 1.0.2
## New version on CRAN
* Major change of default settings for mixgb().
- Our package has changed from using bootstrapping to subsampling with a default setting of subsample = 0.7. After further investigations, we discovered that although bootstrapping often works effectively, it can introduce bias in certain situations. As a result, we have made subsampling the default method instead of bootstrapping.
Current default settings for mixgb().
- Subsampling (
subsample = 0.7) - No bootstrapping (
bootstrap = FALSE) - Minor change:
- All examples use
nthread = 2to meet the requirement of the CRAN policy.
- Subsampling (
- R
Published by agnesdeng about 3 years ago
mixgb - Github version 1.0.1
Notable changes :
- use subsampling instead of bootstrapping
- Default settings for the main function
mixgb():subsample = 0.7bootstrap = FALSE
- R
Published by agnesdeng about 3 years ago
mixgb - CRAN version 0.1.0
Main feature
- First release on CRAN
- Support single and multiple imputation
- Customisable settings regarding bootstrapping and predictive matching
- Visual diagnostics for multiply imputed data
Notes
- Mac OSX users may not be able to use
mixgbwith multithreading. The packagexgboostrequires OpenMP to use multi-core. For more information please check https://mac.r-project.org/openmp/.
- R
Published by agnesdeng almost 4 years ago