Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 2 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.6%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
Basic Info
- Host: GitHub
- Owner: tmlange
- Language: R
- Default Branch: main
- Size: 1.26 MB
Statistics
- Stars: 2
- Watchers: 1
- Forks: 2
- Open Issues: 0
- Releases: 0
Created almost 2 years ago
· Last pushed about 1 year ago
https://github.com/tmlange/optRF/blob/main/
# optRF: Optimising random forest stability by determining the optimal number of trees[](https://CRAN.R-project.org/package=optRF)   The optRF package provides tools for optimizing the number of trees in a random forest to improve model stability and reproducibility. Since random forest is a non-deterministic method, variable importance and prediction results can vary between runs. The optRF package estimates the stability of random forest based on the number of trees and helps users determine the optimal number of trees required for reliable predictions and variable selection. ## Installation To install the optRF R package from CRAN, just run ``` r install.packages("optRF") ``` R version >= 3.6 is required. You can install the development version of optRF from [GitHub](https://github.com/tmlange/optRF) using `devtools` with: ``` r devtools::install_github("tmlange/optRF") ``` ## Usage The optRF package includes the `SNPdata` data set for demonstration purposes. The two main functions are: * `opt_prediction` Finds the optimal number of trees for stable predictions. * `opt_importance` Finds the optimal number of trees for stable variable importance estimates. ``` r library(optRF) # Load example data set data(SNPdata) # Optimise random forest for predicting the first column in SNPdata result_optpred = opt_prediction(y = SNPdata[,1], X=SNPdata[,-1]) summary(result_optpred) # Optimise random forest for calculating variable importance result_optimp = opt_importance(y = SNPdata[,1], X=SNPdata[,-1]) summary(result_optimp) ``` For detailed examples and explanations, refer to the package vignettes: * `optRF` General package overview * `opt_prediction` Optimizing random forest predictions * `opt_importance` Optimizing random forest variable importance estimation ## Citing optRF If you use optRF in your research, please cite: [Lange, T.M., Gltas, M., Schmitt, A.O. & Heinrich, F. optRF: Optimising random forest stability by determining the optimal number of trees. BMC Bioinformatics 26, 95 (2025).](https://doi.org/10.1186/s12859-025-06097-1)
Owner
- Login: tmlange
- Kind: user
- Repositories: 1
- Profile: https://github.com/tmlange
GitHub Events
Total
- Watch event: 2
- Push event: 5
- Pull request event: 14
- Fork event: 1
Last Year
- Watch event: 2
- Push event: 5
- Pull request event: 14
- Fork event: 1
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 0
- Total pull requests: 7
- Average time to close issues: N/A
- Average time to close pull requests: about 5 hours
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 7
- Average time to close issues: N/A
- Average time to close pull requests: about 5 hours
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- FelixHeinrich (8)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- cran 588 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 4
- Total maintainers: 1
cran.r-project.org: optRF
Optimising Random Forest Stability by Determining the Optimal Number of Trees
- Homepage: https://github.com/tmlange/optRF
- Documentation: http://cran.r-project.org/web/packages/optRF/optRF.pdf
- License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
-
Latest release: 1.2.1
published about 1 year ago
Rankings
Dependent packages count: 28.1%
Dependent repos count: 34.7%
Average: 49.8%
Downloads: 86.6%
Maintainers (1)
Last synced:
10 months ago
Dependencies
DESCRIPTION
cran
- R >= 4.1.2 depends
- irr >= 0.84.1 depends
- minpack.lm >= 1.2 depends
- ranger >= 0.16.0 depends
- graphics * imports
- methods * imports
- stats * imports
- covr * suggests
- spelling * suggests
- testthat * suggests
[](https://CRAN.R-project.org/package=optRF)


The optRF package provides tools for optimizing the number of trees in a random forest to improve model stability and reproducibility. Since random forest is a non-deterministic method, variable importance and prediction results can vary between runs. The optRF package estimates the stability of random forest based on the number of trees and helps users determine the optimal number of trees required for reliable predictions and variable selection.
## Installation
To install the optRF R package from CRAN, just run
``` r
install.packages("optRF")
```
R version >= 3.6 is required.
You can install the development version of optRF from [GitHub](https://github.com/tmlange/optRF) using `devtools` with:
``` r
devtools::install_github("tmlange/optRF")
```
## Usage
The optRF package includes the `SNPdata` data set for demonstration purposes. The two main functions are:
* `opt_prediction` Finds the optimal number of trees for stable predictions.
* `opt_importance` Finds the optimal number of trees for stable variable importance estimates.
``` r
library(optRF)
# Load example data set
data(SNPdata)
# Optimise random forest for predicting the first column in SNPdata
result_optpred = opt_prediction(y = SNPdata[,1], X=SNPdata[,-1])
summary(result_optpred)
# Optimise random forest for calculating variable importance
result_optimp = opt_importance(y = SNPdata[,1], X=SNPdata[,-1])
summary(result_optimp)
```
For detailed examples and explanations, refer to the package vignettes:
* `optRF` General package overview
* `opt_prediction` Optimizing random forest predictions
* `opt_importance` Optimizing random forest variable importance estimation
## Citing optRF
If you use optRF in your research, please cite:
[Lange, T.M., Gltas, M., Schmitt, A.O. & Heinrich, F. optRF: Optimising random forest stability by determining the optimal number of trees. BMC Bioinformatics 26, 95 (2025).](https://doi.org/10.1186/s12859-025-06097-1)