Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.6%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: tmlange
  • Language: R
  • Default Branch: main
  • Size: 1.26 MB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 2
  • Open Issues: 0
  • Releases: 0
Created almost 2 years ago · Last pushed about 1 year ago

https://github.com/tmlange/optRF/blob/main/

# optRF: Optimising random forest stability by determining the optimal number of trees  


[![CRAN status](https://www.r-pkg.org/badges/version/optRF?color=green)](https://CRAN.R-project.org/package=optRF)
![CRAN Downloads month](https://cranlogs.r-pkg.org/badges/optRF?color=brightgreen)
![CRAN Downloads overall](https://cranlogs.r-pkg.org/badges/grand-total/optRF?color=brightgreen)


The optRF package provides tools for optimizing the number of trees in a random forest to improve model stability and reproducibility. Since random forest is a non-deterministic method, variable importance and prediction results can vary between runs. The optRF package estimates the stability of random forest based on the number of trees and helps users determine the optimal number of trees required for reliable predictions and variable selection.

## Installation
To install the optRF R package from CRAN, just run

``` r
install.packages("optRF")
```
R version >= 3.6 is required.  
You can install the development version of optRF from [GitHub](https://github.com/tmlange/optRF) using `devtools` with:

``` r
devtools::install_github("tmlange/optRF")
```

## Usage

The optRF package includes the `SNPdata` data set for demonstration purposes. The two main functions are:

* `opt_prediction`  Finds the optimal number of trees for stable predictions.
* `opt_importance`  Finds the optimal number of trees for stable variable importance estimates.

``` r
library(optRF)

# Load example data set
data(SNPdata)

# Optimise random forest for predicting the first column in SNPdata
result_optpred = opt_prediction(y = SNPdata[,1], X=SNPdata[,-1])
summary(result_optpred)

# Optimise random forest for calculating variable importance
result_optimp = opt_importance(y = SNPdata[,1], X=SNPdata[,-1]) 
summary(result_optimp)
```
For detailed examples and explanations, refer to the package vignettes:  

* `optRF`  General package overview  
* `opt_prediction`  Optimizing random forest predictions  
* `opt_importance`  Optimizing random forest variable importance estimation  

## Citing optRF
If you use optRF in your research, please cite:  
[Lange, T.M., Gltas, M., Schmitt, A.O. & Heinrich, F. optRF: Optimising random forest stability by determining the optimal number of trees. BMC Bioinformatics 26, 95 (2025).](https://doi.org/10.1186/s12859-025-06097-1)

Owner

  • Login: tmlange
  • Kind: user

GitHub Events

Total
  • Watch event: 2
  • Push event: 5
  • Pull request event: 14
  • Fork event: 1
Last Year
  • Watch event: 2
  • Push event: 5
  • Pull request event: 14
  • Fork event: 1

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 0
  • Total pull requests: 7
  • Average time to close issues: N/A
  • Average time to close pull requests: about 5 hours
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 7
  • Average time to close issues: N/A
  • Average time to close pull requests: about 5 hours
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • FelixHeinrich (8)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 588 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 4
  • Total maintainers: 1
cran.r-project.org: optRF

Optimising Random Forest Stability by Determining the Optimal Number of Trees

  • Versions: 4
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 588 Last month
Rankings
Dependent packages count: 28.1%
Dependent repos count: 34.7%
Average: 49.8%
Downloads: 86.6%
Last synced: 10 months ago

Dependencies

DESCRIPTION cran
  • R >= 4.1.2 depends
  • irr >= 0.84.1 depends
  • minpack.lm >= 1.2 depends
  • ranger >= 0.16.0 depends
  • graphics * imports
  • methods * imports
  • stats * imports
  • covr * suggests
  • spelling * suggests
  • testthat * suggests