ggRandomForests

Graphical analysis of random forests with the randomForestSRC, randomForest and ggplot2 packages.

https://github.com/ehrlinger/ggrandomforests

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.9%) to scientific vocabulary
Last synced: 7 months ago · JSON representation

Repository

Graphical analysis of random forests with the randomForestSRC, randomForest and ggplot2 packages.

Basic Info
Statistics
  • Stars: 147
  • Watchers: 9
  • Forks: 31
  • Open Issues: 21
  • Releases: 15
Created about 13 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog License

README.md

ggRandomForests: Visually Exploring Random Forests

cranlogs CRAN_Status_Badge

active

R-CMD-check Codecov test coverage

DOI <!-- badges: end -->

ggRandomForests will help uncover variable associations in the random forests models. The package is designed for use with the randomForest package (A. Liaw and M. Wiener 2002) or the randomForestSRC package (Ishwaran et.al. 2014, 2008, 2007) for survival, regression and classification random forests and uses the ggplot2 package (Wickham 2009) for plotting diagnostic and variable association results. ggRandomForests is structured to extract data objects from randomForestSRC or randomForest objects and provides S3 functions for printing and plotting these objects.

The randomForestSRC package provides a unified treatment of Breiman's (2001) random forests for a variety of data settings. Regression and classification forests are grown when the response is numeric or categorical (factor) while survival and competing risk forests (Ishwaran et al. 2008, 2012) are grown for right-censored survival data. Recently, support for the randomForest package (A. Liaw and M. Wiener 2002) for regression and classification forests has also been added.

Many of the figures created by the ggRandomForests package are also available directly from within the randomForestSRC or randomForest package. However, ggRandomForests offers the following advantages:

  • Separation of data and figures: ggRandomForests contains functions that operate on either the forest object directly, or on the output from randomForestSRC and randomForest post processing functions (i.e. plot.variable, var.select, find.interaction) to generate intermediate ggRandomForests data objects. S3 functions are provide to further process these objects and plot results using the ggplot2 graphics package. Alternatively, users can use these data objects for additional custom plotting or analysis operations.

  • Each data object/figure is a single, self contained object. This allows simple modification and manipulation of the data or ggplot2 objects to meet users specific needs and requirements.

  • The use of ggplot2 for plotting. We chose to use the ggplot2 package for our figures to allow users flexibility in modifying the figures to their liking. Each S3 plot function returns either a single ggplot2 object, or a list of ggplot2 objects, allowing users to use additional ggplot2 functions or themes to modify and customize the figures to their liking.

The package has recently been extended for Breiman and Cutler's Random Forests for Classification and Regression package randomForest where possible. Though methods have been provided for all gg_* functions, the unsupported functions will return an error message indicating where support is still lacking.

References

Breiman, L. (2001). Random forests, Machine Learning, 45:5-32.

Ishwaran H. and Kogalur U.B. (2014). Random Forests for Survival, Regression and Classification (RF-SRC), R package version 1.5.5.

Ishwaran H. and Kogalur U.B. (2007). Random survival forests for R. R News 7(2), 25--31.

Ishwaran H., Kogalur U.B., Blackstone E.H. and Lauer M.S. (2008). Random survival forests. Ann. Appl. Statist. 2(3), 841--860.

A. Liaw and M. Wiener (2002). Classification and Regression by randomForest. R News 2(3), 18--22.

Wickham, H. ggplot2: elegant graphics for data analysis. Springer New York, 2009.

Owner

  • Name: John Ehrlinger
  • Login: ehrlinger
  • Kind: user
  • Location: Cleveland, OH
  • Company: Microsoft @Azure @microsoft

Working Data Scientist. Statistician applying statistical machine learning. Mechanical Engineer Fluid Mechanics and Dynamics. Physics nerd.

GitHub Events

Total
  • Issues event: 1
  • Watch event: 3
  • Delete event: 1
  • Issue comment event: 7
  • Push event: 68
  • Pull request event: 10
  • Fork event: 1
  • Create event: 3
Last Year
  • Issues event: 1
  • Watch event: 3
  • Delete event: 1
  • Issue comment event: 7
  • Push event: 68
  • Pull request event: 10
  • Fork event: 1
  • Create event: 3

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 434
  • Total Committers: 4
  • Avg Commits per committer: 108.5
  • Development Distribution Score (DDS): 0.007
Past Year
  • Commits: 81
  • Committers: 1
  • Avg Commits per committer: 81.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
John Ehrlinger j****r@g****m 431
Romain Francois r****n@r****m 1
timelyportfolio k****l@t****m 1
John Ehrlinger e****j@c****g 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 36
  • Total pull requests: 19
  • Average time to close issues: about 1 year
  • Average time to close pull requests: about 2 months
  • Total issue authors: 15
  • Total pull request authors: 3
  • Average comments per issue: 1.06
  • Average comments per pull request: 0.84
  • Merged pull requests: 15
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 10
  • Average time to close issues: N/A
  • Average time to close pull requests: 6 days
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 3.0
  • Average comments per pull request: 0.7
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • ehrlinger (22)
  • rshouv (1)
  • afb206 (1)
  • martin-weber (1)
  • Hamedhm (1)
  • GGG-del (1)
  • harrisonwsmith (1)
  • TassosDam (1)
  • jgarces02 (1)
  • romainfrancois (1)
  • ds4ci (1)
  • krz (1)
  • daniellemccool (1)
  • subasish (1)
Pull Request Authors
  • ehrlinger (23)
  • romainfrancois (1)
  • timelyportfolio (1)
Top Labels
Issue Labels
enhancement (8) bug (5)
Pull Request Labels

Packages

  • Total packages: 4
  • Total downloads:
    • cran 698 last-month
  • Total docker downloads: 21,777
  • Total dependent packages: 0
    (may contain duplicates)
  • Total dependent repositories: 3
    (may contain duplicates)
  • Total versions: 31
  • Total maintainers: 1
proxy.golang.org: github.com/ehrlinger/ggrandomforests
  • Versions: 8
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 6.5%
Average: 6.7%
Dependent repos count: 6.9%
Last synced: 7 months ago
proxy.golang.org: github.com/ehrlinger/ggRandomForests
  • Versions: 8
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 6.5%
Average: 6.7%
Dependent repos count: 6.9%
Last synced: 7 months ago
cran.r-project.org: ggRandomForests

Visually Exploring Random Forests

  • Versions: 13
  • Dependent Packages: 0
  • Dependent Repositories: 3
  • Downloads: 698 Last month
  • Docker Downloads: 21,777
Rankings
Forks count: 2.5%
Stargazers count: 2.9%
Docker downloads count: 12.6%
Average: 13.6%
Dependent repos count: 16.5%
Downloads: 18.7%
Dependent packages count: 28.7%
Maintainers (1)
Last synced: 7 months ago
conda-forge.org: r-ggrandomforests
  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Stargazers count: 27.5%
Forks count: 27.9%
Dependent repos count: 34.0%
Average: 35.1%
Dependent packages count: 51.2%
Last synced: 7 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.5.0 depends
  • randomForest * depends
  • randomForestSRC >= 1.5.5 depends
  • ggplot2 * imports
  • parallel * imports
  • survival * imports
  • tidyr * imports
  • MASS * suggests
  • RColorBrewer * suggests
  • bookdown * suggests
  • datasets * suggests
  • dplyr * suggests
  • knitr * suggests
  • lintr * suggests
  • plot3D * suggests
  • rmarkdown * suggests
  • rmdformats * suggests
  • testthat * suggests
.github/workflows/R-CMD-check.yaml actions
  • actions/checkout v4 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/check-release.yaml actions
  • actions/checkout v2 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
  • r-lib/actions/setup-tinytex v2 composite
.github/workflows/check-standard.yaml actions
  • actions/checkout v2 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
  • r-lib/actions/setup-tinytex v2 composite
.github/workflows/lint.yaml actions
  • actions/checkout v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/pkgdown.yaml actions
  • JamesIves/github-pages-deploy-action v4.5.0 composite
  • actions/checkout v4 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/pr-commands.yml actions
  • actions/checkout v2 composite
  • r-lib/actions/pr-fetch v2 composite
  • r-lib/actions/pr-push v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/test-coverage.yaml actions
  • actions/checkout v4 composite
  • actions/upload-artifact v4 composite
  • codecov/codecov-action v4 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite