tree.interpreter

Decision tree interpreter for randomForest/ranger as described in

https://github.com/nalzok/tree.interpreter

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.9%) to scientific vocabulary

Keywords

data-science datascience interpretability machine-learning r random-forest
Last synced: 6 months ago · JSON representation

Repository

Decision tree interpreter for randomForest/ranger as described in

Basic Info
Statistics
  • Stars: 12
  • Watchers: 4
  • Forks: 4
  • Open Issues: 3
  • Releases: 0
Topics
data-science datascience interpretability machine-learning r random-forest
Created over 6 years ago · Last pushed about 6 years ago
Metadata Files
Readme License

README.md

tree.interpreter

Travis build status AppVeyor build status Codecov test coverage <!-- badges: end -->

An R re-implementation of the treeinterpreter package on PyPI. Each prediction can be decomposed as 'prediction = bias + feature_1_contribution + ... + feature_n_contribution'. This decomposition is then used to calculate the Mean Decrease Impurity (MDI) and Mean Decrease Impurity using out-of-bag samples (MDI-oob) feature importance measures based on the work of Li et al. (2019) arXiv:1906.10845.

Installation

To install the CRAN version, run

r install.packages('tree.interpreter')

To install the latest development version, run

r devtools::install_github('nalzok/tree.interpreter')

macOS users might want to follow the set up instructions by The Coatless Professor to minimize operational headaches and maximize computational performance.

Usage

For example, you can calculate the state-of-the-art MDI-oob feature importance measure for ranger. See vignette('MDI', package='tree.interpreter') for more information.

```r library(ranger) library(tree.interpreter)

set.seed(42L) rfobj <- ranger(mpg ~ ., mtcars, keep.inbag = TRUE) tidy.RF <- tidyRF(rfobj, mtcars[, -1], mtcars[, 1]) mtcars.MDIoob <- MDIoob(tidy.RF, mtcars[, -1], mtcars[, 1]) mtcars.MDIoob ```

References

This package companies the paper A Debiased MDI Feature Importance Measure for Random Forests.

Owner

  • Name: Qingyao Sun
  • Login: nalzok
  • Kind: user
  • Location: Chicago, IL
  • Company: The University of Chicago

A STAT major obsessed with machines.

GitHub Events

Total
Last Year

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 55
  • Total Committers: 2
  • Avg Commits per committer: 27.5
  • Development Distribution Score (DDS): 0.018
Top Committers
Name Email Commits
Qingyao Sun s****5@i****m 54
Marvin Wright g****b@w****e 1
Committer Domains (Top 20 + Academic)
wrig.de: 1

Packages

  • Total packages: 1
  • Total downloads:
    • cran 520 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 2
  • Total maintainers: 1
cran.r-project.org: tree.interpreter

Random Forest Prediction Decomposition and Feature Importance Measure

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 520 Last month
Rankings
Downloads: 7.9%
Forks count: 12.8%
Stargazers count: 16.3%
Average: 20.5%
Dependent packages count: 29.8%
Dependent repos count: 35.5%
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • Rcpp >= 1.0.2 imports
  • MASS * suggests
  • covr * suggests
  • knitr * suggests
  • randomForest * suggests
  • ranger * suggests
  • rmarkdown * suggests
  • testthat >= 2.1.0 suggests