tree.interpreter
Decision tree interpreter for randomForest/ranger as described in
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.9%) to scientific vocabulary
Keywords
Repository
Decision tree interpreter for randomForest/ranger as described in
Basic Info
- Host: GitHub
- Owner: nalzok
- License: other
- Language: R
- Default Branch: master
- Homepage: https://arxiv.org/abs/1906.10845
- Size: 139 KB
Statistics
- Stars: 12
- Watchers: 4
- Forks: 4
- Open Issues: 3
- Releases: 0
Topics
Metadata Files
README.md
tree.interpreter
An R re-implementation of the treeinterpreter package on PyPI. Each prediction can be decomposed as 'prediction = bias + feature_1_contribution + ... + feature_n_contribution'. This decomposition is then used to calculate the Mean Decrease Impurity (MDI) and Mean Decrease Impurity using out-of-bag samples (MDI-oob) feature importance measures based on the work of Li et al. (2019) arXiv:1906.10845.
Installation
To install the CRAN version, run
r
install.packages('tree.interpreter')
To install the latest development version, run
r
devtools::install_github('nalzok/tree.interpreter')
macOS users might want to follow the set up instructions by The Coatless Professor to minimize operational headaches and maximize computational performance.
Usage
For example, you can calculate the state-of-the-art MDI-oob feature importance
measure for ranger. See vignette('MDI', package='tree.interpreter') for
more information.
```r library(ranger) library(tree.interpreter)
set.seed(42L) rfobj <- ranger(mpg ~ ., mtcars, keep.inbag = TRUE) tidy.RF <- tidyRF(rfobj, mtcars[, -1], mtcars[, 1]) mtcars.MDIoob <- MDIoob(tidy.RF, mtcars[, -1], mtcars[, 1]) mtcars.MDIoob ```
References
This package companies the paper A Debiased MDI Feature Importance Measure for Random Forests.
Owner
- Name: Qingyao Sun
- Login: nalzok
- Kind: user
- Location: Chicago, IL
- Company: The University of Chicago
- Repositories: 8
- Profile: https://github.com/nalzok
A STAT major obsessed with machines.
GitHub Events
Total
Last Year
Committers
Last synced: almost 3 years ago
All Time
- Total Commits: 55
- Total Committers: 2
- Avg Commits per committer: 27.5
- Development Distribution Score (DDS): 0.018
Top Committers
| Name | Commits | |
|---|---|---|
| Qingyao Sun | s****5@i****m | 54 |
| Marvin Wright | g****b@w****e | 1 |
Committer Domains (Top 20 + Academic)
Packages
- Total packages: 1
-
Total downloads:
- cran 520 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 2
- Total maintainers: 1
cran.r-project.org: tree.interpreter
Random Forest Prediction Decomposition and Feature Importance Measure
- Homepage: https://github.com/nalzok/tree.interpreter
- Documentation: http://cran.r-project.org/web/packages/tree.interpreter/tree.interpreter.pdf
- License: MIT + file LICENSE
-
Latest release: 0.1.1
published about 6 years ago
Rankings
Maintainers (1)
Dependencies
- Rcpp >= 1.0.2 imports
- MASS * suggests
- covr * suggests
- knitr * suggests
- randomForest * suggests
- ranger * suggests
- rmarkdown * suggests
- testthat >= 2.1.0 suggests