arulesCBA

Classification Based on Association Rules in R

https://github.com/mhahsler/arulescba

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    2 of 8 committers (25.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.1%) to scientific vocabulary

Keywords

association-rules classification
Last synced: 6 months ago · JSON representation

Repository

Classification Based on Association Rules in R

Basic Info
  • Host: GitHub
  • Owner: mhahsler
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 1.38 MB
Statistics
  • Stars: 3
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Fork of ianstenbit/arulesCBA
Topics
association-rules classification
Created almost 4 years ago · Last pushed 7 months ago
Metadata Files
Readme

README.Rmd

---
output: github_document
---

```{r echo=FALSE, results = 'asis'}
pkg <- 'arulesCBA'

source("https://raw.githubusercontent.com/mhahsler/pkg_helpers/main/pkg_helpers.R")
pkg_title(pkg)
```

The R package [arulesCBA](https://cran.r-project.org/package=arulesCBA) (Hahsler et al, 2020) 
is an extension of the package [arules](https://cran.r-project.org/package=arules) to perform
association rule-based classification. The package provides the infrastructure for class association rules and implements associative classifiers based on the following algorithms:

* __CBA__:    Classification Based on Association Rules (Liu et al, 1998).
* __CMAR__:   Classification based on Multiple Association Rule  (Li, Han and Pei, 2001) via LUCS-KDD Software Library.
* __CPAR__:   Classification based on Predictive Association Rules (Yin and Han, 2003) via LUCS-KDD Software Library.
* __C4.5__:   Rules extracted from a C4.5 decision tree (Quinlan, 1993) via J48 in R/Weka.
* __FOIL__:   First-Order Inductive Learner (Yin and Han, 2003).
* __PART__:   Rules from Partial Decision Trees (Frank and Witten, 1998) via R/Weka.
* __PRM__:    Predictive Rule Mining (Yin and Han, 2003) via LUCS-KDD Software Library.
* __RCAR__:   Regularized Class Association Rules using Logistic Regression (Azmi et al, 2019).
* __RIPPER__: Repeated Incremental Pruning to Produce Error Reduction (Cohen, 1995) via R/Weka.

The package also provides the infrastructure for associative classification (supervised discetization, mining class association rules (CARs)), and implements various association rule-based classification strategies
(first match, majority voting, weighted voting, etc.).

```{r echo=FALSE, results = 'asis'}
pkg_install(pkg)
```
## Usage

```{r}
library("arulesCBA")
data("iris")
```

Learn a classifier.

```{r}
classifier <- CBA(Species ~ ., data = iris)
classifier
```

Inspect the rulebase.

```{r}
inspect(classifier$rules, linebreak = TRUE)
```
  
Make predictions for the first few instances of iris.

```{r}
predict(classifier, head(iris))
```

## Cite This Package AS

* M. Hahsler, I. Johnson, T. Kliegr and J. Kuchar (2019). [Associative Classification in R: arc, arulesCBA, and rCBA](https://journal.r-project.org/archive/2019/RJ-2019-048/). _The R Journal_ 11(2), pp. 254-267.

## References

* M. Azmi, G.C. Runger, and A. Berrado (2019). Interpretable regularized class association rules algorithm for classification in a categorical data space. _Information Sciences,_ Volume 483, May 2019, pp. 313-331.
* W. W. Cohen (1995). Fast effective rule induction. In A. Prieditis and S. Russell (eds.), _Proceedings of the 12th International Conference on Machine Learning,_ pp. 115-123. Morgan Kaufmann. ISBN 1-55860-377-8.
* E. Frank and I. H. Witten (1998). Generating accurate rule sets without global optimization. In J. Shavlik (ed.), _Machine Learning: Proceedings of the Fifteenth International Conference,_ Morgan Kaufmann Publishers: San Francisco, CA.
* W. Li, J. Han and J. Pei (2001). CMAR: accurate and efficient classification based on multiple class-association rules, _Proceedings 2001 IEEE International Conference on Data Mining,_ San Jose, CA, USA, pp. 369-376.
* B. Liu, W. Hsu and Y. Ma (1998). Integrating Classification and Association Rule Mining. _KDD'98 Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining,_ New York, AAAI, pp. 80-86.
* R. Quinlan (1993). _C4.5: Programs for Machine Learning._ Morgan Kaufmann Publishers, San Mateo, CA.
* X. Yin and J. Han (2003). CPAR: Classification based on Predictive Association Rules, _Proceedings of the 2003 SIAM International Conference on Data Minin,_ pp. 331-235.

Owner

  • Name: Michael Hahsler
  • Login: mhahsler
  • Kind: user
  • Location: Dallas, TX
  • Company: SMU

I develop packages for AI, ML, and Data Science.

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 364
  • Total Committers: 8
  • Avg Commits per committer: 45.5
  • Development Distribution Score (DDS): 0.538
Past Year
  • Commits: 3
  • Committers: 1
  • Avg Commits per committer: 3.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Michael Hahsler m****l@h****t 168
Ian Johnson i****n@i****m 155
Ian Johnson i****n@m****m 14
Ian Johnson i****y@a****m 13
Ian Johnson i****n 7
Tyler Giallanza t****a@g****m 3
Michael Hahsler m****r@s****u 3
tylergiallanza t****a@s****u 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 1,589 last-month
  • Total docker downloads: 21,777
  • Total dependent packages: 3
  • Total dependent repositories: 4
  • Total versions: 20
  • Total maintainers: 1
cran.r-project.org: arulesCBA

Classification Based on Association Rules

  • Versions: 20
  • Dependent Packages: 3
  • Dependent Repositories: 4
  • Downloads: 1,589 Last month
  • Docker Downloads: 21,777
Rankings
Dependent packages count: 10.7%
Downloads: 11.2%
Dependent repos count: 14.8%
Average: 17.2%
Forks count: 21.3%
Stargazers count: 28.0%
Maintainers (1)
Last synced: 6 months ago