clevr

Clustering and Link Prediction Evaluation in R

https://github.com/cleanzr/clevr

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.8%) to scientific vocabulary

Keywords

clustering-evaluation entity-resolution evaluation-metrics link-prediction r-package record-linkage
Last synced: 6 months ago · JSON representation

Repository

Clustering and Link Prediction Evaluation in R

Basic Info
  • Host: GitHub
  • Owner: cleanzr
  • License: gpl-2.0
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 114 KB
Statistics
  • Stars: 13
  • Watchers: 2
  • Forks: 3
  • Open Issues: 1
  • Releases: 3
Topics
clustering-evaluation entity-resolution evaluation-metrics link-prediction r-package record-linkage
Created over 5 years ago · Last pushed over 2 years ago
Metadata Files
Readme Changelog License

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```

# clevr: Clustering and Link Prediction Evaluation in R




clevr implements functions for evaluating link prediction and clustering 
algorithms in R. It includes efficient implementations of common performance 
measures, such as:

* pairwise precision, recall, F-measure;
* homogeneity, completeness and V-measure;
* (adjusted) Rand index;
* variation of information; and
* mutual information.

While the current focus is on supervised (a.k.a. external) performance 
measures, unsupervised (internal) measures are also in scope for future 
releases.

## Installation

You can install the latest release from [CRAN](https://CRAN.R-project.org) 
by entering:

``` r
install.packages("clevr")
```

The development version can be installed from GitHub using `devtools`:

``` r
# install.packages("devtools")
devtools::install_github("cleanzr/clevr")
```

## Example

Several functions are included which transform between different clustering 
representations.

```{r example}
library(clevr)
# A clustering of four records represented as a membership vector
pred_membership <- c("Record1" = 1, "Record2" = 1, "Record3" = 1, "Record4" = 2)

# Represent as a set of record pairs that appear in the same cluster
pred_pairs <- membership_to_pairs(pred_membership)
print(pred_pairs)

# Represent as a list of record clusters
pred_clusters <- membership_to_clusters(pred_membership)
print(pred_clusters)
```

Performance measures are available for evaluating linked pairs:

```{r pair-measures}
true_pairs <- rbind(c("Record1", "Record2"), c("Record3", "Record4"))

pr <- precision_pairs(true_pairs, pred_pairs)
print(pr)

re <- recall_pairs(true_pairs, pred_pairs)
print(re)
```

and for evaluating clusterings:

```{r clust-measures}
true_membership <- c("Record1" = 1, "Record2" = 1, "Record3" = 2, "Record4" = 2)

ari <- adj_rand_index(true_membership, pred_membership)
print(ari)

vi <- variation_info(true_membership, pred_membership)
print(vi)
```

Owner

  • Name: cleanzr
  • Login: cleanzr
  • Kind: organization

GitHub Events

Total
  • Watch event: 2
Last Year
  • Watch event: 2

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 3
  • Total pull requests: 3
  • Average time to close issues: 2 days
  • Average time to close pull requests: 2 days
  • Total issue authors: 1
  • Total pull request authors: 2
  • Average comments per issue: 0.33
  • Average comments per pull request: 0.33
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • ngmarchant (3)
Pull Request Authors
  • ngmarchant (2)
  • OlivierBinette (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 329 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 2
  • Total versions: 2
  • Total maintainers: 1
cran.r-project.org: clevr

Clustering and Link Prediction Evaluation in R

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Downloads: 329 Last month
Rankings
Forks count: 14.2%
Dependent repos count: 19.3%
Stargazers count: 19.3%
Average: 23.5%
Dependent packages count: 28.8%
Downloads: 35.9%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.0.2 depends
  • Matrix * imports
  • Rcpp >= 1.0.5 imports
  • stats * imports
  • testthat * suggests