Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.8%) to scientific vocabulary
Keywords
clustering-evaluation
entity-resolution
evaluation-metrics
link-prediction
r-package
record-linkage
Last synced: 6 months ago
·
JSON representation
Repository
Clustering and Link Prediction Evaluation in R
Basic Info
Statistics
- Stars: 13
- Watchers: 2
- Forks: 3
- Open Issues: 1
- Releases: 3
Topics
clustering-evaluation
entity-resolution
evaluation-metrics
link-prediction
r-package
record-linkage
Created over 5 years ago
· Last pushed over 2 years ago
Metadata Files
Readme
Changelog
License
README.Rmd
---
output: github_document
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# clevr: Clustering and Link Prediction Evaluation in R
clevr implements functions for evaluating link prediction and clustering
algorithms in R. It includes efficient implementations of common performance
measures, such as:
* pairwise precision, recall, F-measure;
* homogeneity, completeness and V-measure;
* (adjusted) Rand index;
* variation of information; and
* mutual information.
While the current focus is on supervised (a.k.a. external) performance
measures, unsupervised (internal) measures are also in scope for future
releases.
## Installation
You can install the latest release from [CRAN](https://CRAN.R-project.org)
by entering:
``` r
install.packages("clevr")
```
The development version can be installed from GitHub using `devtools`:
``` r
# install.packages("devtools")
devtools::install_github("cleanzr/clevr")
```
## Example
Several functions are included which transform between different clustering
representations.
```{r example}
library(clevr)
# A clustering of four records represented as a membership vector
pred_membership <- c("Record1" = 1, "Record2" = 1, "Record3" = 1, "Record4" = 2)
# Represent as a set of record pairs that appear in the same cluster
pred_pairs <- membership_to_pairs(pred_membership)
print(pred_pairs)
# Represent as a list of record clusters
pred_clusters <- membership_to_clusters(pred_membership)
print(pred_clusters)
```
Performance measures are available for evaluating linked pairs:
```{r pair-measures}
true_pairs <- rbind(c("Record1", "Record2"), c("Record3", "Record4"))
pr <- precision_pairs(true_pairs, pred_pairs)
print(pr)
re <- recall_pairs(true_pairs, pred_pairs)
print(re)
```
and for evaluating clusterings:
```{r clust-measures}
true_membership <- c("Record1" = 1, "Record2" = 1, "Record3" = 2, "Record4" = 2)
ari <- adj_rand_index(true_membership, pred_membership)
print(ari)
vi <- variation_info(true_membership, pred_membership)
print(vi)
```
Owner
- Name: cleanzr
- Login: cleanzr
- Kind: organization
- Repositories: 18
- Profile: https://github.com/cleanzr
GitHub Events
Total
- Watch event: 2
Last Year
- Watch event: 2
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 3
- Total pull requests: 3
- Average time to close issues: 2 days
- Average time to close pull requests: 2 days
- Total issue authors: 1
- Total pull request authors: 2
- Average comments per issue: 0.33
- Average comments per pull request: 0.33
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- ngmarchant (3)
Pull Request Authors
- ngmarchant (2)
- OlivierBinette (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- cran 329 last-month
- Total dependent packages: 0
- Total dependent repositories: 2
- Total versions: 2
- Total maintainers: 1
cran.r-project.org: clevr
Clustering and Link Prediction Evaluation in R
- Homepage: https://github.com/cleanzr/clevr
- Documentation: http://cran.r-project.org/web/packages/clevr/clevr.pdf
- License: GPL-2
-
Latest release: 0.1.2
published over 2 years ago
Rankings
Forks count: 14.2%
Dependent repos count: 19.3%
Stargazers count: 19.3%
Average: 23.5%
Dependent packages count: 28.8%
Downloads: 35.9%
Maintainers (1)
Last synced:
6 months ago
Dependencies
DESCRIPTION
cran
- R >= 3.0.2 depends
- Matrix * imports
- Rcpp >= 1.0.5 imports
- stats * imports
- testthat * suggests