clustering-benchmarks

A framework for benchmarking clustering algorithms

https://github.com/gagolews/clustering-benchmarks

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.9%) to scientific vocabulary

Keywords

benchmark-suite benchmarking cluster cluster-analysis clustering clustering-algorithms clustering-benchmarks clustering-evaluation data data-science dataset datasets ground-truth machine-learning
Last synced: 6 months ago · JSON representation ·

Repository

A framework for benchmarking clustering algorithms

Basic Info
Statistics
  • Stars: 43
  • Watchers: 4
  • Forks: 8
  • Open Issues: 1
  • Releases: 0
Topics
benchmark-suite benchmarking cluster cluster-analysis clustering clustering-algorithms clustering-benchmarks clustering-evaluation data data-science dataset datasets ground-truth machine-learning
Created almost 6 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog License Code of conduct Citation

README.md

A Framework for Benchmarking Clustering Algorithms

Maintained/edited/authored by Marek Gagolewski.

This project aims to:

  • aggregate, polish, and standardise the existing clustering benchmark batteries referred to across the machine learning and data mining literature,
  • introduce new datasets of different dimensionalities, sizes, and cluster types,
  • propose a consistent methodology for evaluating clustering algorithms.

See https://clustering-benchmarks.gagolewski.com/ for more details.

How to cite: Gagolewski M., A framework for benchmarking clustering algorithms, SoftwareX 20, 2022, 101270, https://clustering-benchmarks.gagolewski.com, DOI: 10.1016/j.softx.2022.101270.

Owner

  • Name: Marek Gagolewski
  • Login: gagolews
  • Kind: user
  • Location: Melbourne, VIC, Australia
  • Company: Deakin University

Free universities!

Citation (CITATION.cff)

cff-version: 1.2.0
message: "Please cite this package as follows."
authors:
  - family-names: Gagolewski
    given-names: Marek
    orcid: "https://orcid.org/0000-0003-0637-6028"
repository-code: "https://github.com/gagolews/clustering-benchmarks"
title: "A framework for benchmarking clustering algorithms"
doi: 10.1016/j.softx.2022.101270
preferred-citation:
    type: article
    year: 2022
    url: "https://clustering-benchmarks.gagolewski.com/"
    doi: 10.1016/j.softx.2022.101270
    title: "A framework for benchmarking clustering algorithms"
    journal: "SoftwareX"
    volume: 20
    issue: 101270
    authors:
    -   family-names: Gagolewski
        given-names: Marek
        orcid: "https://orcid.org/0000-0003-0637-6028"

GitHub Events

Total
  • Release event: 1
  • Watch event: 7
  • Issue comment event: 2
  • Push event: 18
  • Pull request event: 5
  • Fork event: 1
  • Create event: 1
Last Year
  • Release event: 1
  • Watch event: 7
  • Issue comment event: 2
  • Push event: 18
  • Pull request event: 5
  • Fork event: 1
  • Create event: 1

Committers

Last synced: 11 months ago

All Time
  • Total Commits: 138
  • Total Committers: 1
  • Avg Commits per committer: 138.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 18
  • Committers: 1
  • Avg Commits per committer: 18.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
gagolews m****i@g****m 138

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 5
  • Total pull requests: 2
  • Average time to close issues: 8 months
  • Average time to close pull requests: 3 days
  • Total issue authors: 4
  • Total pull request authors: 1
  • Average comments per issue: 3.8
  • Average comments per pull request: 1.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 2
  • Average time to close issues: N/A
  • Average time to close pull requests: 3 days
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 1.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • gagolews (2)
  • studentz (1)
  • prabhant (1)
  • yerbles (1)
Pull Request Authors
  • tstoeter (4)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 159 last-month
  • Total dependent packages: 1
  • Total dependent repositories: 1
  • Total versions: 7
  • Total maintainers: 1
pypi.org: clustering-benchmarks

A Framework for Benchmarking Clustering Algorithms

  • Versions: 7
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Downloads: 159 Last month
Rankings
Dependent packages count: 4.8%
Stargazers count: 11.8%
Forks count: 13.3%
Average: 13.8%
Downloads: 17.6%
Dependent repos count: 21.5%
Maintainers (1)
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • cython *
  • genieclust *
  • matplotlib *
  • numpy *
  • scikit-learn *
  • scipy *
setup.py pypi
  • genieclust *
  • matplotlib *
  • natsort *
  • numpy *
  • pandas *
  • scipy *
.github/workflows/py.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v1 composite