clusteval
Clusteval provides methods for unsupervised cluster validation
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: zenodo.org -
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.9%) to scientific vocabulary
Keywords
Repository
Clusteval provides methods for unsupervised cluster validation
Basic Info
- Host: GitHub
- Owner: erdogant
- License: other
- Language: Jupyter Notebook
- Default Branch: master
- Homepage: https://erdogant.github.io/clusteval
- Size: 27.5 MB
Statistics
- Stars: 60
- Watchers: 2
- Forks: 8
- Open Issues: 2
- Releases: 24
Topics
Metadata Files
README.md
clusteval
clusteval is a python package that is developed to evaluate detected clusters and return the cluster labels that have most optimal clustering tendency, Number of clusters and clustering quality. Multiple evaluation strategies are implemented for the evaluation; silhouette, dbindex, and derivative, and four clustering methods can be used: agglomerative, kmeans, dbscan and hdbscan.
⭐️ Star this repo if you like it ⭐️
📘 Blogs
- A step-by-step guide for clustering images
- Detection of Duplicate Images Using Image Hash Functions
- From Data to Clusters: When is Your Clustering Good Enough?
- From Clusters To Insights; The Next Step
📚 Documentation
Full documentation is available at erdogant.github.io/clusteval, including examples and API references.
⚙️ Installation
It is advisable to use a virtual environment:
bash
conda create -n env_clusteval python=3.12
conda activate env_clusteval
Install via PyPI:
bash
pip install clusteval
To upgrade to the latest version:
bash
pip install --upgrade clusteval
Import the library:
python
from clusteval import clusteval
💡 Examples
A structured overview is available in the documentation.
Silhouette Score |
Optimal Clusters |
Dendrogram |
Davies-Bouldin Index |
Derivative Method |
DBSCAN |
HDBSCAN A |
HDBSCAN B |
📖 Citation
Please cite clusteval in your publications if it has been helpful in your research. Citation information is available at the top right of the GitHub page.
🔍 Related Tools & Blogs
- Use ARI when clustering contains large equal-sized clusters
- Use AMI for unbalanced clusters with small components
- Adjusted Rand Score — scikit-learn
- Adjusted for Chance Measures — scikit-learn
- imagededup GitHub repo
- Clustering images by visual similarity
- Facebook DeepCluster
- PCA on Hyperspectral Data
- Face Recognition with PCA
☕ Support
If you find this project useful, consider supporting me:
Owner
- Name: Erdogan
- Login: erdogant
- Kind: user
- Location: Den Haag
- Website: https://erdogant.github.io/
- Repositories: 51
- Profile: https://github.com/erdogant
Machine Learning | Statistics | Bayesian | D3js | Visualizations
Citation (CITATION.cff)
# YAML 1.2
---
authors:
-
family-names: Taskesen
given-names: Erdogan
orcid: "https://orcid.org/0000-0002-3430-9618"
cff-version: "1.1.0"
date-released: 2020-09-01
keywords:
- "Python"
- "Clustering"
- "Validation"
- "Dbindex"
- "Silhouette method"
- "Unsupervised"
license: "MIT"
message: "If you use this software, please cite it using these metadata."
repository-code: "https://erdogant.github.io/clusteval"
title: "clusteval is a python package for unsupervised cluster validation."
version: "0.1.0"
...
GitHub Events
Total
- Create event: 2
- Release event: 3
- Issues event: 1
- Watch event: 3
- Issue comment event: 1
- Push event: 13
Last Year
- Create event: 2
- Release event: 3
- Issues event: 1
- Watch event: 3
- Issue comment event: 1
- Push event: 13
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| erdogant | e****t@g****m | 257 |
| Matt Payne | 1****e | 1 |
Issues and Pull Requests
Last synced: 9 months ago
All Time
- Total issues: 10
- Total pull requests: 1
- Average time to close issues: 3 months
- Average time to close pull requests: about 3 hours
- Total issue authors: 10
- Total pull request authors: 1
- Average comments per issue: 1.8
- Average comments per pull request: 1.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 1.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- naty88 (1)
- Tyrannas (1)
- Michael-E-Rose (1)
- junefeld (1)
- nitinvinayak (1)
- toopazo (1)
- LaiaBigFish (1)
- programmer2710 (1)
- ydennisy (1)
- Jainu-s (1)
Pull Request Authors
- Matthew-J-Payne (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- pipinstallsphinx_rtd_theme *
- pytest * development
- rst2pdf * development
- sphinx * development
- sphinx_rtd_theme * development
- spyder-kernels ==2.2. development
- hdbscan *
- matplotlib *
- numpy *
- pandas *
- pypickle *
- seaborn *
- sklearn *
- tqdm *
- wget *
- matplotlib *
- numpy *
- pandas *
- pypickle *
- seaborn *
- sklearn *
- tqdm *
- wget *
- actions/checkout v2 composite
- github/codeql-action/analyze v1 composite
- github/codeql-action/autobuild v1 composite
- github/codeql-action/init v1 composite
- actions/checkout v2 composite
- conda-incubator/setup-miniconda v2 composite