fuzzyclara
fuzzyclara: Efficient Medoid-based Clustering Algorithms for Large and Fuzzy Data - Published in JOSS (2025)
Science Score: 93.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 2 DOI reference(s) in README and JOSS metadata -
✓Academic publication links
Links to: joss.theoj.org -
○Academic email domains
-
○Institutional organization owner
-
✓JOSS paper metadata
Published in Journal of Open Source Software
Repository
Efficient medoid-based clustering algorithms for large and fuzzy data
Basic Info
- Host: GitHub
- Owner: bauer-alex
- License: other
- Language: R
- Default Branch: main
- Homepage: https://bauer-alex.github.io/fuzzyclara/
- Size: 23.5 MB
Statistics
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
fuzzyclara 
Efficient and fuzzy clustering based on the CLARA algorithm
- Authors: Maximilian Weigert, Alexander Bauer, Jana Gauss
- Contributors: Theresa Kriecherbauer, Asmik Nalmpatian
- Version: 1.0.1
Aim of this Package
The fuzzyclara package tackles two issues of cluster analysis applications.
First, it includes routines for fuzzy clustering which avoid the common hard
clustering assumption that each observation is a clear member of one sole
cluster. Instead, membership probabilities indicate to which extent the
characteristics of each observation are shaped by the characteristics of several
'typical' clusters. Second, the estimation of classical clustering algorithms
is often only hardly or not at all feasible in large data situations with
thousands of observations. Subsampling-based algorithms building on the CLARA
algorithm are implemented to make the estimation feasible in such situations.
Building on these two points, the 'fuzzyclara' package offers routines for all
aspects of a cluster analysis, including the use of user-defined distance
functions and diverse visualization techniques.
Documentation and Useful Materials
To get an overview of the functionalities of the package, check out the JOSS publication or the package vignette.
Installation
The most current version from GitHub can be installed via
``` r devtools::install_github("bauer-alex/fuzzyclara")
potential installation problems (specifically on MacOS) might be resolved
by previously specifically installing some dependency packages
install.packages(c("vegclust", "ggwordcloud", "ggpubr", "factoextra")) ```
How to Contribute
If you encounter problems with the package, find bugs or have suggestions for additional functionalities please open a GitHub issue. Alternatively, feel free to contact us directly via email.
Contributions (via pull requests or otherwise) are welcome. Please adhere to the Advanced R style guide when contributing code. Before you open a pull request or share your updates with us, please make sure that all unit tests pass without errors or warning messages. You can run the unit tests by calling
r
devtools::test()
Owner
- Name: Alexander Bauer
- Login: bauer-alex
- Kind: user
- Repositories: 11
- Profile: https://github.com/bauer-alex
JOSS Publication
fuzzyclara: Efficient Medoid-based Clustering Algorithms for Large and Fuzzy Data
Authors
Tags
cluster analysis high-dimensional data fuzzy clusteringGitHub Events
Total
- Release event: 1
- Issues event: 2
- Watch event: 1
- Push event: 85
Last Year
- Release event: 1
- Issues event: 2
- Watch event: 1
- Push event: 85
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 23
- Total pull requests: 0
- Average time to close issues: 4 months
- Average time to close pull requests: N/A
- Total issue authors: 2
- Total pull request authors: 0
- Average comments per issue: 0.74
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: less than a minute
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- ghost (13)
- bauer-alex (9)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- R >= 2.10 depends
- Matrix * imports
- checkmate * imports
- cluster * imports
- dplyr * imports
- factoextra * imports
- ggplot2 * imports
- ggpubr * imports
- ggwordcloud * imports
- parallel * imports
- proxy * imports
- scales * imports
- tibble * imports
- tidyr * imports
- tidyselect * imports
- vegclust * imports
- knitr * suggests
- rmarkdown * suggests
- shiny * suggests
- testthat >= 3.0.0 suggests
- actions/checkout v2 composite
- actions/upload-artifact main composite
- r-lib/actions/check-r-package v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
- actions/checkout v2 composite
- r-lib/actions/setup-pandoc v2 composite
- r-lib/actions/setup-r v2 composite
- r-lib/actions/setup-r-dependencies v2 composite
