hdbscan
hdbscan: Hierarchical density based clustering - Published in JOSS (2017)
Persistable
Persistable: persistent and stable clustering - Published in JOSS (2023)
Local clustering
Local clustering - Published in JOSS (2018)
gmm_diag and gmm_full
gmm_diag and gmm_full: C++ classes for multi-threaded Gaussian mixture models and Expectation-Maximisation - Published in JOSS (2017)
networkanalysis-ts
TypeScript port of the Java networkanalysis package that provides data structures and algorithms for network analysis.
publicationclassification
Java package for creating a multi-level classification of scientific publications based on citation links between publications.
genieclust
Genie: Fast and Robust Hierarchical Clustering with Noise Point Detection - in Python and R
unsupervised_analysis
A general-purpose Snakemake workflow and MrBiomics module to perform unsupervised analyses (dimensionality reduction and cluster analysis) and visualizations of high-dimensional data.
kmodes
Python implementations of the k-modes and k-prototypes clustering algorithms, for clustering categorical data
itclust
Iterative transfer learning with neural network improves clustering and cell type classification in single-cell RNA-seq analysis
s4rdm3x
s4rdm3x: A Tool Suite to Explore Code to Architecture Mapping Techniques - Published in JOSS (2021)
icellr
Single (i) Cell R package (iCellR) is an interactive R package to work with high-throughput single-cell sequencing technologies (e.g., scRNA-seq, scVDJ-seq, scATAC-seq, CITE-Seq and Spatial Transcriptomics (ST)).
https://github.com/aryashah2k/datalogy-customer-segmentation-data-science-internship
A Repository Maintaining My Summer Internship Work At Datalogy As A Data Science Intern Working On Customer Segmentation Models Using Hierarchical Clustering, K-Means Clustering And Identifying Loyal Customers Based On Creation Of Recency, Frequency, Monetary (RFM) Matrix.
https://github.com/chris-santiago/kmeans
A simple implementation of K-Means & K-Medoids Clustering
comstream
In this project, we implemented a topic detection system for Twitter. The system reads tweets from a data stream and assigns each one to an existing cluster or to a new one. Each cluster acts as an agent, which makes the proposed approach a multi-agent system. A coordinator monitors the whole system and coordinates the agents.
cluster_htrc
Identifying the boundaries of the main content of fiction and non-fiction works in the HathiTrust Extracted Features dataset.
corrp
corrp: An R package for multiple correlation-like analysis and clustering in mixed data - Published in JOSS (2025)
https://github.com/cwtsleiden/networkanalysis
Java package that provides data structures and algorithms for network analysis.
https://github.com/acil-group/dvfa
Dual Vigilance Fuzzy ART - Companion MATLAB Code
silhouette
Silhouette-Based Diagnostics for Standard, Soft, and Multi-Way Clustering
https://github.com/acil-group/ddvfa
Distributed Dual Vigilance Fuzzy ART - Companion MATLAB Code
torchsom
TorchSOM is a PyTorch-based library for training Self-Organizing Maps (SOMs), a type of unsupervised learning algorithm used for clustering and dimensionality reduction. Designed for scalability and ease of use, this implementation is optimized for large-scale data.