Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: DirSwe-Art
  • License: bsd-3-clause
  • Language: Python
  • Default Branch: main
  • Size: 22.9 MB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

This is a description of the implementation of research accepted for publication at the IEEE ICECCME 2024.

Reference paper: D. Sweidan (2024). "Accurate, Even at Random: Exploring Multiple Distinct Data Clusterings via Random Projection". In proceeding: The IEEE 4th International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), November 04-06, 2024, Maldives.

The project is about a method for exploring different hidden insights in one given data, i.e., discovering different clustering solutions in the same data set. The approach employs random projection, where the original data is transformed (projected) multiple times into other spaces with randomly defined and linearly independent basis vectors. Each projection leads to obtaining one clustering solution using, e.g., a Gaussian Mixture Model with a specified number of components. The obtained clustering solutions from the multiple projections are further clustered using a hierarchical agglomerative clustering approach to group similar solutions. Each group of these solutions is aggregated into one representative clustering of the original data. Ultimately, the method explores several distinct and independent representative solutions that can be presented to the user.

To support the research community, the repository will include detailed documentation and examples to help users understand and apply the method. We are working on the optimization phase for large-dimensional data processing efficiency. We encourage others to use and extend our work.

The author.

Owner

  • Name: DirarSweidan
  • Login: DirSwe-Art
  • Kind: user
  • Location: Sweden

Citation (CITATION.cff)

cff-version: 1.2.0
message: If you use this software, please cite it using these metadata.
authors:
  - given-names: Dirar
    family-names: Sweidan
    email: dirar.sweidan.research@gmail.com
    orcid: 'https://orcid.org/0000-0001-5378-0862'
title: Multiple Clusterings via Random Projection
type: software
keywords:
  - "multiple clusterings, random projection, clusterings dissimilarity, clusterings aggregation"
  - research
repository-code: "https://github.com/DirSwe-Art/mClustRandomProjection.git"
license: BSD-3-Clause

GitHub Events

Total
  • Push event: 37
Last Year
  • Push event: 37