GMCM

Unsupervised Clustering and Meta-analysis using Gaussian Mixture Copula Models

https://github.com/aebilgrau/gmcm

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: arxiv.org, ieee.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.2%) to scientific vocabulary

Keywords

clustering cran gaussian-mixture-models meta-analysis r rank unsupervised-cluster-analysis
Last synced: 6 months ago · JSON representation

Repository

Unsupervised Clustering and Meta-analysis using Gaussian Mixture Copula Models

Basic Info
Statistics
  • Stars: 15
  • Watchers: 3
  • Forks: 3
  • Open Issues: 23
  • Releases: 0
Topics
clustering cran gaussian-mixture-models meta-analysis r rank unsupervised-cluster-analysis
Created about 12 years ago · Last pushed over 4 years ago
Metadata Files
Readme

README.md

Project Status: Active - The project has reached a stable, usable state and is being actively developed. Build Status CRAN version CRAN RStudio mirror downloads Total CRAN downloads Coverage Status

GMCM

Fast estimation of Gaussian Mixture Copula Models

The GMCM package (Bilgrau et. al., 2016) offers R functions that very fast perform high-dimensional meta-analysis (Li et. al., 2011) and general unsupervised cluster analysis (Tewari et. al., 2011) using Gaussian Copula Mixture Models. Online documentation is available here.

Gaussian copula mixture models (GMCMs) are a very flexible alternative to regular Gaussian mixture models (GMMs) in unsupervised cluster analysis of continuous data where non-normal clusters are present. GMCMs models the ranks of the observed data and are thus invariant to monotone increasing transformations of the data, i.e. they are semi-parametric and only the ordering of the data is important providing needed flexibility. A special-case of GMCMs can be used for a novel meta-analysis approach in high-dimensional settings. In this context, the model tries to cluster results into two groups which agree and do not agree on statistical evidence. These two groups corresponds to a reproducible and irreproducible group.

The optimization of the complicated likelihood function is difficult, however. GMCM utilizes Rcpp and RcppArmadillo to evaluate the likelihood function quickly and arrive at a parameter estimate using either standard numerical optimization routines or an pseudo EM algorithm.

Additional information, documentation, help, and examples can be found by here or by running ?GMCM in R. The paper[1] is also found as a vignette by vignette("GMCM-JStatSoft") or the official website online.. The core user functions of GMCM are fit.full.GMCM and fit.meta.GMCM.

Installation

The released and tested version of GMCM is available at CRAN (Comprehensive R Archive Network). It can be installed from within R by running

R install.packages("GMCM")

If you wish to install the latest version of GMCM directly from the master branch at GitHub, run

```R

install.packages("remotes") # Install remotes if needed

remotes::install_github("AEBilgrau/GMCM") ```

Note, that this version is in development and is likely different from the version at CRAN. As such, it may be unstable. Be sure that you have the package development prerequisites if you wish to install the package from the source.

When installed, run GMCM::runGMCM() to launch a local instance of the GMCM shiny application also available online at shinyapps.io. Run news(package = "GMCM") to view the latest changes of GMCM or visit here.

For previous versions of GMCM, visit the old releases at GitHub or the archive at CRAN.

Usage

As noted above, the usage of GMCM comes in two different applications; one general and one special.

Special GMCMs

An example of using the package to fit special GMCMs for meta analysis of is described here vignette("usage-example-special-model"). This model is a specific special case of the general GMCMs.

General GMCMs

An example of unsupervised clustering using the package is found with vignette("usage-example-general-model") for general purposes.

Shiny

The package also provides a graphical user interface via Shiny for both its uses. See vignette("usage-shiny-graphical-interface").

References

  1. Anders Ellern Bilgrau, Poul Svante Eriksen, Jakob Gulddahl Rasmussen, Hans Erik Johnsen, Karen Dybkaer, Martin Boegsted (2016). GMCM: Unsupervised Clustering and Meta-Analysis Using Gaussian Mixture Copula Models. Journal of Statistical Software, 70(2), 1-23. doi:10.18637/jss.v070.i02

Owner

  • Name: Anders Ellern Bilgrau
  • Login: AEBilgrau
  • Kind: user
  • Location: Denmark

GitHub Events

Total
Last Year

Committers

Last synced: over 1 year ago

All Time
  • Total Commits: 597
  • Total Committers: 2
  • Avg Commits per committer: 298.5
  • Development Distribution Score (DDS): 0.106
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Anders E Bilgrau a****u@g****m 534
Anders Ellern Bilgrau a****u@m****k 63
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 52
  • Total pull requests: 11
  • Average time to close issues: 5 months
  • Average time to close pull requests: 1 day
  • Total issue authors: 3
  • Total pull request authors: 1
  • Average comments per issue: 0.31
  • Average comments per pull request: 0.09
  • Merged pull requests: 11
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • AEBilgrau (50)
  • eliotshekhtman (1)
  • karan (1)
Pull Request Authors
  • AEBilgrau (11)
Top Labels
Issue Labels
enhancement (33) bug (15) shiny (15) docs (6) site (5)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 390 last-month
  • Total docker downloads: 20,392
  • Total dependent packages: 0
  • Total dependent repositories: 2
  • Total versions: 13
  • Total maintainers: 1
cran.r-project.org: GMCM

Fast Estimation of Gaussian Mixture Copula Models

  • Versions: 13
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Downloads: 390 Last month
  • Docker Downloads: 20,392
Rankings
Docker downloads count: 12.6%
Forks count: 14.2%
Stargazers count: 15.1%
Dependent repos count: 19.3%
Average: 20.2%
Dependent packages count: 28.8%
Downloads: 31.0%
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • Rcpp >= 0.10.6 imports
  • ellipse * imports
  • DT * suggests
  • Hmisc * suggests
  • RColorBrewer * suggests
  • foreach * suggests
  • idr * suggests
  • jpeg * suggests
  • knitr * suggests
  • rhandsontable * suggests
  • rmarkdown * suggests
  • shiny * suggests
  • shinyBS * suggests
  • shinydashboard * suggests
  • testthat >= 0.3 suggests