mbkmeans

K-means clustering for large single-cell datasets

https://github.com/drisso/mbkmeans

Science Score: 33.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: plos.org
  • Committers with academic emails
    2 of 7 committers (28.6%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.8%) to scientific vocabulary

Keywords

clustering human-cell-atlas
Last synced: 9 months ago · JSON representation

Repository

K-means clustering for large single-cell datasets

Basic Info
  • Host: GitHub
  • Owner: drisso
  • License: mit
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 5.89 MB
Statistics
  • Stars: 11
  • Watchers: 3
  • Forks: 4
  • Open Issues: 5
  • Releases: 0
Topics
clustering human-cell-atlas
Created about 8 years ago · Last pushed about 4 years ago
Metadata Files
Readme License

README.md

mbkmeans: Mini-batch k-means clustering for single-cell RNA-seq

Project Status: Active - The project has reached a stable, usable state and is being actively developed. BioC release BioC devel R-CMD-check

This package implements the mini-batch k-means algorithm for large datasets, including support for on-disk data representation.

The method is described in details in the paper:

S. Hicks, R. Liu, Y. Ni, E. Purdom, D. Risso (2021). mbkmeans: Fast clustering for single cell data using mini-batch k-means. PLOS Computational Biology.

Installation

In virtually all cases, installing from Bioconductor is recommended.

{r} if (!requireNamespace("BiocManager", quietly=TRUE)) install.packages("BiocManager") BiocManager::install("mbkmeans")

In the rare event you need the development version from GitHub, use the following.

{r} library(devtools) BiocManager::install("drisso/mbkmeans")

Owner

  • Name: Davide Risso
  • Login: drisso
  • Kind: user
  • Location: New York, NY
  • Company: Weill Cornell Medicine

GitHub Events

Total
  • Watch event: 1
  • Fork event: 1
Last Year
  • Watch event: 1
  • Fork event: 1

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 340
  • Total Committers: 7
  • Avg Commits per committer: 48.571
  • Development Distribution Score (DDS): 0.553
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
YuweiNi45 y****5@g****m 152
Davide Risso r****e@g****m 99
Elizabeth Purdom e****m@s****u 36
Davide Risso d****2@m****u 24
Stephanie Hicks s****s@g****m 16
Nitesh Turaga n****a@g****m 12
Stephanie Hicks s****s@S****t 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 18
  • Total pull requests: 1
  • Average time to close issues: 5 months
  • Average time to close pull requests: 9 days
  • Total issue authors: 8
  • Total pull request authors: 1
  • Average comments per issue: 1.72
  • Average comments per pull request: 3.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • drisso (11)
  • robertamezquita (1)
  • JaneChik (1)
  • epurdom (1)
  • ruoxi430 (1)
  • const-ae (1)
  • stephaniehicks (1)
  • mpadge (1)
Pull Request Authors
  • drisso (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

DESCRIPTION cran
  • R >= 3.6 depends
  • BiocParallel * imports
  • ClusterR * imports
  • DelayedArray * imports
  • Matrix * imports
  • Rcpp * imports
  • S4Vectors * imports
  • SingleCellExperiment * imports
  • SummarizedExperiment * imports
  • benchmarkme * imports
  • methods * imports
  • BiocStyle * suggests
  • DelayedMatrixStats * suggests
  • HDF5Array * suggests
  • Rhdf5lib * suggests
  • TENxPBMCData * suggests
  • beachmat * suggests
  • bluster * suggests
  • knitr * suggests
  • rmarkdown * suggests
  • scater * suggests
  • testthat * suggests