https://github.com/brocklab/clustercleaver

Separate your clusters with gene distance

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: biorxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.4%) to scientific vocabulary

Repository

Basic Info
  • Host: GitHub
  • Owner: brocklab
  • License: MIT
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 23.1 MB
Statistics
  • Stars: 1
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License

README.md

clusterCleaver

clusterCleaver is a computational pipeline for analyzing clustered single-cell RNA sequencing (scRNAseq) data. It is a scanpy-compatible package that uses the Earth Mover's Distance (EMD) to find genes that distinguish between clusters of cells. The method is described in our preprint on bioRxiv.
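To make the idea concrete, here is a minimal sketch of ranking genes by the one-dimensional EMD between two clusters. It is not clusterCleaver's own API: the function `rank_genes_by_emd`, the toy gene names, and the synthetic data are all hypothetical, and the distance is computed with `scipy.stats.wasserstein_distance` on a plain cells-by-genes array rather than on an AnnData object.

```python
import numpy as np
from scipy.stats import wasserstein_distance


def rank_genes_by_emd(expr, labels, gene_names, cluster_a, cluster_b):
    """Rank genes by the 1-D EMD between two clusters (hypothetical helper).

    expr is a dense cells x genes array; labels holds one cluster label per cell.
    """
    expr = np.asarray(expr, dtype=float)
    labels = np.asarray(labels)
    in_a = labels == cluster_a
    in_b = labels == cluster_b
    # One EMD per gene, comparing its expression distribution in each cluster.
    scores = [
        wasserstein_distance(expr[in_a, j], expr[in_b, j])
        for j in range(expr.shape[1])
    ]
    order = np.argsort(scores)[::-1]  # largest distance first
    return [(gene_names[j], float(scores[j])) for j in order]


# Synthetic example: geneA separates the two clusters, geneB does not.
rng = np.random.default_rng(0)
n = 200
labels = np.array(["0"] * n + ["1"] * n)
gene_a = np.concatenate([rng.normal(0.5, 0.1, n), rng.normal(3.0, 0.1, n)])
gene_b = rng.normal(1.0, 0.1, 2 * n)
expr = np.clip(np.column_stack([gene_a, gene_b]), 0, None)

ranking = rank_genes_by_emd(expr, labels, ["geneA", "geneB"], "0", "1")
for gene, score in ranking:
    print(f"{gene}: EMD = {score:.3f}")
```

Genes whose expression distributions barely overlap between the two clusters rise to the top of the ranking, which is the property a separating marker needs.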

clusterCleaver provides a number of advantages:

  1. clusterCleaver is fast and compatible with scanpy, so you can quickly iterate over multiple parameter settings.
  2. clusterCleaver provides easy-to-interpret plots, giving you confidence in its predictions.
  3. Most importantly, clusterCleaver is one of the few surface marker identification tools whose predictions have been validated.

Installation

You can currently install clusterCleaver using pip:

pip install clusterCleaver

Processing assumptions

clusterCleaver is designed around data processed according to current single-cell best practices.

clusterCleaver works best on data which:

  1. Is not regressed.
  2. Has gene expression values greater than or equal to zero.
  3. Does not have excessively high gene expression values.

The last point matters because some normalization methods produce excessively high gene expression values. The EMD is highly sensitive to outliers. For identifying marker genes, this sensitivity is useful, but excessively high expression values will produce spurious results.
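The sketch below shows one way to sanity-check an expression matrix against these assumptions before running an EMD-based analysis, and why a single extreme value matters. The helper `check_expression` and its `max_value` threshold of 20 are hypothetical choices for illustration, not part of clusterCleaver; negative values are used here only as a common symptom of scaling or regression, not as a definitive test for it.

```python
import numpy as np
from scipy.stats import wasserstein_distance


def check_expression(expr, max_value=20.0):
    """Return a list of warnings for a cells x genes array (hypothetical helper).

    max_value is an arbitrary illustrative threshold; choose one that suits
    your normalization.
    """
    expr = np.asarray(expr, dtype=float)
    problems = []
    if (expr < 0).any():
        problems.append("negative values found (data may be scaled or regressed)")
    if expr.max() > max_value:
        problems.append(f"values above {max_value} found (possible outliers)")
    return problems


rng = np.random.default_rng(0)
clean = rng.uniform(0, 5, size=(100, 3))   # non-negative, moderate values
scaled = clean - clean.mean(axis=0)        # centering introduces negatives
inflated = clean.copy()
inflated[0, 0] = 1e4                       # one extreme entry

print(check_expression(clean))
print(check_expression(scaled))
print(check_expression(inflated))

# Outlier sensitivity: two samples that differ in a single cell.
a = np.ones(100)
b = a.copy()
b[0] = 1000.0
emd_outlier = wasserstein_distance(a, b)   # (1000 - 1) / 100 = 9.99
print(emd_outlier)
```

In the last comparison, 99 of the 100 cells are identical, yet the distance is 9.99 because one cell moved by 999 units. A gene can therefore score highly on the strength of a few inflated values alone.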

Usage

To get started with clusterCleaver, check out tutorial.ipynb in the root directory of the repository.

Owner

  • Name: brocklab
  • Login: brocklab
  • Kind: organization
  • Location: Austin, TX

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 11 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 1
  • Total maintainers: 1
pypi.org: clustercleaver

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 11 Last month
Rankings
Dependent packages count: 10.9%
Average: 36.2%
Dependent repos count: 61.4%
Maintainers (1)

Dependencies

pyproject.toml pypi
  • matplotlib >=3.8
  • numpy >=1.26.3
  • pandas >=2.2.0
  • scanpy >=1.9.8
  • scipy >=1.12.0
  • seaborn >=0.13.2
  • tqdm >=4.66.1