https://github.com/brocklab/clustercleaver

Separate your clusters with gene distance

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

○ CITATION.cff file
✓ codemeta.json file: Found codemeta.json file
○ .zenodo.json file
✓ DOI references: Found 1 DOI reference(s) in README
✓ Academic publication links: Links to biorxiv.org
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (8.4%) to scientific vocabulary

Last synced: 10 months ago

Repository

Basic Info

Host: GitHub
Owner: brocklab
License: mit
Language: Jupyter Notebook
Default Branch: main
Homepage:
Size: 23.1 MB

Statistics

Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Releases: 0

Created about 2 years ago · Last pushed almost 2 years ago

Metadata Files

Readme, License

clusterCleaver

clusterCleaver is a computational pipeline for analyzing cluster data in single-cell RNA sequencing (scRNAseq). It is a scanpy-compatible package that uses the Earth Mover's Distance (EMD) to find genes that distinguish between clusters of cells. A preprint describing the method is linked from the repository README (bioRxiv).

clusterCleaver provides a number of advantages:

- It is fast and compatible with scanpy, so you can iterate over many candidate parameter settings.
- It produces easy-to-interpret plots, giving you confidence in its predictions.
- Most importantly, it is one of the few surface marker identification tools whose predictions have been validated.
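
To make the EMD idea concrete, here is a minimal sketch of ranking genes by how well they separate two clusters. It does not use clusterCleaver's own API, which is not shown on this page; it relies only on `scipy.stats.wasserstein_distance`, and the function name `rank_genes_by_emd`, the toy matrix, and the gene names are illustrative assumptions.

```python
import numpy as np
from scipy.stats import wasserstein_distance


def rank_genes_by_emd(expr, labels, gene_names, cluster_a, cluster_b):
    """Return (gene, emd) pairs sorted by decreasing EMD between two clusters.

    Hypothetical helper for illustration; not part of clusterCleaver.
    expr is a cells x genes array, labels holds one cluster label per cell.
    """
    expr = np.asarray(expr, dtype=float)
    labels = np.asarray(labels)
    in_a = labels == cluster_a
    in_b = labels == cluster_b
    scores = [
        (gene, wasserstein_distance(expr[in_a, j], expr[in_b, j]))
        for j, gene in enumerate(gene_names)
    ]
    # A larger distance means the gene's expression differs more between clusters.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Toy data: 6 cells x 2 genes. geneA differs between clusters, geneB does not.
expr = np.array([
    [0.0, 1.0],
    [0.1, 1.1],
    [0.2, 0.9],
    [3.0, 1.0],
    [3.1, 1.1],
    [3.2, 0.9],
])
labels = np.array(["0", "0", "0", "1", "1", "1"])
ranking = rank_genes_by_emd(expr, labels, ["geneA", "geneB"], "0", "1")
print(ranking)
```

In this toy case geneA ranks first because its expression is shifted by about 3 units between the two clusters, while geneB has the same distribution in both.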

Installation

You can currently install clusterCleaver using pip:

pip install clusterCleaver

Processing assumptions

clusterCleaver is designed for data processed according to current single-cell best practices.

clusterCleaver works best on data which:

1. Is not regressed.
2. Has gene expression values greater than or equal to zero.
3. Does not have excessively high gene expression values.
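
The value-based assumptions above can be checked before running any analysis. The following is a rough sketch, not a clusterCleaver function: `check_expression_matrix` and its `max_reasonable` threshold are hypothetical, and the default of 50.0 is an arbitrary placeholder that should be chosen to suit your normalization. Whether data has been regressed cannot be determined from the values alone, although negative values are often a sign of scaling or regression.

```python
import numpy as np


def check_expression_matrix(expr, max_reasonable=50.0):
    """Summarize whether a dense cells x genes matrix meets the value assumptions.

    Hypothetical helper; max_reasonable is an assumed, user-chosen cutoff.
    """
    expr = np.asarray(expr, dtype=float)
    max_value = float(expr.max())
    return {
        "non_negative": bool((expr >= 0).all()),
        "max_value": max_value,
        "within_threshold": bool(max_value <= max_reasonable),
    }


# Log-normalized style data: non-negative with moderate values.
good = np.array([[0.0, 1.2], [3.4, 0.0]])
# Scaled data: centered values include negatives.
scaled = np.array([[-1.0, 0.5], [1.0, -0.5]])

report_good = check_expression_matrix(good)
report_scaled = check_expression_matrix(scaled)
print(report_good)
print(report_scaled)
```

For a scanpy `AnnData` object, the same check could be applied to a dense copy of `adata.X`, assuming the matrix fits in memory.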

The last caveat is important because some normalization methods can yield excessively high gene expression values. The EMD is highly sensitive to outliers. For our purposes this sensitivity is desirable, but when expression values are too high a few extreme cells can dominate the distance and produce spurious results.
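
The outlier sensitivity is easy to see on a small example. The sketch below uses `scipy.stats.wasserstein_distance` on made-up values (the arrays are illustrative assumptions, not real expression data) and compares the distance before and after a single extreme value is introduced.

```python
import numpy as np
from scipy.stats import wasserstein_distance

# Two small, similar expression samples for one gene (illustrative values).
cluster_a = np.array([0.0, 0.0, 1.0, 1.0, 2.0])
cluster_b = np.array([0.0, 1.0, 1.0, 2.0, 2.0])

baseline = wasserstein_distance(cluster_a, cluster_b)

# Replace one cell in cluster_b with an extreme value.
cluster_b_outlier = cluster_b.copy()
cluster_b_outlier[-1] = 1000.0

inflated = wasserstein_distance(cluster_a, cluster_b_outlier)

print(f"baseline EMD: {baseline:.3f}")   # 0.400
print(f"with outlier: {inflated:.3f}")   # 200.000
```

One extreme cell out of five raises the distance from 0.4 to 200, which is why a gene with a few inflated values can outrank genes that separate the clusters more consistently.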

Usage

To get started with clusterCleaver, see tutorial.ipynb in the top-level directory of the repository.

Owner

Name: brocklab
Login: brocklab
Kind: organization
Location: Austin, TX

Website: http://www.brocklab.com/
Repositories: 4
Profile: https://github.com/brocklab

Packages

Total packages: 1
Total downloads:
- pypi: 11 last month

Total dependent packages: 0
Total dependent repositories: 0
Total versions: 1
Total maintainers: 1

pypi.org: clustercleaver

Separate your clusters with gene distance

Homepage: https://github.com/brocklab/clusterCleaver
Documentation: https://clustercleaver.readthedocs.io/
License: MIT License
Latest release: 0.0.0.3 (published about 2 years ago)

Versions: 1
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 11 Last month

Rankings

Dependent packages count: 10.9%

Average: 36.2%

Dependent repos count: 61.4%

Maintainers (1)

tyler_jost

Dependencies

pyproject.toml pypi

matplotlib >=3.8
numpy >=1.26.3
pandas >=2.2.0
scanpy >=1.9.8
scipy >=1.12.0
seaborn >=0.13.2
tqdm >=4.66.1
