scanpy

Single-cell analysis in Python. Scales to >100M cells.

https://github.com/scverse/scanpy

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 6 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    18 of 161 committers (11.2%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.8%) to scientific vocabulary

Keywords

anndata bioinformatics data-science machine-learning python scanpy scverse transcriptomics visualize-data

Keywords from Contributors

genomics closember scrna-seq mudata muon multimodal-omics-analysis multi-omics dataflow cite-seq scatac-seq
Last synced: 6 months ago · JSON representation

Repository

Single-cell analysis in Python. Scales to >100M cells.

Basic Info
Statistics
  • Stars: 2,174
  • Watchers: 48
  • Forks: 659
  • Open Issues: 563
  • Releases: 0
Topics
anndata bioinformatics data-science machine-learning python scanpy scverse transcriptomics visualize-data
Created about 9 years ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License

README.md

Stars PyPI PyPI Downloads Conda Forge Conda Forge Downloads Docs CI Discourse topics Chat Powered by NumFOCUS

Scanpy – Single-Cell Analysis in Python

Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. It includes preprocessing, visualization, clustering, trajectory inference and differential expression testing. The Python-based implementation efficiently deals with datasets of more than one million cells.

Discuss usage on the scverse Discourse. Read the documentation. If you'd like to contribute by opening an issue or creating a pull request, please take a look at our contribution guide.

scanpy is part of the scverse® project (website, governance) and is fiscally sponsored by NumFOCUS. If you like scverse® and want to support our mission, please consider making a tax-deductible donation to help the project pay for developer time, professional services, travel, workshops, and a variety of other needs.

Citation

If you use scanpy in your work, please cite the scanpy publication as follows:

SCANPY: large-scale single-cell gene expression data analysis

F. Alexander Wolf, Philipp Angerer, Fabian J. Theis

Genome Biology 2018 Feb 06. doi: 10.1186/s13059-017-1382-0.

You can cite the scverse publication as follows:

The scverse project provides a computational ecosystem for single-cell omics data analysis

Isaac Virshup, Danila Bredikhin, Lukas Heumos, Giovanni Palla, Gregor Sturm, Adam Gayoso, Ilia Kats, Mikaela Koutrouli, Scverse Community, Bonnie Berger, Dana Pe’er, Aviv Regev, Sarah A. Teichmann, Francesca Finotello, F. Alexander Wolf, Nir Yosef, Oliver Stegle & Fabian J. Theis

Nat Biotechnol. 2023 Apr 10. doi: 10.1038/s41587-023-01733-8.

Owner

  • Name: scverse
  • Login: scverse
  • Kind: organization

Foundational tools for omics data in the life sciences

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 3,340
  • Total Committers: 161
  • Avg Commits per committer: 20.745
  • Development Distribution Score (DDS): 0.666
Past Year
  • Commits: 210
  • Committers: 29
  • Avg Commits per committer: 7.241
  • Development Distribution Score (DDS): 0.433
Top Committers
Name Email Commits
falexwolf f****f@g****e 1,115
Philipp A f****p@w****e 605
Isaac Virshup i****p@g****m 452
Fidel Ramírez f****z@g****m 223
Koncopd k****d@g****m 167
Gökçen Eraslan g****n@g****m 88
pre-commit-ci[bot] 6****] 82
LuckyMD m****n@i****e 47
Awni Mousa a****a@g****m 44
giovp g****l@g****m 39
adamgayoso a****o 30
Ilan Gold i****d@g****m 25
Severin Dicks 3****7 24
Francesco G. Brundu f****u@g****m 20
Lukas Heumos l****s@p****t 19
Volker Bergen 3****n 17
Gökcen Eraslan g****n@b****g 15
Andres Munoz a****s@y****u 14
tobias t****s@l****m 13
Scott Gigante s****e@g****m 13
atarashansky a****y@g****m 11
Jonathan Manning j****g@e****k 11
michalk8 4****8 10
Fabian Rost f****t@p****e 9
Eljas Roellin 6****l 9
Tom White t****e@g****m 7
dawe c****e@g****m 7
Marius Lange m****e@t****e 6
LuckyMD m****n@g****m 5
Gregor Sturm m****l@g****e 5
and 131 more...

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 727
  • Total pull requests: 1,428
  • Average time to close issues: 11 months
  • Average time to close pull requests: about 1 month
  • Total issue authors: 412
  • Total pull request authors: 119
  • Average comments per issue: 2.81
  • Average comments per pull request: 1.78
  • Merged pull requests: 1,100
  • Bot issues: 0
  • Bot pull requests: 127
Past Year
  • Issues: 166
  • Pull requests: 603
  • Average time to close issues: 13 days
  • Average time to close pull requests: 5 days
  • Issue authors: 92
  • Pull request authors: 37
  • Average comments per issue: 0.69
  • Average comments per pull request: 1.16
  • Merged pull requests: 461
  • Bot issues: 0
  • Bot pull requests: 57
Top Authors
Issue Authors
  • flying-sheep (54)
  • scverse-bot (32)
  • ivirshup (30)
  • ilan-gold (14)
  • Intron7 (10)
  • wangjiawen2013 (10)
  • Zethson (8)
  • adamgayoso (8)
  • danli349 (7)
  • brianpenghe (7)
  • grst (6)
  • LuckyMD (6)
  • giovp (6)
  • alexlenail (5)
  • meeseeksmachine (5)
Pull Request Authors
  • flying-sheep (471)
  • meeseeksmachine (352)
  • pre-commit-ci[bot] (125)
  • ivirshup (96)
  • ilan-gold (65)
  • Intron7 (43)
  • Koncopd (12)
  • Zethson (12)
  • eroell (11)
  • kaushalprasadhial (10)
  • gokceneraslan (9)
  • ashish615 (8)
  • Sakshi-2797 (6)
  • selmanozleyen (6)
  • glitchheadgit (6)
Top Labels
Issue Labels
Bug 🐛 (171) Enhancement ✨ (119) Triage 🩺 (106) Needs info❔ (58) Area - Plotting 🌺 (58) Area - Documentation 📒 (36) Question (31) Development Process 🚀 (22) Closing candidate (19) Area – API (18) Upstream (14) Area – Performance 🐌 (13) good first issue (13) Area - Out of core 💾 (9) Area - Topology 🍩 (8) Area – Differential Expression (7) Area – Spatial 🗺 (6) Area - IO (6) Duplicate (6) Meta (6) Help Wanted (5) Area – Reproducibility (5) breaking change ‼️ (4) Area – Preprocessing 🧼 (4) Area – External (4) Installation (4) benchmark (2) RFC/ Discussion (2) Area - Raw 🍣 (1) Area - Datasets (1)
Pull Request Labels
Development Process 🚀 (505) Area - Documentation 📒 (84) benchmark (45) Bug 🐛 (36) Area – Performance 🐌 (22) Area - Plotting 🌺 (20) no milestone (18) Area - Out of core 💾 (13) Area – Preprocessing 🧼 (8) Enhancement ✨ (6) Area – API (5) needs retroactive review (4) Area – External (4) Upstream (3) Area - Dimensionality Reduction 🤏 (3) Area - Topology 🍩 (3) Still Needs Manual Backport (2) dependencies (2) github_actions (2) Area - IO (2) PR blocked: Waiting on author (1) Needs info❔ (1) breaking change ‼️ (1) Area – Differential Expression (1) Area - Datasets (1) Area – Reproducibility (1)

Packages

  • Total packages: 2
  • Total downloads:
    • pypi 671,457 last-month
  • Total docker downloads: 20,246
  • Total dependent packages: 422
    (may contain duplicates)
  • Total dependent repositories: 526
    (may contain duplicates)
  • Total versions: 93
  • Total maintainers: 3
pypi.org: scanpy

Single-Cell Analysis in Python.

  • Versions: 88
  • Dependent Packages: 416
  • Dependent Repositories: 493
  • Downloads: 671,457 Last month
  • Docker Downloads: 20,246
Rankings
Dependent packages count: 0.1%
Dependent repos count: 0.6%
Downloads: 1.1%
Average: 1.2%
Docker downloads count: 1.4%
Stargazers count: 1.7%
Forks count: 2.3%
Maintainers (3)
Last synced: 6 months ago
conda-forge.org: scanpy

Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. It includes preprocessing, visualization, clustering, trajectory inference and differential expression testing. The Python-based implementation efficiently deals with datasets of more than one million cells.

  • Homepage: https://scverse.org/
  • License: BSD-3-Clause
  • Latest release: 1.9.1
    published almost 4 years ago
  • Versions: 5
  • Dependent Packages: 6
  • Dependent Repositories: 33
Rankings
Dependent repos count: 6.4%
Forks count: 7.5%
Average: 8.4%
Dependent packages count: 9.0%
Stargazers count: 10.7%
Last synced: 6 months ago

Dependencies

.github/workflows/check-pr.yml actions
pyproject.toml pypi
  • anndata >=0.7.4
  • h5py >=3
  • joblib *
  • matplotlib >=3.4
  • natsort *
  • networkx >=2.3
  • numba >=0.41.0
  • numpy >=1.17.0
  • packaging *
  • pandas >=1.0
  • patsy *
  • scikit-learn >=0.24
  • scipy >=1.4
  • seaborn *
  • session-info *
  • statsmodels >=0.10.0rc2
  • tqdm *
  • umap-learn >=0.3.10
.github/workflows/publish.yml actions
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
  • pypa/gh-action-pypi-publish release/v1 composite
.github/workflows/benchmark.yml actions
  • actions/cache v4 composite
  • actions/checkout v4 composite
  • actions/setup-python v5 composite