dask

Parallel computing with task scheduling

https://github.com/dask/dask

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    27 of 612 committers (4.4%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.5%) to scientific vocabulary

Keywords

dask numpy pandas pydata python scikit-learn scipy

Keywords from Contributors

flexible alignment closember distributed notebook cython qt ipython wx tk
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: dask
  • License: bsd-3-clause
  • Language: Python
  • Default Branch: main
  • Homepage: https://dask.org
  • Size: 122 MB
Statistics
  • Stars: 13,441
  • Watchers: 212
  • Forks: 1,794
  • Open Issues: 1,194
  • Releases: 27
Created about 11 years ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License

README.rst

Dask
====

|Build Status| |Coverage| |Doc Status| |Discourse| |Version Status| |NumFOCUS|

Dask is a flexible parallel computing library for analytics. See the
documentation_ for more information.


LICENSE
-------

New BSD. See the license file in the repository.

.. _documentation: https://dask.org
.. |Build Status| image:: https://github.com/dask/dask/actions/workflows/tests.yml/badge.svg
   :target: https://github.com/dask/dask/actions/workflows/tests.yml
.. |Coverage| image:: https://codecov.io/gh/dask/dask/branch/main/graph/badge.svg
   :target: https://codecov.io/gh/dask/dask/branch/main
   :alt: Coverage status
.. |Doc Status| image:: https://readthedocs.org/projects/dask/badge/?version=latest
   :target: https://dask.org
   :alt: Documentation Status
.. |Discourse| image:: https://img.shields.io/discourse/users?logo=discourse&server=https%3A%2F%2Fdask.discourse.group
   :alt: Discuss Dask-related things and ask for help
   :target: https://dask.discourse.group
.. |Version Status| image:: https://img.shields.io/pypi/v/dask.svg
   :target: https://pypi.python.org/pypi/dask/
.. |NumFOCUS| image:: https://img.shields.io/badge/powered%20by-NumFOCUS-orange.svg?style=flat&colorA=E1523D&colorB=007D8A
   :target: https://www.numfocus.org/

Owner

  • Name: dask
  • Login: dask
  • Kind: organization

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 8,806
  • Total Committers: 612
  • Avg Commits per committer: 14.389
  • Development Distribution Score (DDS): 0.718
Past Year
  • Commits: 468
  • Committers: 39
  • Avg Commits per committer: 12.0
  • Development Distribution Score (DDS): 0.609
Top Committers
Name                      Email           Commits
------------------------  --------------  -------
Matthew Rocklin           m****n@g****m   2,480
Patrick Hoefler           6****l          905
Jim Crist                 c****2@u****u   609
James Bourbeau            j****u          583
Blake Griffith            b****h@g****m   353
Richard (Rick) Zamora     r****7@g****m   301
jakirkham                 j****m@g****m   228
Julia Signell             j****l@g****m   203
Florian Jetter            f****r          194
Tom Augspurger            T****r          172
crusaderky                c****y@g****m   149
sinhrks                   s****s@g****m   112
Martin Durant             m****t          108
Hendrik Makait            h****k@m****m   88
Phillip Cloud             c****d@g****m   74
M. Farrajota              f****a          66
Irina Truong              i****a@g****m   63
Stephan Hoyer             s****r@c****m   63
Charles Blackmon-Luca     2****a          62
Miles                     m****3@g****m   55
dependabot[bot]           4****]          52
Peter Andreas Entschev    p****r@e****m   48
Erik Welch                e****h@g****m   44
Genevieve Buckley         3****y          39
Mads R. B. Kristensen     m****k@g****m   38
Mariano                   m****r@g****m   35
Sarah Charlotte Johnson   s****h@c****o   34
Jacob Tomlinson           j****n          32
Pavithra Eswaramoorthy    p****s@o****m   31
Elliott Sales de Andrade  q****t@g****m   28
and 582 more...
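
The all-time committer figures above can be reproduced from the raw counts. The sketch below assumes that the Development Distribution Score is defined as one minus the top committer's share of all commits. The page does not state that definition, but it is consistent with the numbers shown. The variable names are illustrative only.

```python
# All-time figures copied from the Committers section above.
total_commits = 8806
total_committers = 612
top_committer_commits = 2480  # Matthew Rocklin, the top committer listed

# Average commits per committer.
avg_commits = total_commits / total_committers

# Assumed definition: DDS = 1 - (top committer's commits / total commits).
dds = 1 - top_committer_commits / total_commits

print(f"Avg commits per committer: {avg_commits:.3f}")  # 14.389
print(f"DDS: {dds:.3f}")                                # 0.718
```

Under this assumed definition, a score near 0 means one person wrote almost every commit, and a score near 1 means the work is spread across many committers.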

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 993
  • Total pull requests: 1,605
  • Average time to close issues: 5 months
  • Average time to close pull requests: 29 days
  • Total issue authors: 474
  • Total pull request authors: 153
  • Average comments per issue: 3.46
  • Average comments per pull request: 1.83
  • Merged pull requests: 1,158
  • Bot issues: 28
  • Bot pull requests: 69
Past Year
  • Issues: 272
  • Pull requests: 681
  • Average time to close issues: 8 days
  • Average time to close pull requests: 4 days
  • Issue authors: 147
  • Pull request authors: 58
  • Average comments per issue: 1.17
  • Average comments per pull request: 1.59
  • Merged pull requests: 517
  • Bot issues: 19
  • Bot pull requests: 28
Top Authors
Issue Authors
  • phofl (45)
  • fjetter (39)
  • jrbourbeau (37)
  • crusaderky (31)
  • github-actions[bot] (27)
  • mrocklin (26)
  • rjzamora (15)
  • TomAugspurger (15)
  • frbelotto (13)
  • dcherian (13)
  • gjoseph92 (12)
  • hendrikmakait (11)
  • jakirkham (10)
  • jsignell (9)
  • dbalabka (9)
Pull Request Authors
  • phofl (500)
  • fjetter (232)
  • crusaderky (116)
  • jrbourbeau (104)
  • hendrikmakait (58)
  • rjzamora (57)
  • dependabot[bot] (54)
  • milesgranger (44)
  • TomAugspurger (38)
  • charlesbluca (24)
  • mrocklin (24)
  • j-bennet (17)
  • github-actions[bot] (15)
  • graingert (13)
  • DimitriPapadopoulos (13)
Top Labels
Issue Labels
needs attention (311), needs triage (262), dataframe (189), array (132), bug (59), io (43), documentation (40), upstream (36), needs info (34), tests (28), highlevelgraph (28), dask-expr (26), discussion (25), parquet (24), enhancement (22), deprecation (18), good first issue (15), feature (12), core (10), delayed (9), scheduler (8), convert-string (8), gpu (7), p3 (4), dependencies (3), regression (3), bag (3), config (3), community (2), hygiene (2)
Pull Request Labels
dataframe (173), needs attention (90), upstream (76), array (68), io (53), dependencies (53), documentation (48), bug (19), needs review (9), enhancement (9), parquet (8), dispatch (8), deprecation (7), needs triage (7), tests (6), gpu (5), github_actions (4), feature (4), dask-expr (3), almost done (1), delayed (1), bag (1), regression (1), hygiene (1)

Packages

  • Total packages: 5
  • Total downloads:
    • pypi 17,246,401 last-month
  • Total docker downloads: 456,876,498
  • Total dependent packages: 1,293
    (may contain duplicates)
  • Total dependent repositories: 19,775
    (may contain duplicates)
  • Total versions: 673
  • Total maintainers: 8
  • Total advisories: 2
pypi.org: dask

Parallel PyData with Task Scheduling

  • Versions: 222
  • Dependent Packages: 880
  • Dependent Repositories: 13,853
  • Downloads: 17,246,401 Last month
  • Docker Downloads: 456,876,498
Rankings
Dependent packages count: 0.0%
Dependent repos count: 0.1%
Downloads: 0.1%
Stargazers count: 0.2%
Average: 0.3%
Docker downloads count: 0.3%
Forks count: 1.0%
Last synced: 6 months ago
conda-forge.org: dask

Dask is a flexible parallel computing library for analytics.

  • Homepage: https://dask.org/
  • License: BSD-3-Clause
  • Latest release: 2022.11.0
    published over 3 years ago
  • Versions: 133
  • Dependent Packages: 273
  • Dependent Repositories: 2,283
Rankings
Dependent repos count: 0.2%
Dependent packages count: 0.2%
Average: 1.6%
Stargazers count: 2.7%
Forks count: 3.2%
Last synced: 6 months ago
conda-forge.org: dask-core
  • Versions: 120
  • Dependent Packages: 115
  • Dependent Repositories: 678
Rankings
Dependent packages count: 0.7%
Dependent repos count: 1.0%
Average: 1.9%
Stargazers count: 2.7%
Forks count: 3.2%
Last synced: 6 months ago
anaconda.org: dask

Dask is a flexible parallel computing library for analytics.

  • Homepage: https://www.dask.org
  • License: BSD-3-Clause
  • Latest release: 2025.7.0
    published 7 months ago
  • Versions: 98
  • Dependent Packages: 15
  • Dependent Repositories: 2,283
Rankings
Dependent repos count: 1.1%
Dependent packages count: 2.5%
Average: 4.9%
Stargazers count: 7.3%
Forks count: 8.7%
Last synced: 6 months ago
anaconda.org: dask-core

Dask is a flexible parallel computing library for analytics.

  • Homepage: https://www.dask.org
  • License: BSD-3-Clause
  • Latest release: 2025.7.0
    published 7 months ago
  • Versions: 100
  • Dependent Packages: 10
  • Dependent Repositories: 678
Rankings
Dependent packages count: 4.9%
Dependent repos count: 6.0%
Average: 6.7%
Stargazers count: 7.3%
Forks count: 8.6%
Last synced: 6 months ago

Dependencies

docs/requirements-docs.txt pypi
  • cloudpickle >=1.5.0
  • dask-sphinx-theme >=3.0.0
  • fsspec *
  • ipython *
  • jupyter_sphinx *
  • numpydoc *
  • pandas >=1.4.0
  • pytest *
  • pytest-check-links *
  • requests-cache *
  • scipy *
  • sphinx >=4.0.0
  • sphinx-click *
  • sphinx-copybutton *
  • sphinx-design *
  • sphinx-remove-toctrees *
  • sphinx-tabs *
  • sphinx_autosummary_accessors *
  • toolz *
setup.py pypi
  • cloudpickle *
  • fsspec *
  • packaging *
  • partd *
  • pyyaml *
  • toolz *
.github/workflows/additional.yml actions
  • actions/checkout v3.3.0 composite
  • conda-incubator/setup-miniconda v2.2.0 composite
.github/workflows/conda.yml actions
  • actions/checkout v3.3.0 composite
  • conda-incubator/setup-miniconda v2.2.0 composite
.github/workflows/label-all.yml actions
  • andymckay/labeler 1.0.4 composite
.github/workflows/label-prs.yml actions
  • actions/labeler main composite
.github/workflows/pre-commit.yml actions
  • actions/checkout v3.3.0 composite
  • actions/setup-python v4 composite
  • pre-commit/action v3.0.0 composite
.github/workflows/stale-bot.yaml actions
  • actions/stale v6 composite
.github/workflows/tests.yml actions
  • actions/checkout v3.3.0 composite
  • actions/setup-java v3 composite
  • actions/upload-artifact v3 composite
  • codecov/codecov-action v3 composite
  • conda-incubator/setup-miniconda v2.2.0 composite
.github/workflows/update-gpuci.yml actions
  • actions/checkout v3.3.0 composite
  • jacobtomlinson/gha-anaconda-package-version 0.1.3 composite
  • jacobtomlinson/gha-find-replace v2 composite
  • peter-evans/create-pull-request v4 composite
  • the-coding-turtle/ga-yaml-parser v0.1.2 composite
.github/workflows/upstream.yml actions
  • actions/checkout v3.3.0 composite
  • codecov/codecov-action v3 composite
  • conda-incubator/setup-miniconda v2.2.0 composite
  • xarray-contrib/ci-trigger v1 composite
  • xarray-contrib/issue-from-pytest-log v1.2.5 composite
pyproject.toml pypi
  • click >= 8.0
  • cloudpickle >= 1.5.0
  • fsspec >= 2021.09.0
  • importlib_metadata >= 4.13.0
  • packaging >= 20.0
  • partd >= 1.2.0
  • pyyaml >= 5.3.1
  • toolz >= 0.10.0
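
To see how a local environment compares with the pyproject.toml lower bounds listed above, one option is to query installed distribution metadata with the standard library. This is a minimal sketch, not part of the dask project: the `MINIMUMS` mapping and the `installed_versions` helper are illustrative names, and the code only reports versions rather than comparing them.

```python
from importlib.metadata import PackageNotFoundError, version

# Lower bounds copied from the pyproject.toml list above.
MINIMUMS = {
    "click": "8.0",
    "cloudpickle": "1.5.0",
    "fsspec": "2021.09.0",
    "importlib_metadata": "4.13.0",
    "packaging": "20.0",
    "partd": "1.2.0",
    "pyyaml": "5.3.1",
    "toolz": "0.10.0",
}


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


# Print each requirement next to whatever is installed locally.
for name, current in installed_versions(MINIMUMS).items():
    print(f"{name}: requires >= {MINIMUMS[name]}, installed: {current or 'not installed'}")
```

A real comparison against the lower bounds would need a version parser, such as the one in the `packaging` library that appears in the dependency list.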