dask-ml

Scalable Machine Learning with Dask

https://github.com/dask/dask-ml

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    6 of 90 committers (6.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.8%) to scientific vocabulary

Keywords

hacktoberfest

Keywords from Contributors

alignment flexible closember distributed gbm parallel-computing gbrt gbdt data-mining parallel
Last synced: 10 months ago · JSON representation

Repository

Scalable Machine Learning with Dask

Basic Info
  • Host: GitHub
  • Owner: dask
  • License: bsd-3-clause
  • Language: Python
  • Default Branch: main
  • Homepage: http://ml.dask.org
  • Size: 98.1 MB
Statistics
  • Stars: 942
  • Watchers: 38
  • Forks: 260
  • Open Issues: 284
  • Releases: 13
Topics
hacktoberfest
Created about 9 years ago · Last pushed about 1 year ago
Metadata Files
Readme Contributing License

README.rst

Dask-ML
=======

|Build Status| |Azure Pipelines| |Coverage| |Doc Status| |Discourse| |Version Status| |NumFOCUS|

Dask-ML provides scalable machine learning in Python using `Dask `__ alongside popular machine learning libraries like `Scikit-Learn `__, `XGBoost `__, and others.

You can try Dask-ML on a small cloud instance by clicking the following button:

.. image:: https://mybinder.org/badge.svg
   :target: https://mybinder.org/v2/gh/dask/dask-examples/main?filepath=machine-learning.ipynb

LICENSE
-------

New BSD. See `License File `__.

.. _documentation: https://dask.org
.. |Build Status| image:: https://github.com/dask/dask-ml/workflows/CI/badge.svg?branch=main
   :target: https://github.com/dask/dask-ml/actions?query=workflow%3A%22CI%22
.. |Azure Pipelines| image:: https://dev.azure.com/dask-dev/dask/_apis/build/status/dask.dask-ml?branchName=main
   :target: https://dev.azure.com/dask-dev/dask/_build/latest?definitionId=1&branchName=main
.. |Coverage| image:: https://codecov.io/gh/dask/dask-ml/branch/main/graph/badge.svg
   :target: https://codecov.io/gh/dask/dask-ml/branch/main
   :alt: Coverage status
.. |Doc Status| image:: https://readthedocs.org/projects/ml/badge/?version=latest
   :target: https://ml.dask.org/
   :alt: Documentation Status
.. |Discourse| image:: https://img.shields.io/discourse/users?logo=discourse&server=https%3A%2F%2Fdask.discourse.group
   :alt: Discuss Dask-related things and ask for help
   :target: https://dask.discourse.group
.. |Version Status| image:: https://img.shields.io/pypi/v/dask-ml.svg
   :target: https://pypi.python.org/pypi/dask-ml/
.. |NumFOCUS| image:: https://img.shields.io/badge/powered%20by-NumFOCUS-orange.svg?style=flat&colorA=E1523D&colorB=007D8A
   :target: https://www.numfocus.org/

Owner

  • Name: dask
  • Login: dask
  • Kind: organization

GitHub Events

Total
  • Create event: 2
  • Release event: 1
  • Issues event: 19
  • Watch event: 45
  • Delete event: 1
  • Issue comment event: 46
  • Push event: 10
  • Pull request review comment event: 13
  • Pull request review event: 14
  • Pull request event: 12
  • Fork event: 7
Last Year
  • Create event: 2
  • Release event: 1
  • Issues event: 19
  • Watch event: 45
  • Delete event: 1
  • Issue comment event: 46
  • Push event: 10
  • Pull request review comment event: 13
  • Pull request review event: 14
  • Pull request event: 12
  • Fork event: 7

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 744
  • Total Committers: 90
  • Avg Commits per committer: 8.267
  • Development Distribution Score (DDS): 0.738
Past Year
  • Commits: 5
  • Committers: 3
  • Avg Commits per committer: 1.667
  • Development Distribution Score (DDS): 0.6
Top Committers
Name Email Commits
Tom Augspurger t****r@g****m 195
Tom Augspurger T****r@u****m 163
Jim Crist c****2@u****u 84
Matthew Rocklin m****n@g****m 32
Scott Sievert s****t@u****m 31
James Bourbeau j****u@u****m 29
James Bourbeau j****u@g****m 27
Scott Sievert d****v@s****m 22
Scott Sievert g****b@s****m 15
Thomas Fan t****n@g****m 9
James Lamb j****0@g****m 7
Jim Crist j****t@u****m 7
Abdulelah Bin Mahfoodh a****m@g****m 5
Jim Crist j****t@g****m 5
Mike McCarty m****y@n****m 5
Ray Bell r****0@g****m 5
Tom Augspurger t****8@g****m 4
Hristo h****g@u****m 4
Eric Czech e****h@g****m 3
Jacob Tomlinson j****n@u****m 3
Jan Koch J****h@t****e 3
Ryan Deak r****k@z****m 3
severo d****o@g****m 3
Robert Sare r****e@g****m 2
Chiara Marmo c****o@u****m 2
Guillaume Lemaitre g****8@g****m 2
J42994 j****h@e****m 2
Joris Van den Bossche j****e@g****m 2
Julia Signell j****l@g****m 2
Julien Jerphanion g****t@j****z 2
and 60 more...

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 107
  • Total pull requests: 65
  • Average time to close issues: 9 months
  • Average time to close pull requests: 3 months
  • Total issue authors: 84
  • Total pull request authors: 28
  • Average comments per issue: 3.75
  • Average comments per pull request: 1.77
  • Merged pull requests: 35
  • Bot issues: 2
  • Bot pull requests: 0
Past Year
  • Issues: 13
  • Pull requests: 13
  • Average time to close issues: 22 days
  • Average time to close pull requests: about 1 month
  • Issue authors: 10
  • Pull request authors: 5
  • Average comments per issue: 1.23
  • Average comments per pull request: 3.08
  • Merged pull requests: 7
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • TomAugspurger (7)
  • SultanOrazbayev (3)
  • phobson (3)
  • GaetanLepage (3)
  • mobigroup (3)
  • sh2022515 (2)
  • jrbourbeau (2)
  • flying-sheep (2)
  • mmccarty (2)
  • brian-methodical (2)
  • Zethson (2)
  • trivialfis (2)
  • github-actions[bot] (2)
  • ivirshup (2)
  • ayushdg (1)
Pull Request Authors
  • TomAugspurger (14)
  • mmccarty (7)
  • jrbourbeau (7)
  • cmarmo (3)
  • VibhuJawa (3)
  • npk7 (2)
  • gforsyth (2)
  • fujiisoup (2)
  • GaetanLepage (2)
  • wietzesuijker (2)
  • pr38 (2)
  • scharlottej13 (2)
  • jacobtomlinson (2)
  • pavithraes (1)
  • ayushdg (1)
Top Labels
Issue Labels
good first issue (3) Algorithm (2) needs triage (2) Needs Info (2) upstream (2) metric (1) dataframe (1) Documentation (1) bug (1) Roadmap (1)
Pull Request Labels
upstream (1)

Packages

  • Total packages: 4
  • Total downloads:
    • pypi 116,553 last-month
  • Total docker downloads: 1,185,626
  • Total dependent packages: 42
    (may contain duplicates)
  • Total dependent repositories: 346
    (may contain duplicates)
  • Total versions: 131
  • Total maintainers: 2
pypi.org: dask-ml

A library for distributed and parallel machine learning

  • Homepage: https://github.com/dask/dask-ml
  • Documentation: https://dask-ml.readthedocs.io/
  • License: Copyright (c) 2017, Anaconda, Inc. and contributors All rights reserved. Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. Neither the name of Anaconda nor the names of any contributors may be used to endorse or promote products derived from this software without specific prior written permission. THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
  • Latest release: 2025.1.0
    published over 1 year ago
  • Versions: 38
  • Dependent Packages: 35
  • Dependent Repositories: 190
  • Downloads: 116,553 Last month
  • Docker Downloads: 1,185,626
Rankings
Dependent packages count: 0.4%
Docker downloads count: 0.8%
Downloads: 0.8%
Dependent repos count: 1.1%
Average: 1.4%
Stargazers count: 2.1%
Forks count: 3.4%
Maintainers (2)
Last synced: 10 months ago
proxy.golang.org: github.com/dask/dask-ml
  • Versions: 43
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 5.5%
Average: 5.7%
Dependent repos count: 5.9%
Last synced: 10 months ago
conda-forge.org: dask-ml

Distributed and parallel machine learning using dask.

  • Versions: 27
  • Dependent Packages: 6
  • Dependent Repositories: 78
Rankings
Dependent repos count: 4.0%
Dependent packages count: 9.0%
Average: 9.6%
Forks count: 11.4%
Stargazers count: 13.8%
Last synced: 11 months ago
anaconda.org: dask-ml

Dask-ML provides scalable machine learning in Python using Dask alongside popular machine learning libraries like Scikit-Learn, XGBoost, and others.

  • Versions: 23
  • Dependent Packages: 1
  • Dependent Repositories: 78
Rankings
Dependent repos count: 20.7%
Forks count: 21.3%
Average: 24.5%
Stargazers count: 25.3%
Dependent packages count: 30.6%
Last synced: 10 months ago

Dependencies

setup.py pypi
  • dask *
.github/workflows/docs.yaml actions
  • JamesIves/github-pages-deploy-action 3.7.1 composite
  • actions/checkout v2 composite
  • conda-incubator/setup-miniconda v2 composite
.github/workflows/lint.yaml actions
  • actions/checkout v3 composite
  • actions/setup-python v3 composite
  • pre-commit/action v3.0.0 composite
.github/workflows/release.yaml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
.github/workflows/tests.yaml actions
  • actions/checkout v2 composite
  • conda-incubator/setup-miniconda v2 composite
.github/workflows/upstream.yml actions
  • actions/checkout v2 composite
  • actions/github-script v3 composite
  • conda-incubator/setup-miniconda v2 composite
  • xarray-contrib/ci-trigger v1 composite