ck

Collective Knowledge (CK), Collective Mind (CM/CMX) and MLPerf automations: community-driven projects to facilitate collaborative and reproducible research and to learn how to run AI, ML, and other emerging workloads more efficiently and cost-effectively across diverse models, datasets, software, and hardware using MLPerf methodology and benchmarks

https://github.com/mlcommons/ck

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 3 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org, acm.org
✓
Committers with academic emails
2 of 32 committers (6.3%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (11.9%) to scientific vocabulary

Keywords

automation best-practices ck cknowledge cm cmind cmx collaboration ctuning education metadata mlops mlperf mlperf-automations mlperf-inference modularity optimization portability reusability workflows

Last synced: 6 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: mlcommons
License: apache-2.0
Language: Python
Default Branch: master
Homepage: https://access.cKnowledge.org
Size: 35.1 MB

Statistics

Stars: 629
Watchers: 49
Forks: 118
Open Issues: 9
Releases: 139

Topics

Created over 11 years ago · Last pushed 8 months ago

Metadata Files

Readme Changelog Contributing License Citation Codeowners

README.md

Collective Knowledge project (CK)

Collective Knowledge (CK) is a community-driven project dedicated to supporting open science, enhancing reproducible research, and fostering collaborative learning on how to run AI, ML, and other emerging workloads in the most efficient and cost-effective way across diverse models, data sets, software and hardware: [ white paper ].

It includes the following sub-projects.

Collective Mind project (MLCommons CM)

The Collective Mind automation framework (CM) was developed to support open science and facilitate collaborative, reproducible, and reusable research, development, and experimentation based on FAIR principles.

It helps users non-intrusively convert their software projects into file-based repositories of portable and reusable artifacts (code, data, models, scripts) with extensible metadata and reusable automations, a unified command-line interface, and a simple Python API.

Such artifacts can be easily chained together into portable and technology-agnostic automation workflows, enabling users to rerun, reproduce, and reuse complex experimental setups across diverse and rapidly evolving models, datasets, software, and hardware.

For example, CM helps to modularize, automate and customize MLPerf benchmarks.

Legacy CM API and CLI (2021-2024)

See the project page for more details.

Legacy and simplified CM and MLPerf automations were donated to MLCommons by Grigori Fursin, the cTuning foundation and OctoML. They are now supported by the MLCommons Infra WG (MLCFlow, MLC scripts, mlcr ...).

New CM API and CLI (CMX, 2025+)

Collective Mind eXtension or Common Metadata eXchange (CMX) is the next evolution of the Collective Mind automation framework (MLCommons CM) designed to enhance simplicity, flexibility, and extensibility of automations based on user feedback. It is backwards compatible with CM, released along with CM in the cmind package and can serve as drop-in replacement for CM and legacy MLPerf automations while providing a simpler and more robust interface.

See the project page and CMX4MLOps automations for more details.

MLOps and MLPerf automations

We have developed a collection of portable, extensible and technology-agnostic automation recipes with a common CLI and Python API (CM scripts) to unify and automate all the manual steps required to compose, run, benchmark and optimize complex ML/AI applications on diverse platforms with any software and hardware.

The two key automations are script and cache: see online catalog at CK playground, online MLCommons catalog.

CM scripts extend the concept of cmake with simple Python automations, native scripts and JSON/YAML meta descriptions. They require Python 3.8+ with minimal dependencies and are continuously extended by the community and MLCommons members to run natively on Ubuntu, MacOS, Windows, RHEL, Debian, Amazon Linux and any other operating system, in a cloud or inside automatically generated containers while keeping backward compatibility.

See the online MLPerf documentation at MLCommons to run MLPerf inference benchmarks across diverse systems using CMX. Just install pip install cmx4mlperf and substitute the following commands and flags: * cm -> cmx * mlc -> cmlc * mlcr -> cmlcr * -v -> --v

Collective Knowledge Playground

Collective Knowledge Playground - a unified and open-source platform designed to index all CM/CMX automations similar to PYPI and assist users in preparing CM/CMX commands to:

aggregate, process, visualize, and compare MLPerf benchmarking results for AI and ML systems
run MLPerf benchmarks
organize open and reproducible optimization challenges and tournaments.

Artifact Evaluation and Reproducibility Initiatives

Artifact Evaluation automation - a community-driven initiative leveraging CK, CM and CMX to automate artifact evaluation and support reproducibility efforts at ML and systems conferences.

Legacy projects

License

Apache 2.0

Copyright

Grigori Fursin, the cTuning foundation and OctoML donated this project to MLCommons to benefit everyone.

Author

Grigori Fursin

Maintainers

Legacy CM, CM4MLOps and MLPerf automations: MLCommons infra WG
CMX (the next generation of CM since 2025): Grigori Fursin

Concepts

To learn more about the motivation behind this project, please explore the following articles and presentations:

HPCA'25 article "MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AI": [ Arxiv ], [ tutorial to reproduce results using CM/CMX ]
NeuralMagic's vLLM MLPerf inference 4.1 submission automated by CM: [README]
SDXL MLPerf inference 4.1 submission automated by CM: [README]
"Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournaments": [ ArXiv ]
ACM REP'23 keynote about the MLCommons CM automation framework: [ slides ]
ACM TechTalk'21 about Collective Knowledge project: [ YouTube ] [ slides ]
Journal of Royal Society'20: [ paper ]

Acknowledgments

This open-source project was created by Grigori Fursin and sponsored by cTuning.org, OctoAI and HiPEAC. Grigori donated this project to MLCommons to modularize and automate MLPerf benchmarks, benefit the community, and foster its development as a collaborative, community-driven effort.

We thank MLCommons, FlexAI and cTuning for supporting this project, as well as our dedicated volunteers and collaborators for their feedback and contributions!

If you found the CM, CMX and MLPerf automations helpful, kindly reference this article: [ ArXiv ], [ BibTex ].

You are welcome to contact the author to discuss long-term plans and potential collaboration.

Owner

Name: MLCommons
Login: mlcommons
Kind: organization

Repositories: 79
Profile: https://github.com/mlcommons

Citation (citation.bib)

@misc{fursin2024enabling,
      title={Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournaments}, 
      author={Grigori Fursin},
      year={2024},
      eprint={2406.16791},
      archivePrefix={arXiv},
      primaryClass={id='cs.LG' full_name='Machine Learning' is_active=True alt_name=None in_archive='cs' is_general=False description='Papers on all aspects of machine learning research (supervised, unsupervised, reinforcement learning, bandit problems, and so on) including also robustness, explanation, fairness, and methodology. cs.LG is also an appropriate primary category for applications of machine learning methods.'}
}

@article{doi:10.1098/rsta.2020.0211,
    author = {Fursin, Grigori},
    title = {Collective knowledge: organizing research projects as a database of reusable components and portable workflows with common interfaces},
    journal = {Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences},
    volume = {379},
    number = {2197},
    pages = {20200211},
    year = {2021},
    doi = {10.1098/rsta.2020.0211},
    URL = {https://royalsocietypublishing.org/doi/abs/10.1098/rsta.2020.0211},
    eprint = {https://royalsocietypublishing.org/doi/pdf/10.1098/rsta.2020.0211},
    howpublished = {https://doi.org/10.1098/rsta.2020.0211}
}

@misc{acm_rep_23_cm_keynote,
    author = {Fursin, Grigori},
    year = "2023",
    booktitle = "{Keynote at the 1st ACM conference on reproducibility and replicability (ACM REP'23)}",
    title = "Collective Mind: toward a common language to facilitate reproducible research and technology transfer",
    howpublished = {https://doi.org/10.5281/zenodo.8105339}
}

GitHub Events

Total

Create event: 26
Issues event: 44
Release event: 16
Watch event: 32
Delete event: 7
Issue comment event: 104
Push event: 101
Pull request review event: 51
Pull request event: 128
Fork event: 8

Last Year

Create event: 26
Issues event: 44
Release event: 16
Watch event: 32
Delete event: 7
Issue comment event: 104
Push event: 101
Pull request review event: 51
Pull request event: 128
Fork event: 8

Committers

Last synced: almost 3 years ago

All Time

Total Commits: 3,737
Total Committers: 32
Avg Commits per committer: 116.781
Development Distribution Score (DDS): 0.581

Top Committers

Name	Email	Commits
Grigori Fursin	G**n@c**g	1,567
Arjun Suresh	a**7@g**m	1,365
Grigori Fursin	g**n@c**g	501
Grigori Fursin	g**n@g**m	103
Grigori Fursin	g**n@u**m	68
dsavenko	d**o@x**m	40
Grigori Fursin	G**n@C**i	15
anandhu-eng	a**s@g**m	13
Anton Lokhmotov	a**n@d**m	12
Thomas Zhu	t**h@h**m	8
Grigori Fursin	c**x@c**t	5
Grigori Fursin	G**n@c**i	5
Leo Gordon	e**4@u**m	4
Anton Lokhmotov	a**v@g**m	4
Ailurus1	i**5@g**m	4
Grigori Fursin	g**n@c**i	3
Ailurus1	k**2@y**u	2
Grigori Fursin	g**i@o**i	2
Grigori Fursin	f**e@h**m	2
Himanshu Dutta	m**a@g**m	2
Alexey Kravets	a**s@a**m	1
Bruno Ferreira	b**f@g**m	1
Dave Greasley	D**y@u**m	1
Ailurus1	k**2@y**u	1
alered01	4**1@u**m	1
xintin	g**a@c**u	1
Ubuntu	u**u@i**l	1
Lahiru Rasnayake	l**e@n**o	1
Grigori Fursin	f**n@c**e	1
Ilya Kozulin	7**1@u**m	1
and 2 more...

Committer Domains (Top 20 + Academic)

codereef.ai: 3 ctuning.org: 3 ck.home: 1 ntnu.no: 1 ip-172-31-57-60.us-west-2.compute.internal: 1 cs.stonybrook.edu: 1 yandex.ru: 1 arm.com: 1 host49.hostmonster.com: 1 octoml.ai: 1 ya.ru: 1 cindex.3t4iphb3vewuhgtfvuyrpfazke.bx.internal.cloudapp.net: 1 dividiti.com: 1 xored.com: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 194
Total pull requests: 552
Average time to close issues: 4 months
Average time to close pull requests: about 10 hours
Total issue authors: 40
Total pull request authors: 20
Average comments per issue: 2.79
Average comments per pull request: 1.17
Merged pull requests: 505
Bot issues: 0
Bot pull requests: 23

Past Year

Issues: 34
Pull requests: 126
Average time to close issues: 7 days
Average time to close pull requests: about 5 hours
Issue authors: 3
Pull request authors: 7
Average comments per issue: 1.03
Average comments per pull request: 1.17
Merged pull requests: 110
Bot issues: 0
Bot pull requests: 5

View more stats

Top Authors

Issue Authors

gfursin (105)
arjunsuresh (28)
anandhu-eng (6)
KingICCrab (5)
WarrenSchultz (4)
willamloo3192 (4)
jdesfossez (2)
sahilavaran (2)
JoachimMoe (2)
dmgusev (1)
Sachin-23 (1)
Seemapatil1998 (1)
Agalakdak (1)
keithachorn-intel (1)
Adityashaw (1)

Pull Request Authors

gfursin (344)
arjunsuresh (279)
dependabot[bot] (32)
ctuning-admin (26)
anandhu-eng (8)
Ailurus1 (7)
jdesfossez (3)
pnfox (2)
Chaitanya110703 (2)
Dnaynu (2)
himanshu-dutta (2)
nathanw-mlc (1)
Maximallnyi (1)
SennikovAndrey (1)
morphine00 (1)

Top Labels

Issue Labels

enhancement (49) cmx-core (24) cm-script-automation (16) bug (14) cm-mlperf (13) cm-core (12) cm-documentation (7) help wanted (6) cm-tests (4) cm-maintenance (3) requesting-community-help (2) waiting for user input (2) cm-cache-automation (2) cm-mlperf-reproducibility-matrix (2) question (2) mlperf-automation (1) reproducibility (1) cm-docker (1) nice-to-have (1) not-cm-issue (1) cm-mlperf-power (1) mil (1) reproducibility-initiatives (1)

Pull Request Labels

dependencies (32) python (4)

Packages

Total packages: 8
Total downloads:
- pypi 7,422 last-month
Total docker downloads: 6,069

Total dependent packages: 0
(may contain duplicates)
Total dependent repositories: 27
(may contain duplicates)
Total versions: 217
Total maintainers: 1

pypi.org: ck

Collective Knowledge - a lightweight knowledge manager to organize, cross-link, share and reuse artifacts and workflows based on FAIR principles

Homepage: https://github.com/mlcommons/ck
Documentation: https://ck.readthedocs.io/
License: Apache 2.0
Latest release: 2.6.4
published over 1 year ago

Versions: 97
Dependent Packages: 0
Dependent Repositories: 23
Downloads: 692 Last month
Docker Downloads: 6,069

Rankings

Docker downloads count: 1.8%

Stargazers count: 2.7%

Dependent repos count: 3.0%

Downloads: 4.3%

Average: 4.4%

Forks count: 4.6%

Dependent packages count: 10.1%

Maintainers (1)

gfursin

Last synced: 6 months ago

pypi.org: cmind

Common Metadata eXchange framework (CMX) and Collective Mind automation framework (CM)

Homepage: https://cKnowledge.org
Documentation: https://docs.mlcommons.org/ck
License: Apache 2.0
Latest release: 4.2.0
published 10 months ago

Versions: 111
Dependent Packages: 0
Dependent Repositories: 4
Downloads: 6,369 Last month

Rankings

Stargazers count: 2.7%

Downloads: 4.4%

Forks count: 4.6%

Average: 5.8%

Dependent repos count: 7.5%

Dependent packages count: 10.1%

Maintainers (1)

gfursin

Last synced: 6 months ago

pypi.org: cmind4mlperf

TBD

Homepage: https://github.com/mlcommons/ck
Documentation: https://cmind4mlperf.readthedocs.io/
License: Apache Software License (Apache 2.0)
Latest release: 0.0.0.2
published over 1 year ago

Versions: 2
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 100 Last month

Rankings

Stargazers count: 3.5%

Forks count: 5.7%

Dependent packages count: 10.8%

Average: 20.2%

Dependent repos count: 60.7%

Maintainers (1)

gfursin

Last synced: about 1 year ago

pypi.org: cmx4mlperf

CMX4MLPerf repository with legacy MLPerf automations

Homepage: https://github.com/mlcommons/ck
Documentation: https://docs.mlcommons.org/ck
License: Apache 2.0
Latest release: 1.0.0
published about 1 year ago

Versions: 1
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 45 Last month

Rankings

Dependent packages count: 9.6%

Average: 31.9%

Dependent repos count: 54.1%

Maintainers (1)

gfursin

Last synced: 6 months ago

pypi.org: cmx4mlops

CMX4MLOps repository

Homepage: https://github.com/mlcommons/ck
Documentation: https://cmx4mlops.readthedocs.io/
License: Apache 2.0
Latest release: 1.0.1
published about 1 year ago

Versions: 3
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 31 Last month

Rankings

Dependent packages count: 9.8%

Average: 32.4%

Dependent repos count: 54.9%

Maintainers (1)

gfursin

Last synced: 6 months ago

pypi.org: cmind4system

TBD

Homepage: https://github.com/mlcommons/ck
Documentation: https://cmind4system.readthedocs.io/
License: Apache Software License (Apache 2.0)
Latest release: 0.0.0.1
published over 1 year ago

Versions: 1
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 64 Last month

Rankings

Dependent packages count: 10.8%

Average: 35.8%

Dependent repos count: 60.7%

Maintainers (1)

gfursin

Last synced: about 1 year ago

pypi.org: cm4system

TBD

Homepage: https://github.com/mlcommons/ck
Documentation: https://cm4system.readthedocs.io/
License: Apache Software License (Apache 2.0)
Latest release: 0.0.0.1
published over 1 year ago

Versions: 1
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 62 Last month

Rankings

Dependent packages count: 10.8%

Average: 35.8%

Dependent repos count: 60.7%

Maintainers (1)

gfursin

Last synced: about 1 year ago

pypi.org: cm4research

TBD

Homepage: https://github.com/mlcommons/ck
Documentation: https://cm4research.readthedocs.io/
License: Apache Software License (Apache 2.0)
Status: removed
Latest release: 0.0.0.1
published over 1 year ago

Versions: 1
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 59 Last month

Rankings

Dependent packages count: 10.8%

Average: 35.8%

Dependent repos count: 60.7%

Maintainers (1)

gfursin

Last synced: 6 months ago

Dependencies

.github/workflows/cla.yml actions

mlcommons/cla-bot master composite

.github/workflows/test-cm-script-features.yml actions

actions/checkout v3 composite
actions/setup-python v3 composite

.github/workflows/test-cm-scripts.yml actions

actions/checkout v3 composite
actions/setup-python v3 composite

.github/workflows/test-cm.yml actions

actions/checkout v3 composite
actions/setup-python v3 composite

.github/workflows/test-mlperf-inference-bert.yml actions

actions/checkout v3 composite
actions/setup-python v3 composite

.github/workflows/test-mlperf-inference-resnet50.yml actions

actions/checkout v3 composite
actions/setup-python v3 composite

.github/workflows/test-mlperf-inference-retinanet.yml actions

actions/checkout v3 composite
actions/setup-python v3 composite

.github/workflows/test-mlperf-inference-tvm.yml actions

actions/checkout v3 composite
actions/setup-python v3 composite

ck/incubator/cbench/setup.py pypi

ck *
click >=7.0
requests *
virtualenv *

ck/incubator/cdatabase/setup.py pypi

ck *
pyyaml *

ck/requirements.txt pypi

pyyaml *
requests *

cm/requirements.txt pypi

pyyaml *
requests *

cm/setup.py pypi

pyyaml *
requests *

cm-mlops/requirements.txt pypi

cmind >=1.0.3
requests *

cm-mlops/script/app-image-classification-onnx-py/requirements.txt pypi

Pillow *
numpy *
opencv-python *

cm-mlops/script/app-image-classification-torch-py/requirements.txt pypi

Pillow *
numpy *
requests *

cm-mlops/script/app-image-classification-tvm-onnx-py/requirements.txt pypi

attrs *
decorator *
matplotlib *
onnx *
opencv-python *
psutil *
scipy *

cm-mlops/script/get-sys-utils-cm/requirements.txt pypi

giturlparse *
numpy *
pandas *
requests *
wheel *

requirements.txt pypi

pyyaml *