vector-max-cuda

Performance of sequential vs CUDA-based vector element max.

https://github.com/puzzlef/vector-max-cuda

Science Score: 41.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓ CITATION.cff file: Found CITATION.cff file
✓ codemeta.json file: Found codemeta.json file
○ .zenodo.json file
○ DOI references
✓ Academic publication links: Links to zenodo.org
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (8.8%) to scientific vocabulary

Keywords

basics cuda element experiment max vector

Last synced: 6 months ago

Repository

Basic Info

Host: GitHub
Owner: puzzlef
License: mit
Language: C++
Default Branch: main
Homepage:
Size: 57.6 KB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 1

Created over 3 years ago · Last pushed about 3 years ago

Metadata Files

Readme License Citation

README.md

Comparing performance of sequential vs CUDA-based vector element max.

For each experiment below, each approach is run on a range of vector sizes, 5 times per size, to obtain a reliable time measurement. The time taken to copy data to and from the GPU is not measured, and the sequential approach does not use SIMD instructions. The experiments were done with guidance from Prof. Kishore Kothapalli and Prof. Dip Sankar Banerjee.
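
The repeated-run measurement described above could look roughly like the following sketch. The helper name measureDurationMs and the choice to report the mean of the runs are assumptions made for illustration; the repository may aggregate its 5 runs differently.

```cpp
#include <cassert>
#include <chrono>

// Hypothetical helper (not taken from the repository): calls fn() `repeat`
// times and returns the mean wall-clock time per call in milliseconds.
// Averaging is an assumption; the README only says each approach runs 5 times.
template <class F>
double measureDurationMs(F fn, int repeat = 5) {
  assert(repeat > 0);
  using Clock = std::chrono::steady_clock;
  double totalMs = 0.0;
  for (int r = 0; r < repeat; ++r) {
    auto start = Clock::now();
    fn();  // only the work inside fn is timed
    auto stop = Clock::now();
    totalMs += std::chrono::duration<double, std::milli>(stop - start).count();
  }
  return totalMs / repeat;
}
```

To match the setup described here, host-to-device and device-to-host copies would stay outside fn, and a GPU variant of fn would need to wait for the kernel to finish (for example with cudaDeviceSynchronize) before returning, since kernel launches are asynchronous.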

Comparison with Sequential approach

This experiment (compare-sequential, main) compares the performance of finding max(x) using a single thread (sequential) against using CUDA (not power-of-2 and power-of-2 reduce). Here, x is a 32-bit integer vector. While the CUDA approach might seem like a clear winner, the results indicate that the outcome depends on the workload: from 10^5 elements onward, the CUDA approach performs better than the sequential one. Both CUDA approaches (not power-of-2 and power-of-2 reduce) seem to have similar performance. All outputs are saved in a gist. Some charts, generated from sheets, are also included below.
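
To make the three variants concrete, here is a minimal host-side sketch in C++. It is not the repository's code: the function names are hypothetical, and the two reduce functions only emulate on the CPU the pairwise tree pattern that a CUDA block-level reduce commonly uses, where each inner-loop iteration stands in for one GPU thread and each outer step for one synchronization round.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sequential baseline: a single pass over a non-empty vector.
std::int32_t maxSequential(const std::vector<std::int32_t>& x) {
  assert(!x.empty());
  std::int32_t m = x[0];
  for (std::int32_t v : x) m = std::max(m, v);
  return m;
}

// Emulated tree reduce for a power-of-2 length: the active range halves
// exactly at every step, so no bounds check is needed inside the loop.
std::int32_t maxReducePow2(std::vector<std::int32_t> a) {
  std::size_t n = a.size();
  assert(n > 0 && (n & (n - 1)) == 0);  // length must be a power of 2
  for (std::size_t half = n / 2; half > 0; half /= 2)
    for (std::size_t i = 0; i < half; ++i)  // one "thread" per i
      a[i] = std::max(a[i], a[i + half]);
  return a[0];
}

// Emulated tree reduce for any length: the half is rounded up, and an
// element without a partner is carried over unchanged to the next step.
std::int32_t maxReduceAnySize(std::vector<std::int32_t> a) {
  std::size_t n = a.size();
  assert(n > 0);
  while (n > 1) {
    std::size_t half = (n + 1) / 2;
    for (std::size_t i = 0; i + half < n; ++i)  // guard the unpaired element
      a[i] = std::max(a[i], a[i + half]);
    n = half;
  }
  return a[0];
}
```

Both reduce variants perform about log2(N) steps, and the only difference is the extra bounds guard in the any-size version, which is consistent with the two CUDA approaches showing similar performance. The sequential loop has no launch or synchronization overhead, which is one plausible reason it remains competitive on small vectors.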

Owner

Name: puzzlef
Login: puzzlef
Kind: organization

Website: https://puzzlef.github.io/
Repositories: 10
Profile: https://github.com/puzzlef

A summary of experiments.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Sahu
    given-names: Subhajit
    orcid: https://orcid.org/0000-0001-5140-6578
title: "puzzlef/max-sequential-vs-cuda: Performance of sequential vs CUDA-based vector element max "
version: 1.0.0
doi: 10.5281/zenodo.7256199
date-released: 2022-10-27

GitHub Events

Total

Push event: 1

Last Year

Push event: 1

Issues and Pull Requests

Last synced: 11 months ago

All Time

Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0
