vector-max-cuda
Performance of sequential vs CUDA-based vector element max.
README.md
Comparing performance of sequential vs CUDA-based vector element max.
For each experiment below, we try each approach on a range of vector sizes, running each approach 5 times per size to get a stable time measurement. The time taken to copy data to and from the GPU is not measured, and the sequential approach does not use SIMD instructions. The experiments were done with guidance from Prof. Kishore Kothapalli and Prof. Dip Sankar Banerjee.
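The measurement scheme described above can be sketched on the host side as follows. This is a minimal sketch, not the repository's actual harness: the names `maxSequential` and `measureMs` are hypothetical, and averaging the 5 runs is an assumption (the text only says each approach is run 5 times per size).

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sequential max of a non-empty 32-bit integer vector
// (single thread, no explicit SIMD). Hypothetical name.
int32_t maxSequential(const std::vector<int32_t>& x) {
  assert(!x.empty());
  int32_t a = x[0];
  for (std::size_t i = 1; i < x.size(); ++i)
    if (x[i] > a) a = x[i];
  return a;
}

// Runs fn() `repeat` times and returns the mean wall-clock time
// per run in milliseconds. Taking the mean is an assumption.
template <class F>
double measureMs(F fn, int repeat = 5) {
  assert(repeat > 0);
  using clock = std::chrono::steady_clock;
  auto start = clock::now();
  for (int r = 0; r < repeat; ++r) fn();
  auto stop = clock::now();
  std::chrono::duration<double, std::milli> d = stop - start;
  return d.count() / repeat;
}
```

To mirror the setup described here, a GPU variant would be timed the same way, with the timed callable wrapping only the kernel launch and its completion wait, so that host-to-device and device-to-host copies stay outside the measured region.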
Comparison with Sequential approach
This experiment (compare-sequential, main) compares the performance of
finding max(x) using a single thread (sequential) with finding it using
CUDA (with a not-power-of-2 reduce and with a power-of-2 reduce). Here x
is a 32-bit integer vector. While it might seem that the CUDA approach
would be a clear winner, the results indicate that this depends on the
workload: from 10^5 elements onward, the CUDA approach performs better
than the sequential one. The two CUDA approaches (not-power-of-2 and
power-of-2 reduce) seem to have similar performance. All outputs are saved
in a gist. Some charts, generated from sheets, are also included.
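To make the difference between the two reduce variants concrete, here is a CPU-side model of a pairwise (tree) max reduction in both forms. It is a sketch under stated assumptions: the function names are hypothetical, the code runs on the host rather than the GPU, and the way the not-power-of-2 case is handled (rounding the half up so a leftover element carries over) is one common scheme, not necessarily the one used in this repository. In a CUDA kernel, each iteration of the inner loop would be done by one thread, with a block-wide barrier between rounds.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Power-of-2 reduce: the active range halves exactly each round.
// Requires a.size() to be a power of 2. Hypothetical name.
int32_t maxReducePow2(std::vector<int32_t> a) {
  std::size_t n = a.size();
  assert(n > 0 && (n & (n - 1)) == 0);
  for (std::size_t s = n / 2; s > 0; s /= 2) {
    // One "thread" per i; reads (i + s) and writes (i) never overlap.
    for (std::size_t i = 0; i < s; ++i)
      a[i] = std::max(a[i], a[i + s]);
  }
  return a[0];
}

// Not-power-of-2 reduce: the half is rounded up, so when n is odd
// the middle element is left untouched and joins the next round.
int32_t maxReduceAnySize(std::vector<int32_t> a) {
  std::size_t n = a.size();
  assert(n > 0);
  while (n > 1) {
    std::size_t h = (n + 1) / 2;  // size of the next round
    for (std::size_t i = 0; i < n / 2; ++i)
      a[i] = std::max(a[i], a[i + h]);
    n = h;
  }
  return a[0];
}
```

Both variants perform the same number of comparisons overall (one fewer than the element count) and differ only in index arithmetic, which is consistent with the similar performance reported for the two CUDA approaches.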
Owner
- Name: puzzlef
- Login: puzzlef
- Kind: organization
- Website: https://puzzlef.github.io/
- Repositories: 10
- Profile: https://github.com/puzzlef
A summary of experiments.
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Sahu
    given-names: Subhajit
    orcid: https://orcid.org/0000-0001-5140-6578
title: "puzzlef/max-sequential-vs-cuda: Performance of sequential vs CUDA-based vector element max "
version: 1.0.0
doi: 10.5281/zenodo.7256199
date-released: 2022-10-27

