vector-multiplication-cuda
Comparing approaches for CUDA-based vector multiplication.
Science Score: 41.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (5.3%) to scientific vocabulary
Keywords
Repository
Comparing approaches for CUDA-based vector multiplication.
Basic Info
- Host: GitHub
- Owner: puzzlef
- License: mit
- Language: C++
- Default Branch: main
- Homepage: https://gist.github.com/wolfram77/4ef16ab9699ac03a617b8731dd240e1f
- Size: 218 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Topics
Metadata Files
README.md
Comparing approaches for CUDA-based vector multiplication.
In each of the experiments given below, we multiply two floating-point vectors
x and y, with number of elements from 10^6 to 10^9 using OpenMP.
Each element count is attempted with various approaches, running each approach 5
times to get a good time measure. Multiplication here represents any
memory-aligned independent operation, or a map() operation.
Adjusting Launch config
In this experiment (adjust-launch), we multiply two floating-point vectors x
and y using CUDA. Each element count is attempted with various CUDA launch
configs. Results indicate that a grid_limit of 16384/32768, and a
block_size of 128/256 to be suitable for both float and double.
Using a grid_limit of MAX and a block_size of 256 could be a decent
choice.
Adjusting Thread duty
In this experiment (adjust-duty), we compare various per-thread duty numbers for CUDA-based vector multiplication. Each element count is attempted with various CUDA launch configs and per-thread duties. Results indicate no significant difference between adjust-launch approach, and this one.
References
Owner
- Name: puzzlef
- Login: puzzlef
- Kind: organization
- Website: https://puzzlef.github.io/
- Repositories: 10
- Profile: https://github.com/puzzlef
A summary of experiments.
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: Sahu
given-names: Subhajit
orcid: https://orcid.org/0000-0001-5140-6578
title: "puzzlef/multiply-cuda-adjust-launch: Comparing various launch configs for CUDA based vector multiply"
version: 1.0.0
doi: 10.5281/zenodo.5599206
date-released: 2021-10-26
GitHub Events
Total
- Push event: 1
Last Year
- Push event: 1
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
