Updated 6 months ago
kernel_float
CUDA/HIP header-only library for low-precision (16 bit, 8 bit) and vectorized GPU kernel development
Updated 6 months ago
kernel_launcher
Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner