Triumvirate
Triumvirate: A Python/C++ package for three-point clustering measurements - Published in JOSS (2023)
celeritas
Celeritas is a new Monte Carlo transport code designed to accelerate scientific discovery in high energy physics by improving detector simulation throughput and energy efficiency using GPUs.
cudawrappers
C++ wrapper for the Nvidia/HIP C libraries (e.g. CUDA driver, nvrtc, hiprtc, cuFFT, hipFFT, etc.)
kernel_float
CUDA/HIP header-only library for low-precision (16 bit, 8 bit) and vectorized GPU kernel development
software-engineer
A curated learning repository focused on High-Performance Computing (HPC) — covering fundamentals to advanced topics in CUDA, MPI, C++, and Python-C++ interoperability.
Ginkgo
Ginkgo: A high performance numerical linear algebra library - Published in JOSS (2020)
simulateqcd
SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it easy for physicists to implement lattice QCD formulas while still providing competitive performance.
kmm
KMM: parallel dataflow scheduler and efficient memory management for multi-GPU platforms
PAAT
PAAT: Physical Activity Analysis Toolbox for the analysis of hip-worn raw accelerometer data in Python - Published in JOSS (2025)