Caffeine
Caffeine: A parallel runtime library for supporting modern Fortran compilers - Published in JOSS (2025)
parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
librapid
A highly optimised C++ library for mathematical applications and neural networks.
Synch
Synch: A framework for concurrent data-structures and benchmarks - Published in JOSS (2021)
https://github.com/madsjulia/robustpmap.jl
Robust pmap calls for efficient parallelization and high-performance computing
cudamatrixtranspose
Optimizing matrix transposition on GPU with CUDA (University of Trento, Italy)
https://github.com/adamouization/relaxation-technique-parallel-computing
:repeat: Relaxation technique using POSIX threads (shared memory configuration) and MPI (distributed memory configuration).
https://github.com/alexkranias/triton_vs_cuda
Building Triton and CUDA kernels side-by-side to create a cuBLAS-performant GEMM kernel.
https://github.com/beehive-lab/tornadovm
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
pyqkd
Repository with codes for simulating and optimising quantum key distribution protocols.
parallel-etudes
Parallel programming assignments using OpenMP, MPI and CUDA/OpenCL
https://github.com/aabs/actorsrcgen
ActorSrcGen is a C# Source Generator allowing the conversion of simple C# classes into dataflow compatible pipelines supporting the actor model.
https://github.com/bluebrain/bluepyparallel
Provides an embarrassingly parallel tool with sql backend
https://github.com/charmplusplus/charm4py
Parallel Programming with Python and Charm++