Executorlib – Up-scaling Python workflows for hierarchical heterogenous high-performance computing
Executorlib – Up-scaling Python workflows for hierarchical heterogenous high-performance computing - Published in JOSS (2025)
mpi4jax
mpi4jax: Zero-copy MPI communication of JAX arrays - Published in JOSS (2021)
The Python Sky Model 3 software
The Python Sky Model 3 software - Published in JOSS (2021)
DeepHyper
DeepHyper: A Python Package for Massively Parallel Hyperparameter Optimization in Machine Learning - Published in JOSS (2025)
schwimmbad
schwimmbad: A uniform interface to parallel processing pools in Python - Published in JOSS (2017)
t8code - modular adaptive mesh refinement in the exascale era
t8code - modular adaptive mesh refinement in the exascale era - Published in JOSS (2025)
ParaMonte
ParaMonte: A high-performance serial/parallel Monte Carlo simulation library for C, C++, Fortran - Published in JOSS (2021)
GridapDistributed
GridapDistributed: a massively parallel finite element toolbox in Julia - Published in JOSS (2022)
madupite
madupite: A High-Performance Distributed Solver for Large-Scale Markov Decision Processes - Published in JOSS (2025)
MPIFiles.jl
MPIFiles.jl: A Julia Package for Magnetic Particle Imaging Files - Published in JOSS (2019)
neworder
neworder: a dynamic microsimulation framework for Python - Published in JOSS (2021)
Excimontec v1.0
Excimontec v1.0: An Open-Source Software Tool for Kinetic Monte Carlo Simulations of Organic Electronic Devices - Published in JOSS (2020)
sboxgates
sboxgates: A program for finding low gate count implementations of S-boxes - Published in JOSS (2021)
SARAS
SARAS: A general-purpose PDE solver for fluid dynamics - Published in JOSS (2021)
heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
perun
Perun is a Python package that measures the energy consumption of your applications.
pennylane-lightning
The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.
sqsgenerator
A command line tool written in Python/C++ for finding optimized SQS structures
numba-mpi
Numba @jit compatible wrappers for MPI C API tested on Linux, macOS and Windows
pigeons.jl
Sampling from intractable distributions, with support for distributed and parallel methods
rockyml
⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems
cato
Automatic source transformation to apply HPC frameworks with minimal user interaction
gchp
The "superproject" wrapper repository for GCHP, the high-performance instance of the GEOS-Chem chemical-transport model.
necsim-rust
Spatially explicit biodiversity simulations using a parallel library written in Rust
swift
Modern astrophysics and cosmology particle-based code. Mirror of gitlab developments at https://gitlab.cosma.dur.ac.uk/swift/swiftsim
https://github.com/magneticparticleimaging/mpimeasurements.jl
Julia package for performing MPI measurements
https://github.com/becksteinlab/mdanalysis-with-dask
Benchmarking MDAnalysis with dask.
https://github.com/powsybl/powsybl-hpc
High Performance Computing modules for powsybl
https://github.com/radiantone/blazer
An HPC abstraction over MPI with built-in parallel compute primitives
https://github.com/byt3n33dl3/passwordcracker
Attacking Assimilation-bridge by looping with SNMPv3 & GPG, usingas CPUs, FPGAs cycles: 0x989, most advanced password and logon recovery.
shamrock
The Shamrock Framework, an open-source, multi-GPU hydrodynamics framework for astrophysics. Scales seamlessly from laptops to exascale supercomputers, supporting SPH, AMR, and more.
maplib
MapLib provides algorithms for generating mapping of processing elements to processor unities.
https://github.com/becksteinlab/parallel-analysis-in-the-mdanalysis-library
Benchmarking MDAnalysis with Dask (and MPI). Supplementary Information for SciPy 2017 paper.
https://github.com/adamouization/relaxation-technique-parallel-computing
:repeat: Relaxation technique using POSIX threads (shared memory configuration) and MPI (distributed memory configuration).
simulateqcd
SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it easy for physicists to implement lattice QCD formulas while still providing competitive performance.
parallel-etudes
Parallel programming assignments using OpenMP, MPI and CUDA/OpenCL
ocgis
OpenClimateGIS is a set of geoprocessing and calculation tools for CF-compliant climate datasets.