libCEED
libCEED: Fast algebra for high-order element-based discretizations - Published in JOSS (2021)
The drake R package
The drake R package: a pipeline toolkit for reproducibility and high-performance computing - Published in JOSS (2018)
The targets R package
The targets R package: a dynamic Make-like function-oriented pipeline toolkit for reproducibility and high-performance computing - Published in JOSS (2021)
mpi4jax
mpi4jax: Zero-copy MPI communication of JAX arrays - Published in JOSS (2021)
torchquad
torchquad: Numerical Integration in Arbitrary Dimensions with PyTorch - Published in JOSS (2021)
Monte Carlo / Dynamic Code (MC/DC)
Monte Carlo / Dynamic Code (MC/DC): An accelerated Python package for fully transient neutron transport and rapid methods development - Published in JOSS (2024)
AMReX
AMReX: a framework for block-structured adaptive mesh refinement - Published in JOSS (2019)
Cabana
Cabana: A Performance Portable Library for Particle-Based Simulations - Published in JOSS (2022)
t8code - modular adaptive mesh refinement in the exascale era
t8code - modular adaptive mesh refinement in the exascale era - Published in JOSS (2025)
MPTRAC
MPTRAC: A high-performance Lagrangian transport model for atmospheric air parcel dispersion - Published in JOSS (2025)
madupite
madupite: A High-Performance Distributed Solver for Large-Scale Markov Decision Processes - Published in JOSS (2025)
The jagstargets R package
The jagstargets R package: a reproducible workflow framework for Bayesian data analysis with JAGS - Published in JOSS (2021)
The stantargets R package
The stantargets R package: a workflow framework for efficient reproducible Stan-powered Bayesian data analysis pipelines - Published in JOSS (2021)
BracketingNonlinearSolve
High-performance and differentiation-enabled nonlinear solvers (Newton methods), bracketed rootfinding (bisection, Falsi), with sparsity and Newton-Krylov support.
parpe
Parameter estimation for dynamical models using high-performance computing, batch and mini-batch optimizers, and dynamic load balancing.
t-elf
Tensor Extraction of Latent Features (T-ELF). Within T-ELF's arsenal are non-negative matrix and tensor factorization solutions, equipped with automatic model determination (also known as the estimation of latent factors - rank) for accurate data modeling. Our software suite encompasses cutting-edge data pre-processing and post-processing modules.
jurassic
The Juelich Rapid Spectral Simulation Code (JURASSIC) is a fast infrared radiative transfer model for the analysis of atmospheric remote sensing measurements.
iasi
This repository provides a collection of codes for the analysis of remote sensing observations of EUMETSAT's Infrared Atmospheric Sounding Interferometer (IASI).
gps
This repository provides a collection of codes for the analysis of GPS/RO observations.
https://github.com/stfc/psyclone
PSyclone is a source-to-source Fortran compiler designed to programmatically optimise, parallelise and instrument HPC applications via user-provided transformation scripts.
batchtools
batchtools: Tools for R to work on batch systems - Published in JOSS (2017)
fluidx3d
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
calibr
Parallelized Bayesian calibration of simulations using Gaussian process emulation
https://github.com/pypr/pysph
A framework for Smoothed Particle Hydrodynamics in Python
biomero
BIOMERO - A Python library for easily connecting OMERO (jobs) to a Slurm cluster
https://github.com/sciml/surrogates.jl
Surrogate modeling and optimization for scientific machine learning (SciML)
sundials
Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.
pyhpc-benchmarks
A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python
aspect
A parallel, extensible finite element code to simulate convection in both 2D and 3D models.
https://github.com/ornladios/adios2
Next generation of ADIOS developed in the Exascale Computing Program
airs
The AIRS Code Collection enables data processing and analysis for remote sensing observations captured by NASA's Atmospheric InfraRed Sounder.
PoissonRandom
Fast Poisson Random Numbers in pure Julia for scientific machine learning (SciML)
https://github.com/cans-world/cans
A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
kokkos
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
https://github.com/precice/precice
A coupling library for partitioned multi-physics simulations, including, but not restricted to fluid-structure interaction and conjugate heat transfer simulations.
PreallocationTools
Tools for building non-allocating pre-cached functions in Julia, allowing for GC-free usage of automatic differentiation in complex codes
https://github.com/exalearn/colmena
Library for steering campaigns of simulations on supercomputers
ntpoly
A massively parallel library for computing the functions of sparse matrices.
librapid
A highly optimised C++ library for mathematical applications and neural networks.
conduit
C++ library that wraps intra-thread, inter-thread, and inter-process communication in a uniform, modular, object-oriented interface, with a focus on asynchronous high-performance computing applications.
https://github.com/madsjulia/robustpmap.jl
Robust pmap calls for efficient parallelization and high-performance computing
https://github.com/sail-sg/envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
https://github.com/keichi/kedm
A high-performance implementation of Empirical Dynamic Modeling (EDM)
opencl-benchmark
A small OpenCL benchmark program to measure peak GPU/CPU performance.
pararealgpu.jl
A distributed and GPU-based implementation of the Parareal algorithm for parallel-in-time integration of equations of motion.
entropy-core-problem
Repository for supplementary material from my publications on the entropy core problem
https://github.com/bio-phys/mdbenchmark
Quickly generate, start and analyze benchmarks for molecular dynamics simulations.
https://github.com/hongbo-miao/hongbomiao.com
A personal research and development (R&D) lab that facilitates the sharing of knowledge.
https://github.com/austinksmith/Hamsters.js
100% Vanilla JavaScript Multithreading & Parallel Execution Library
https://github.com/cryotools/osaris
Scripts to facilitate parallel InSAR processing and analysis of Sentinel-1 time series on HPC clusters based on GMTSAR and Slurm.
https://github.com/google/tf-quant-finance
High-performance TensorFlow library for quantitative finance.
https://github.com/cran-task-views/highperformancecomputing
CRAN Task View: High-Performance and Parallel Computing with R
https://github.com/albumentations-team/albucore
A high-performance image processing library designed to optimize and extend the Albumentations library with specialized functions for advanced image transformations. Perfect for developers working in computer vision who require efficient and scalable image augmentation.
https://github.com/alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
https://github.com/madsjulia/biguq.jl
Bayesian Information Gap Decision Theory
https://github.com/agnostiqhq/covalent-cloud-github-workflow
Template for integrating Covalent Cloud's high-performance computing capabilities into GitHub Workflows
https://github.com/cran-task-views/modeldeployment
CRAN Task View: Model Deployment with R
https://github.com/mpes-kit/pesfit
Distributed multicomponent lineshape fitting routines and benchmarks for multidimensional spectroscopy and spectral imaging
https://github.com/gregyjames/tsunami
A High Performance C# wrapper that allows you to get the benefits of SIMD Intrinsics on List<T>.
https://github.com/hypershell/hypershell
Cross-platform, high-throughput computing utility for processing shell commands over a distributed, asynchronous queue.
thread-pool
BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library
CaNS-Fizzy
CaNS-Fizzy: A GPU-accelerated finite difference solver for turbulent two-phase flows - Published in JOSS (2025)
spinfoam_radiative_corrections
Codes for the computation of the radiative corrections to the EPRL spin foam propagator in covariant LQG.
delicoco-ieee-transactions
In compressed decentralized optimization settings, there are benefits to having multiple gossip steps between subsequent gradient iterations, even when the cost of doing so is appropriately accounted for, e.g., by reducing the precision of the compressed information.
hpc_submit_scripts
Slurm job scripts to run and analyze molecular dynamics simulations on high performance computers
geoslicer
Open source digital rocks software platform for micro-CT, CT, thin sections and borehole image analysis. Includes tools for: annotation, AI, HPC, porous media flow simulation, porosity analysis, permeability analysis and much more.
elexi
Open Source High-Order Unstructured Discontinuous Galerkin Fluid Dynamics Solver with Euler-Lagrange Extension