libCEED
libCEED: Fast algebra for high-order element-based discretizations - Published in JOSS (2021)
Executorlib – Up-scaling Python workflows for hierarchical heterogenous high-performance computing
Executorlib – Up-scaling Python workflows for hierarchical heterogenous high-performance computing - Published in JOSS (2025)
Collaborative Container Modules with Singularity Registry HPC
Collaborative Container Modules with Singularity Registry HPC - Published in JOSS (2021)
Easy SimAuto (ESA)
Easy SimAuto (ESA): A Python Package that Simplifies Interacting with PowerWorld Simulator - Published in JOSS (2020)
DeepHyper
DeepHyper: A Python Package for Massively Parallel Hyperparameter Optimization in Machine Learning - Published in JOSS (2025)
Open OnDemand
Open OnDemand: A web-based client portal for HPC centers - Published in JOSS (2018)
Cabana
Cabana: A Performance Portable Library for Particle-Based Simulations - Published in JOSS (2022)
t8code - modular adaptive mesh refinement in the exascale era
t8code - modular adaptive mesh refinement in the exascale era - Published in JOSS (2025)
Shenfun
Shenfun: High performance spectral Galerkin computing platform - Published in JOSS (2018)
$hp\mathrm{3D}$
$hp\mathrm{3D}$: A Scalable MPI/OpenMP $hp$-Adaptive Finite Element Software Library for Complex Multiphysics Applications - Published in JOSS (2024)
Launcher
Launcher: A simple tool for executing high throughput computing workloads - Published in JOSS (2017)
FeenoX
FeenoX: a cloud-first finite-element(ish) computational engineering tool - Published in JOSS (2024)
WRF-CMake
WRF-CMake: integrating CMake support into the Advanced Research WRF (ARW) modelling system - Published in JOSS (2019)
SCAS dashboard
SCAS dashboard: A tool to intuitively and interactively analyze Slurm cluster usage - Published in JOSS (2024)
umpire
An application-focused API for memory management on NUMA & GPU architectures
heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hatchet
Graph-indexed Pandas DataFrames for analyzing hierarchical performance data
t-elf
Tensor Extraction of Latent Features (T-ELF). Within T-ELF's arsenal are non-negative matrix and tensor factorization solutions, equipped with automatic model determination (also known as the estimation of latent factors - rank) for accurate data modeling. Our software suite encompasses cutting-edge data pre-processing and post-processing modules.
llnl-thicket
https://github.com/hpcaitech/colossalai
Making large AI models cheaper, faster and more accessible
perun
Perun is a Python package that measures the energy consumption of your applications.
pennylane-lightning
The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.
Containershare
Containershare: Open Source Registry to build, test, deploy with CircleCI - Published in JOSS (2018)
batchtools
batchtools: Tools for R to work on batch systems - Published in JOSS (2017)
cromwell
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
https://github.com/craylabs/smartredis
SmartSim Infrastructure Library Clients.
numba-mpi
Numba @jit compatible wrappers for MPI C API tested on Linux, macOS and Windows
compss
COMP Superscalar (COMPSs) is a framework which aims to ease the development and execution of applications for distributed infrastructures, such as Clusters, Grids and Clouds.
fluidx3d
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
devito
DSL and compiler framework for automated finite-differences and stencil computation
sundials
Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.
https://github.com/sylabs/singularity
SingularityCE is the Community Edition of Singularity, an open source container platform designed to be simple, fast, and secure.
udocker
A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.
https://github.com/changliao1025/pyearth
A leatherman alike Python package for Earth science
https://github.com/agnostiqhq/covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
https://github.com/ornladios/adios2
Next generation of ADIOS developed in the Exascale Computing Program
seissol
A scientific software for the numerical simulation of seismic wave phenomena and earthquake dynamics
swift
Modern astrophysics and cosmology particle-based code. Mirror of gitlab developments at https://gitlab.cosma.dur.ac.uk/swift/swiftsim
https://github.com/bluebrain/nmodl
Code Generation Framework For NEURON MODeling Language
https://github.com/lablup/backend.ai
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.
https://github.com/akio-tomiya/latticeqcd.jl
A native Julia code for lattice QCD with dynamical fermions in 4 dimension.
rho-diffusion
Parameterizing n-dimensional density fields using denoising diffusion probabilistic models
cpu-performance-tests
This repository contains the code to benchmark CPU cache miss latency and branch misprediction penalty
https://github.com/pegasus-isi/pegasus
Pegasus Workflow Management System - Automate, recover, and debug scientific computations.
https://github.com/easybuilders/easybuild
EasyBuild - building software with ease
suanpan
🧮 An Open Source, Parallel and Heterogeneous Finite Element Analysis Framework
future
:rocket: R package: future: Unified Parallel and Distributed Processing in R for Everyone
https://github.com/madsjulia/robustpmap.jl
Robust pmap calls for efficient parallelization and high-performance computing
https://github.com/futureverse/future.apply
:rocket: R package: future.apply - Apply Function to Elements in Parallel using Futures
opencl-benchmark
A small OpenCL benchmark program to measure peak GPU/CPU performance.
https://github.com/aces/cbrain
CBRAIN is a flexible Ruby on Rails framework for accessing and processing of large data on high-performance computing infrastructures.
doFuture
:rocket: R package: doFuture - Use Foreach to Parallelize via Future Framework
https://github.com/llnl/callflow
Visualization tool for analyzing call trees and graphs
https://github.com/futureverse/future.batchtools
:rocket: R package future.batchtools: A Future API for Parallel and Distributed Processing using batchtools