Scientific Software
Updated 10 months ago

libCEED — Peer-reviewed • Rank 18.8 • Science 100%

libCEED: Fast algebra for high-order element-based discretizations - Published in JOSS (2021)

Economics (40%)
Scientific Software · Peer-reviewed
Scientific Software
Updated 10 months ago

New developments in PySDM and PySDM-examples v2 — Peer-reviewed • Rank 18.2 • Science 95%

New developments in PySDM and PySDM-examples v2: collisional breakup, immersion freezing, dry aerosol initialization, and adaptive time-stepping - Published in JOSS (2023)

Scientific Software
Updated 10 months ago

Triumvirate — Peer-reviewed • Rank 9.9 • Science 98%

Triumvirate: A Python/C++ package for three-point clustering measurements - Published in JOSS (2023)

Updated about 1 month ago

jaxDecomp: JAX Library for 3D Domain Decomposition and Parallel FFTs • Rank 12.0 • Science 92%

jaxDecomp: JAX Library for 3D Domain Decomposition and Parallel FFTs - Published in JOSS (2026)

Scientific Software
Updated 10 months ago

GPUE — Peer-reviewed • Rank 6.0 • Science 95%

GPUE: Graphics Processing Unit Gross--Pitaevskii Equation solver - Published in JOSS (2018)

Materials Science
Scientific Software · Peer-reviewed
Scientific Software
Updated 10 months ago

Disimpy — Peer-reviewed • Rank 7.5 • Science 93%

Disimpy: A massively parallel Monte Carlo simulator for generating diffusion-weighted MRI data in Python - Published in JOSS (2020)

Mathematics
Scientific Software · Peer-reviewed
Scientific Software
Updated 10 months ago

Open Source Optical Coherence Tomography Software — Peer-reviewed • Rank 5.9 • Science 93%

Open Source Optical Coherence Tomography Software - Published in JOSS (2020)

Updated 10 months ago

CUDA • Rank 21.1 • Science 77%

CUDA programming in Julia.

Scientific Software
Updated 10 months ago

sboxgates — Peer-reviewed • Rank 4.4 • Science 93%

sboxgates: A program for finding low gate count implementations of S-boxes - Published in JOSS (2021)

Earth and Environmental Sciences (40%)
Scientific Software · Peer-reviewed
Scientific Software
Updated 10 months ago

ACHR.cu — Peer-reviewed • Rank 1.6 • Science 93%

ACHR.cu: GPU-accelerated sampling of metabolic networks - Published in JOSS (2019)

Scientific Software · Peer-reviewed
Updated 10 months ago

torchao • Rank 25.9 • Science 64%

PyTorch native quantization and sparsity for training and inference

Updated 10 months ago

pycuda • Rank 23.1 • Science 64%

CUDA integration for Python, plus shiny features

Updated 10 months ago

pennylane-lightning • Rank 21.8 • Science 64%

The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.

Updated 10 months ago

flamegpu2 • Rank 8.2 • Science 77%

FLAME GPU 2 is a GPU accelerated agent based modelling framework for CUDA C++ and Python

Updated 10 months ago

octotiger • Rank 7.7 • Science 77%

Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees

Updated 10 months ago

aluminum • Rank 8.4 • Science 72%

High-performance, GPU-aware communication library

Updated 10 months ago

tutorial-multi-gpu • Rank 8.1 • Science 72%

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

Updated 9 months ago

iree-base-compiler • Rank 25.0 • Science 54%

A retargetable MLIR-based machine learning compiler and runtime toolkit.

Updated 10 months ago

cuvec • Rank 11.7 • Science 67%

Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory

Updated 10 months ago

cutlass • Rank 24.1 • Science 54%

CUDA Templates for Linear Algebra Subroutines

Updated 10 months ago

babelstream • Rank 10.7 • Science 67%

STREAM, for lots of devices written in many programming models

Updated 10 months ago

cuquantum • Rank 20.7 • Science 57%

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

Updated 10 months ago

tiny-cuda-nn • Rank 11.8 • Science 64%

Lightning fast C++/CUDA neural network framework

Updated 10 months ago

gemmkernels.jl • Rank 7.8 • Science 67%

Flexible and performant GEMM kernels in Julia

Updated 10 months ago

accelerated-scan • Rank 6.3 • Science 67%

Accelerated First Order Parallel Associative Scan

Updated 9 months ago

RadonKA • Rank 5.2 • Science 67%

A simple yet sufficiently fast (attenuated) Radon and backproject implementation using KernelAbstractions.jl. Runs on CPU, CUDA, ...

Updated 10 months ago

abmgpu • Rank 4.2 • Science 67%

Agent Based Model on GPU using CUDA 12.2.1 and OpenGL 4.5 (CUDA OpenGL interop) on Windows/Linux

Updated 10 months ago

arbor • Rank 17.2 • Science 54%

The Arbor multi-compartment neural network simulation library.

Updated 10 months ago

alpaka • Rank 11.1 • Science 59%

Abstraction Library for Parallel Kernel Acceleration :llama:

Updated 10 months ago

burn • Rank 33.9 • Science 36%

Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

Updated 10 months ago

celeritas • Rank 9.1 • Science 59%

Celeritas is a new Monte Carlo transport code designed to accelerate scientific discovery in high energy physics by improving detector simulation throughput and energy efficiency using GPUs.

Updated 10 months ago

nimpa • Rank 8.8 • Science 59%

NiftyPET: Neuro-Image Manipulation, Processing and Analysis

Updated 9 months ago

pika • Rank 8.3 • Science 59%

pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.

Updated 10 months ago

lc0 • Rank 12.7 • Science 54%

Open source neural network chess engine with GPU acceleration and broad hardware support.

Updated 10 months ago

necsim-rust • Rank 4.7 • Science 59%

Spatially explicit biodiversity simulations using a parallel library written in Rust

Updated 9 months ago

tmu • Rank 13.3 • Science 49%

Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features, drop clause, Type III Feedback, focused negative sampling, multi-task classifier, autoencoder, literal budget, and one-vs-one multi-class classifier. TMU is written in Python with wrappers for C and CUDA-based clause evaluation and updating.

Updated 10 months ago

nvgraph.sh • Rank 5.1 • Science 54%

CLI for nvGraph, which is a GPU-based graph analytics library written by NVIDIA, using CUDA.

Updated 10 months ago

torchpq • Rank 14.4 • Science 44%

Approximate nearest neighbor search with product quantization on GPU in pytorch and cuda

Updated 10 months ago

cunessie.jl • Rank 0.7 • Science 57%

CUDA-accelerated Nonlocal Electrostatics in Structured Solvents

Updated 10 months ago

matx • Rank 10.7 • Science 44%

An efficient C++17 GPU numerical computing library with Python-like syntax

Updated 10 months ago

icicle-core • Rank 18.0 • Science 36%

A hardware acceleration library for compute intensive cryptography :ice_cube:

Updated 10 months ago

quokka • Rank 7.8 • Science 46%

Two-moment AMR radiation hydrodynamics (with self-gravity, particles, and chemistry) on CPUs/GPUs for astrophysics

Updated 9 months ago

scikit-cuda • Rank 20.2 • Science 33%

Python interface to GPU-powered libraries

Updated 9 months ago

https://github.com/bytedance/flux • Rank 9.0 • Science 36%

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

Updated 9 months ago

https://github.com/openmm/nnpops • Rank 7.1 • Science 36%

High-performance operations for neural network potentials

Updated 9 months ago

thundersvm • Rank 17.5 • Science 23%

ThunderSVM: A Fast SVM Library on GPUs and CPUs

Updated 9 months ago

QPT • Rank 8.5 • Science 26%

[内测中]QPT - 致力于让开源项目更好通往互联网世界的Python to EXE工具(Python打包)。

Scientific Software
Updated 10 months ago

SPbLA — Peer-reviewed • Rank 3.9 • Science 26%

SPbLA: The Library of GPGPU-powered Sparse Boolean Linear Algebra Operations - Published in JOSS (2022)

Updated 10 months ago

pytorch-cuda-2.7.1 • Rank 0.7 • Science 26%

Clone of PyTorch: Tensors and Dynamic neural networks in Python and C++ with strong GPU acceleration.

Updated 9 months ago

https://github.com/bytedance/abq-llm • Rank 7.1 • Science 13%

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

Updated 9 months ago

https://github.com/dbraun/pytorchtop • Rank 5.1 • Science 13%

GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV

Updated 9 months ago

https://github.com/bencardoen/singularity_slurm_cuda • Rank 1.8 • Science 13%

Example on how to get started with Singularity and CUDA on a SLURM cluster

Updated 9 months ago

https://github.com/afd-illinois/kharma • Science 26%

Kokkos-based High-Accuracy Relativistic Magnetohydrodynamics with AMR

Updated 10 months ago

fast_frechet • Science 54%

Comparison of different (fast) discrete Fréchet distance implementations in C++ and CUDA.

Updated 10 months ago

air-traffic-distribution • Science 39%

A GPU-Accelerated Multi-Objective Genetic Algorithm for Air Traffic Management

Updated 10 months ago

python-intan • Science 44%

Tools and demos for working with EMG data from intan using python

Updated 8 months ago

https://github.com/demoriarty/doksparse • Science 13%

sparse DOK tensors on GPU, pytorch

Updated 10 months ago

gpugraphlayout • Science 67%

An experimental GPU accelerated implementation of ForceAtlas2

Updated 10 months ago

futhark • Science 72%

:boom::computer::boom: A data-parallel functional programming language

Artificial Intelligence and Machine Learning
Scientific Software
Updated 10 months ago

MRI-NUFFT — Peer-reviewed • Science 93%

MRI-NUFFT: Doing non-Cartesian MRI has never been easier - Published in JOSS (2025)

Mathematics (32%)
Scientific Software · Peer-reviewed
Updated 10 months ago

tiledmultigpuvisualization • Science 57%

Using tiled display systems for the interactive visualization of GPGPU computed data

Updated 10 months ago

plus • Science 54%

More versatile and extensible GPU-accelerated micromagnetic simulator

Updated 10 months ago

astro-accelerate • Science 67%

AstroAccelerate is a many-core accelerated software package for processing time-domain radio-astronomy data.

Updated 10 months ago

cuda-accelerated-visual-inertial-odometry-fusion • Science 44%

Harness the power of GPU acceleration for fusing visual odometry and IMU data with an advanced Unscented Kalman Filter (UKF) implementation. Developed in C++ and utilizing CUDA, cuBLAS, and cuSOLVER, this system offers unparalleled real-time performance in state and covariance estimation for robotics and autonomous system applications.

Updated 10 months ago

superterrainplus • Science 44%

SuperTerrain+: A real-time procedural 3D infinite terrain engine with geographical features and photorealistic rendering.

Updated 10 months ago

MaterialPointSolver • Science 67%

🧮 High-performance Material Point Method (MPM) Solver in Julia.

Updated 9 months ago

https://github.com/alexkranias/triton_vs_cuda • Science 13%

Building Triton and CUDA kernels side-by-side to create a cuBLAS-performant GEMM kernel.

Updated 10 months ago

dockerfiles • Science 18%

Compilation of Dockerfiles with automated builds enabled on the Docker Registry

Updated 9 months ago

https://github.com/dansarie/socracked • Science 23%

Performs key-recovery attacks on the SoDark family of algorithms.