Scientific Software
Updated 6 months ago

libCEED — Peer-reviewed • Rank 18.8 • Science 100%

libCEED: Fast algebra for high-order element-based discretizations - Published in JOSS (2021)

Economics (40%)
Scientific Software · Peer-reviewed
Scientific Software
Updated 6 months ago

New developments in PySDM and PySDM-examples v2 — Peer-reviewed • Rank 18.2 • Science 95%

New developments in PySDM and PySDM-examples v2: collisional breakup, immersion freezing, dry aerosol initialization, and adaptive time-stepping - Published in JOSS (2023)

Scientific Software
Updated 6 months ago

Triumvirate — Peer-reviewed • Rank 9.9 • Science 98%

Triumvirate: A Python/C++ package for three-point clustering measurements - Published in JOSS (2023)

Scientific Software
Updated 6 months ago

GPUE — Peer-reviewed • Rank 6.0 • Science 95%

GPUE: Graphics Processing Unit Gross--Pitaevskii Equation solver - Published in JOSS (2018)

Materials Science
Scientific Software · Peer-reviewed
Scientific Software
Updated 6 months ago

Disimpy — Peer-reviewed • Rank 7.5 • Science 93%

Disimpy: A massively parallel Monte Carlo simulator for generating diffusion-weighted MRI data in Python - Published in JOSS (2020)

Mathematics
Scientific Software · Peer-reviewed
Scientific Software
Updated 6 months ago

Open Source Optical Coherence Tomography Software — Peer-reviewed • Rank 5.9 • Science 93%

Open Source Optical Coherence Tomography Software - Published in JOSS (2020)

Updated 6 months ago

CUDA • Rank 21.1 • Science 77%

CUDA programming in Julia.

Scientific Software
Updated 6 months ago

sboxgates — Peer-reviewed • Rank 4.4 • Science 93%

sboxgates: A program for finding low gate count implementations of S-boxes - Published in JOSS (2021)

Earth and Environmental Sciences (40%)
Scientific Software · Peer-reviewed
Scientific Software
Updated 6 months ago

ACHR.cu — Peer-reviewed • Rank 1.6 • Science 93%

ACHR.cu: GPU-accelerated sampling of metabolic networks - Published in JOSS (2019)

Scientific Software · Peer-reviewed
Updated 6 months ago

torchao • Rank 25.9 • Science 64%

PyTorch native quantization and sparsity for training and inference

Updated 6 months ago

pennylane-lightning • Rank 21.8 • Science 64%

The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.

Updated 6 months ago

flamegpu2 • Rank 8.2 • Science 77%

FLAME GPU 2 is a GPU accelerated agent based modelling framework for CUDA C++ and Python

Updated 6 months ago

octotiger • Rank 7.7 • Science 77%

Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees

Updated 6 months ago

aluminum • Rank 8.4 • Science 72%

High-performance, GPU-aware communication library

Updated 6 months ago

tutorial-multi-gpu • Rank 8.1 • Science 72%

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

Updated 5 months ago

iree-base-compiler • Rank 25.0 • Science 54%

A retargetable MLIR-based machine learning compiler and runtime toolkit.

Updated 6 months ago

cuvec • Rank 11.7 • Science 67%

Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory

Updated 6 months ago

cutlass • Rank 24.1 • Science 54%

CUDA Templates for Linear Algebra Subroutines

Updated 6 months ago

babelstream • Rank 10.7 • Science 67%

STREAM, for lots of devices written in many programming models

Updated 6 months ago

cuquantum • Rank 20.7 • Science 57%

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

Updated 6 months ago

tiny-cuda-nn • Rank 11.8 • Science 64%

Lightning fast C++/CUDA neural network framework

Updated 6 months ago

gemmkernels.jl • Rank 7.8 • Science 67%

Flexible and performant GEMM kernels in Julia

Updated 6 months ago

accelerated-scan • Rank 6.3 • Science 67%

Accelerated First Order Parallel Associative Scan

Updated 5 months ago

RadonKA • Rank 5.2 • Science 67%

A simple yet sufficiently fast (attenuated) Radon and backproject implementation using KernelAbstractions.jl. Runs on CPU, CUDA, ...

Updated 6 months ago

abmgpu • Rank 4.2 • Science 67%

Agent Based Model on GPU using CUDA 12.2.1 and OpenGL 4.5 (CUDA OpenGL interop) on Windows/Linux

Updated 6 months ago

arbor • Rank 17.2 • Science 54%

The Arbor multi-compartment neural network simulation library.

Updated 6 months ago

alpaka • Rank 11.1 • Science 59%

Abstraction Library for Parallel Kernel Acceleration :llama:

Updated 6 months ago

burn • Rank 33.9 • Science 36%

Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

Updated 6 months ago

celeritas • Rank 9.1 • Science 59%

Celeritas is a new Monte Carlo transport code designed to accelerate scientific discovery in high energy physics by improving detector simulation throughput and energy efficiency using GPUs.

Updated 6 months ago

nimpa • Rank 8.8 • Science 59%

NiftyPET: Neuro-Image Manipulation, Processing and Analysis

Updated 5 months ago

pika • Rank 8.3 • Science 59%

pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support.

Updated 6 months ago

lc0 • Rank 12.7 • Science 54%

Open source neural network chess engine with GPU acceleration and broad hardware support.

Updated 6 months ago

necsim-rust • Rank 4.7 • Science 59%

Spatially explicit biodiversity simulations using a parallel library written in Rust

Updated 5 months ago

tmu • Rank 13.3 • Science 49%

Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features, drop clause, Type III Feedback, focused negative sampling, multi-task classifier, autoencoder, literal budget, and one-vs-one multi-class classifier. TMU is written in Python with wrappers for C and CUDA-based clause evaluation and updating.

Updated 6 months ago

nvgraph.sh • Rank 5.1 • Science 54%

CLI for nvGraph, which is a GPU-based graph analytics library written by NVIDIA, using CUDA.

Updated 6 months ago

torchpq • Rank 14.4 • Science 44%

Approximate nearest neighbor search with product quantization on GPU in pytorch and cuda

Updated 6 months ago

cunessie.jl • Rank 0.7 • Science 57%

CUDA-accelerated Nonlocal Electrostatics in Structured Solvents

Updated 6 months ago

matx • Rank 10.7 • Science 44%

An efficient C++17 GPU numerical computing library with Python-like syntax

Updated 6 months ago

icicle-core • Rank 18.0 • Science 36%

A hardware acceleration library for compute intensive cryptography :ice_cube:

Updated 6 months ago

quokka • Rank 7.8 • Science 46%

Two-moment AMR radiation hydrodynamics (with self-gravity, particles, and chemistry) on CPUs/GPUs for astrophysics

Updated 5 months ago

scikit-cuda • Rank 20.2 • Science 33%

Python interface to GPU-powered libraries

Updated 5 months ago

https://github.com/bytedance/flux • Rank 9.0 • Science 36%

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

Updated 5 months ago

https://github.com/openmm/nnpops • Rank 7.1 • Science 36%

High-performance operations for neural network potentials

Updated 5 months ago

thundersvm • Rank 17.5 • Science 23%

ThunderSVM: A Fast SVM Library on GPUs and CPUs

Updated 5 months ago

QPT • Rank 8.5 • Science 26%

[内测中]QPT - 致力于让开源项目更好通往互联网世界的Python to EXE工具(Python打包)。

Scientific Software
Updated 6 months ago

SPbLA — Peer-reviewed • Rank 3.9 • Science 26%

SPbLA: The Library of GPGPU-powered Sparse Boolean Linear Algebra Operations - Published in JOSS (2022)

Updated 6 months ago

pytorch-cuda-2.7.1 • Rank 0.7 • Science 26%

Clone of PyTorch: Tensors and Dynamic neural networks in Python and C++ with strong GPU acceleration.

Updated 5 months ago

https://github.com/bytedance/abq-llm • Rank 7.1 • Science 13%

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

Updated 5 months ago

https://github.com/dbraun/pytorchtop • Rank 5.1 • Science 13%

GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV

Updated 5 months ago

https://github.com/bencardoen/singularity_slurm_cuda • Rank 1.8 • Science 13%

Example on how to get started with Singularity and CUDA on a SLURM cluster

Updated 6 months ago

kernel_launcher • Science 44%

Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner

Updated 6 months ago

micm • Science 85%

A model-independent chemistry module for atmosphere models

Updated 6 months ago

stnls • Science 54%

Space-Time Attention with a Shifted Non-Local Search

Updated 6 months ago

plus • Science 54%

More versatile and extensible GPU-accelerated micromagnetic simulator

Updated 6 months ago

cuda-accelerated-visual-inertial-odometry-fusion • Science 44%

Harness the power of GPU acceleration for fusing visual odometry and IMU data with an advanced Unscented Kalman Filter (UKF) implementation. Developed in C++ and utilizing CUDA, cuBLAS, and cuSOLVER, this system offers unparalleled real-time performance in state and covariance estimation for robotics and autonomous system applications.

Updated 6 months ago

kmm • Science 26%

KMM: parallel dataflow scheduler and efficient memory management for multi-GPU platforms

Updated 5 months ago

https://github.com/beehive-lab/tornadovm • Science 67%

TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

Updated 6 months ago

qudaz • Science 67%

Qompass AI Cuda library for Zig

Updated 6 months ago

qc-cugbasis • Science 39%

High performance CUDA/Python library for computing quantum chemistry density-based descriptors for larger systems using GPUs.

Updated 6 months ago

interopunitycuda • Science 67%

Demonstrate interoperability between Unity Engine and CUDA

Updated 6 months ago

kernel_float • Science 26%

CUDA/HIP header-only library for low-precision (16 bit, 8 bit) and vectorized GPU kernel development

Updated 6 months ago

simulateqcd • Science 49%

SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it easy for physicists to implement lattice QCD formulas while still providing competitive performance.

Updated 6 months ago

cuda • Science 54%

Qompass AI on CUDA

Updated 6 months ago

thereminq-classiq • Science 44%

ThereminQ CLassiQ - QuantOPS : Orchestrate Qrack, Bonsai, Qimcifa and Tipsy in OpenCL, VCL and CUDA with an X WebUI

Updated 5 months ago

https://github.com/dansarie/socracked • Science 23%

Performs key-recovery attacks on the SoDark family of algorithms.

Updated 6 months ago

python-intan • Science 44%

Tools and demos for working with EMG data from intan using python

Updated 6 months ago

jaxngp • Science 44%

JAX implementation of instant-ngp (NeRF part)

Engineering (40%) Earth and Environmental Sciences (40%)