Scientific Software
Updated 9 months ago

libCEED — Peer-reviewed • Rank 18.8 • Science 100%

libCEED: Fast algebra for high-order element-based discretizations - Published in JOSS (2021)

Economics (40%)
Scientific Software · Peer-reviewed
Scientific Software
Updated 9 months ago

mpi4jax — Peer-reviewed • Rank 14.6 • Science 100%

mpi4jax: Zero-copy MPI communication of JAX arrays - Published in JOSS (2021)

Scientific Software · Peer-reviewed
Scientific Software
Updated 9 months ago

torchquad — Peer-reviewed • Rank 14.3 • Science 100%

torchquad: Numerical Integration in Arbitrary Dimensions with PyTorch - Published in JOSS (2021)

Scientific Software
Updated 9 months ago

GeophysicalFlows.jl — Peer-reviewed • Rank 9.8 • Science 100%

GeophysicalFlows.jl: Solvers for geophysical fluid dynamics problems in periodic domains on CPUs & GPUs - Published in JOSS (2021)

Scientific Software
Updated 9 months ago

Oceananigans.jl — Peer-reviewed • Rank 17.4 • Science 85%

Oceananigans.jl: Fast and friendly geophysical fluid dynamics on GPUs - Published in JOSS (2020)

Scientific Software
Updated 9 months ago

Makie.jl — Peer-reviewed • Rank 23.8 • Science 77%

Makie.jl: Flexible high-performance data visualization for Julia - Published in JOSS (2021)

Scientific Software · Peer-reviewed
Updated 9 months ago

veros • Rank 14.5 • Science 85%

The versatile ocean simulator, in pure Python, powered by JAX.

Updated 9 months ago

CUDA • Rank 21.1 • Science 77%

CUDA programming in Julia.

Updated 9 months ago

umpire • Rank 12.9 • Science 85%

An application-focused API for memory management on NUMA & GPU architectures

Updated 9 months ago

t-elf • Rank 5.5 • Science 85%

Tensor Extraction of Latent Features (T-ELF). Within T-ELF's arsenal are non-negative matrix and tensor factorization solutions, equipped with automatic model determination (also known as the estimation of latent factors - rank) for accurate data modeling. Our software suite encompasses cutting-edge data pre-processing and post-processing modules.

Updated 9 months ago

scalene • Rank 26.0 • Science 64%

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Updated 9 months ago

terragpu • Rank 3.3 • Science 85%

Python library to process and classify remote sensing imagery by means of GPUs and ML.

Updated 9 months ago

pennylane-lightning • Rank 21.8 • Science 64%

The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.

Updated 9 months ago

flamegpu2 • Rank 8.2 • Science 77%

FLAME GPU 2 is a GPU accelerated agent based modelling framework for CUDA C++ and Python

Updated 9 months ago

habitat • Rank 5.5 • Science 77%

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

Updated 9 months ago

aluminum • Rank 8.4 • Science 72%

High-performance, GPU-aware communication library

Updated 9 months ago

tutorial-multi-gpu • Rank 8.1 • Science 72%

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

Updated 9 months ago

cuvec • Rank 11.7 • Science 67%

Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory

Updated 9 months ago

cutlass • Rank 24.1 • Science 54%

CUDA Templates for Linear Algebra Subroutines

Updated 9 months ago

babelstream • Rank 10.7 • Science 67%

STREAM, for lots of devices written in many programming models

Updated 9 months ago

oneAPI • Rank 12.9 • Science 64%

Julia support for the oneAPI programming toolkit.

Updated 9 months ago

tiny-cuda-nn • Rank 11.8 • Science 64%

Lightning fast C++/CUDA neural network framework

Updated 9 months ago

mlora-cli • Rank 11.3 • Science 64%

An Efficient "Factory" to Build Multiple LoRA Adapters

Updated 9 months ago

gemmkernels.jl • Rank 7.8 • Science 67%

Flexible and performant GEMM kernels in Julia

Updated 9 months ago

mosec • Rank 20.4 • Science 54%

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Updated 9 months ago

devito • Rank 19.3 • Science 54%

DSL and compiler framework for automated finite-differences and stencil computation

Updated 9 months ago

RadonKA • Rank 5.2 • Science 67%

A simple yet sufficiently fast (attenuated) Radon and backproject implementation using KernelAbstractions.jl. Runs on CPU, CUDA, ...

Updated 9 months ago

arbor • Rank 17.2 • Science 54%

The Arbor multi-compartment neural network simulation library.

Updated 9 months ago

pyhpc-benchmarks • Rank 6.9 • Science 64%

A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python :rocket:

Updated 9 months ago

nx • Rank 26.9 • Science 44%

Multi-dimensional arrays (tensors) and numerical definitions for Elixir

Updated 9 months ago

k-wave-python • Rank 15.6 • Science 54%

A Python interface to k-Wave GPU accelerated binaries

Updated 9 months ago

pytorch-benchmark • Rank 11.9 • Science 57%

Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption

Updated 9 months ago

lc0 • Rank 12.7 • Science 54%

Open source neural network chess engine with GPU acceleration and broad hardware support.

Updated 9 months ago

DeconvOptim • Rank 7.5 • Science 57%

A multi-dimensional, high performance deconvolution framework written in Julia Lang for CPUs and GPUs.

Updated 9 months ago

pytriton • Rank 20.0 • Science 44%

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Updated 9 months ago

exponentialutilities.jl • Rank 8.4 • Science 54%

Fast and differentiable implementations of matrix exponentials, Krylov exponential matrix-vector multiplications ("expmv"), KIOPS, ExpoKit functions, and more. All your exponential needs in SciML form.

Updated 9 months ago

nvgraph.sh • Rank 5.1 • Science 54%

CLI for nvGraph, which is a GPU-based graph analytics library written by NVIDIA, using CUDA.

Updated 9 months ago

triton-model-navigator • Rank 14.9 • Science 44%

Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.

Updated 9 months ago

lpa-xrd • Rank 4.9 • Science 54%

Line profile analysis X-ray diffraction simulator.

Updated 9 months ago

beaver • Rank 14.7 • Science 44%

MLIR Toolkit in Elixir and Zig.

Updated 9 months ago

matx • Rank 10.7 • Science 44%

An efficient C++17 GPU numerical computing library with Python-like syntax

Updated 9 months ago

opencl-benchmark • Rank 5.5 • Science 44%

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Updated 9 months ago

text2vec-service • Rank 2.3 • Science 44%

Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。

Updated 9 months ago

wakis • Science 67%

3D electromagnetic time-domain solver, specialized in wake potential and beam-coupling impedance computation for particle accelerators

Updated 9 months ago

phoebe • Science 57%

A high-performance framework for solving phonon and electron Boltzmann equations

Updated 9 months ago

powerfit-em • Science 67%

Rigid body fitting of atomic strucures in cryo-electron microscopy density maps

Updated 9 months ago

cytnx • Science 54%

Project Cytnx, A Cross-section of Python & C++,Tensor network library

Updated 9 months ago

neon • Science 67%

WIP Prototype of a modern CFD core

Updated 9 months ago

hybridbackend • Science 54%

A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster

Updated 9 months ago

futhark • Science 72%

:boom::computer::boom: A data-parallel functional programming language

Artificial Intelligence and Machine Learning
Updated 9 months ago

cudawrappers • Science 54%

C++ wrapper for the Nvidia/HIP C libraries (e.g. CUDA driver, nvrtc, hiprtc, cuFFT, hipFFT, etc.)

Updated 9 months ago

MHDFlows • Science 67%

Three Dimensional Magnetohydrodynamic(MHD) pseudospectral solvers written in julia with FourierFlows.jl

Updated 9 months ago

bundoora • Science 44%

Customized development container environment for consistent and efficient execution of machine learning projects.

Updated 9 months ago

datoviz • Science 54%

⚡ Datoviz: high-performance GPU rendering for scientific data visualization

Updated 9 months ago

micm • Science 85%

A model-independent chemistry module for atmosphere models

Updated 9 months ago

mcmlgpu • Science 44%

This repository contains the base code for Monte Carlo simulations in a GPU of light transport on turbid media in GPU.

Updated 9 months ago

vector-sum-cuda • Science 54%

Comparing performance of sequential vs CUDA-based vector element sum.

Updated 9 months ago

superterrainplus • Science 44%

SuperTerrain+: A real-time procedural 3D infinite terrain engine with geographical features and photorealistic rendering.

Updated 9 months ago

interopunitycuda • Science 67%

Demonstrate interoperability between Unity Engine and CUDA

Updated 9 months ago

pagerank-cuda-dynamic • Science 54%

Design of CUDA-based Parallel Dynamic PageRank algorithm for measuring importance.

Updated 9 months ago

intelliperf • Science 67%

Automated bottleneck detection and solution orchestration

Updated 9 months ago

transformers-bart-pretrain • Science 54%

Script to pre-train hugginface transformers BART with Tensorflow 2

Updated 9 months ago

sycl-bench • Science 57%

SYCL Benchmark Suite

Updated 9 months ago

kernel_launcher • Science 44%

Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner

Updated 9 months ago

vkcompviz • Science 44%

Vulkan image and data processing framework capable of running a cascade of compute shaders and displaying or storing the result.

Updated 9 months ago

server • Science 54%

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Updated 9 months ago

Argos • Science 54%

Reduced-space optimization, for optimal power flow.

Updated 9 months ago

astro-accelerate • Science 67%

AstroAccelerate is a many-core accelerated software package for processing time-domain radio-astronomy data.