Scientific Software
Updated 7 months ago

gym-electric-motor (GEM) — Peer-reviewed • Rank 14.2 • Science 98%

gym-electric-motor (GEM): A Python toolbox for the simulation of electric drive systems - Published in JOSS (2021)

Scientific Software
Updated 7 months ago

DeepBench — Peer-reviewed • Rank 8.6 • Science 95%

DeepBench: A simulation package for physical benchmarking data - Published in JOSS (2025)

Mathematics
Scientific Software · Peer-reviewed
Scientific Software
Updated 7 months ago

ctbench - compile-time benchmarking and analysis — Peer-reviewed • Rank 4.9 • Science 95%

ctbench - compile-time benchmarking and analysis - Published in JOSS (2023)

Updated 7 months ago

fastcrypto • Rank 22.3 • Science 67%

Common cryptographic library used in software at Mysten Labs.

Updated 7 months ago

yaib • Rank 10.1 • Science 77%

🧪Yet Another ICU Benchmark: a holistic framework for the standardization of clinical prediction model experiments. Provide custom datasets, cohorts, prediction tasks, endpoints, preprocessing, and models. Paper: https://arxiv.org/abs/2306.05109

Updated 7 months ago

asreview-insights • Rank 12.3 • Science 67%

Tools such as plots and metrics to analyze (simulated) reviews for ASReview LAB

Updated 7 months ago

tiny_qa_benchmark_pp • Rank 2.1 • Science 77%

Tiny QA Benchmark++ a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultra-fast LLM smoke-testing & regression-catching.

Updated 7 months ago

proteinworkshop • Rank 11.6 • Science 67%

Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models and training/task utilities. (ICLR 2024)

Updated 7 months ago

SciMLBenchmarks • Rank 11.5 • Science 67%

Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, Jax), MATLAB, R

Updated 7 months ago

seb • Rank 11.3 • Science 67%

A Scandinavian Benchmark for sentence embeddings

Updated 7 months ago

babelstream • Rank 10.7 • Science 67%

STREAM, for lots of devices written in many programming models

Updated 7 months ago

benchexec • Rank 18.5 • Science 59%

BenchExec: A Framework for Reliable Benchmarking and Resource Measurement

Updated 7 months ago

amlb • Rank 13.2 • Science 64%

OpenML AutoML Benchmarking Framework

Updated 7 months ago

BenchmarkPlots • Rank 20.1 • Science 54%

A benchmarking framework for the Julia language

Updated 7 months ago

compression_benchmark • Rank 4.8 • Science 67%

Benchmarking FASTQ compression with 'mature' compression algorithms

Updated 7 months ago

benchmarks-acoustic-propagation • Rank 3.0 • Science 67%

Coupled model development for acoustic propagation through multilayer systems for particle-velocity sensors

Updated 7 months ago

pytorch-benchmark • Rank 11.9 • Science 57%

Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption

Updated 7 months ago

hyperfine • Rank 14.9 • Science 54%

A command-line benchmarking tool

Updated 7 months ago

logpai • Rank 18.1 • Science 49%

A machine learning toolkit for log parsing [ICSE'19, DSN'16]

Updated 7 months ago

py-torchbenchmark • Rank 12.5 • Science 54%

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Updated 7 months ago

benchmarl • Rank 8.2 • Science 54%

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.

Updated 7 months ago

h5bench • Rank 7.9 • Science 54%

A benchmark suite for measuring HDF5 performance.

Updated 7 months ago

qcd • Rank 4.0 • Science 54%

Quantum Circuit Designer: A gymnasium-based set of environments for benchmarking reinforcement learning for quantum circuit design.

Updated 7 months ago

muld • Rank 3.8 • Science 54%

The Multitask Long Document Benchmark

Updated 7 months ago

aptv2 • Rank 3.3 • Science 54%

The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K

Updated 7 months ago

jreferral • Rank 3.2 • Science 54%

An open-source tool that recommends the most energy efficient JVM configuration for java software

Updated 7 months ago

birdset • Rank 7.0 • Science 46%

A benchmark dataset collection for bird sound classification

Updated 7 months ago

are-we-fast-yet • Rank 8.2 • Science 44%

Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays

Updated 7 months ago

opencl-benchmark • Rank 5.5 • Science 44%

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Updated 7 months ago

https://github.com/beuth-erdelt/benchmark-experiment-host-manager • Rank 9.9 • Science 39%

This python tool helps managing DBMS benchmarking experiments in a Kubernetes-based HPC cluster environment. It enables users to configure hardware / software setups for easily repeating tests over varying configurations.

Updated 7 months ago

polybench • Rank 7.5 • Science 36%

Multivariate polynomial arithmetic benchmark tests.

Updated 7 months ago

leakdb • Rank 5.2 • Science 36%

LeakDB (Leakage Diagnosis Benchmark) is a realistic leakage dataset for water distribution networks. The dataset is comprised of a large number of artificially created but realistic leakage scenarios, on different water distribution networks, under varying conditions. A scoring algorithm in MATLAB code is provided to evaluate the results of different algorithms.

Updated 7 months ago

hdnom • Rank 14.5 • Science 26%

🔮 Benchmarking and visualization toolkit for penalized Cox models

Updated 7 months ago

https://github.com/brucewlee/h-test • Rank 0.7 • Science 33%

[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language

Updated 7 months ago

https://github.com/cdjellen/otbench • Rank 5.0 • Science 26%

Effective Benchmarks for Optical Turbulence Modeling

Updated 7 months ago

sceneflow_from_blender • Rank 2.8 • Science 28%

Get 3D motion vectors / scene flow directly from Blender

Updated 7 months ago

spring_utils • Rank 1.4 • Science 28%

Additional utility code for the Spring dataset and benchmark

Updated 7 months ago

https://github.com/aim-uofa/geobench • Rank 4.8 • Science 23%

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

Updated 7 months ago

https://github.com/citiususc/blinkg • Rank 0.7 • Science 26%

BLINKG: Benchmark for LLM-Integrated Knowledge Graph Generation

Updated 7 months ago

https://github.com/ai-forever/ruscode • Rank 2.1 • Science 23%

Official repository for RusCode benchmark dataset (NAACL 2025)

Updated 7 months ago

https://github.com/jurgisp/memory-maze • Rank 12.3 • Science 10%

Evaluating long-term memory of reinforcement learning algorithms

Updated 7 months ago

https://github.com/crate/tsperf • Rank 4.4 • Science 13%

TSPERF Time Series Database Benchmark Suite. Framework for evaluating and comparing the performance of time series databases, in the spirit of TimescaleDB's TSBS.

Updated 7 months ago

https://github.com/yegor256/plum • Rank 3.8 • Science 13%

Programming language ultimate metrics (PLUM) collected automatically from GitHub, Google Scholar, Twitter, etc.

Updated 7 months ago

pglib-opf • Rank 5.9 • Science 10%

Benchmarks for the Optimal Power Flow Problem

Updated 7 months ago

cellbench • Rank 5.6 • Science 10%

R package for benchmarking single cell analysis methods

Updated 7 months ago

https://github.com/aliireza/ddio-bench • Rank 5.2 • Science 10%

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

Updated 7 months ago

https://github.com/hyperledger-caliper/caliper-benchmarks • Science 13%

Sample benchmark files for Hyperledger Caliper https://wiki.hyperledger.org/display/caliper

Updated 7 months ago

hyperspectral-soilmoisture-dataset • Science 77%

Hyperspectral and soil-moisture data from a field campaign based on a soil sample. Karlsruhe (Germany), 2017.

Updated 7 months ago

fast_frechet-python • Science 44%

Comparison of different (fast) discrete Fréchet distance implementations in Python.

Updated 7 months ago

https://github.com/axect/scientific_bench • Science 26%

Benchmark some scientific computations for various languages & libraries

Updated 7 months ago

benchmark-privesc-linux • Science 54%

A comprehensive local Linux Privilege-Escalation Benchmark

Updated 7 months ago

variantbenchmarking • Science 57%

Pipeline to evaluate and validate the accuracy of variant calling methods in genomic research

Updated 7 months ago

leaderboard • Science 44%

You can find the most recent KGQA benchmark numbers from publications here.

Updated 7 months ago

maskedfacerepresentation • Science 39%

Masked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.

Updated 7 months ago

dpl • Science 44%

[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.

Updated 7 months ago

benchmarks • Science 44%

A collection of micro benchmarks.

Updated 7 months ago

https://github.com/bethgelab/model-vs-human • Science 36%

Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)

Updated 7 months ago

hyphi-gym • Science 54%

A Gymnasium benchmark suite for evaluating the robustness and multi-task performance of reinforcement learning algorithms in various discrete and continuous environments.

Updated 7 months ago

posebench • Science 49%

Comprehensive benchmarking of protein-ligand structure prediction methods. (ICML 2024 AI4Science)

Updated 7 months ago

neteasecrowd-dataset • Science 54%

NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.

Updated 7 months ago

https://github.com/alan-turing-institute/tcpdbench • Science 23%

The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data

Updated 7 months ago

fingernat-ml • Science 49%

Data accompanying the manuscript on SIFts- and ML-based methods in Virtual Screening for RNA binding ligands.

Updated 7 months ago

stochastic-benchmark • Science 36%

Repository for Stochastic Optimization Solvers Benchmark code

Updated 7 months ago

small-object-detection-benchmark • Science 67%

icip2022 paper: sahi benchmark on visdrone and xview datasets using fcos, vfnet and tood detectors