Scientific Software
Updated 7 months ago

gym-electric-motor (GEM) — Peer-reviewed • Rank 14.2 • Science 98%

gym-electric-motor (GEM): A Python toolbox for the simulation of electric drive systems - Published in JOSS (2021)

Scientific Software
Updated 7 months ago

DeepBench — Peer-reviewed • Rank 8.6 • Science 95%

DeepBench: A simulation package for physical benchmarking data - Published in JOSS (2025)

Mathematics
Scientific Software
Updated 7 months ago

ctbench - compile-time benchmarking and analysis — Peer-reviewed • Rank 4.9 • Science 95%

ctbench - compile-time benchmarking and analysis - Published in JOSS (2023)

Updated 7 months ago

fastcrypto • Rank 22.3 • Science 67%

Common cryptographic library used in software at Mysten Labs.

Updated 7 months ago

yaib • Rank 10.1 • Science 77%

🧪 Yet Another ICU Benchmark: a holistic framework for the standardization of clinical prediction model experiments. Provide custom datasets, cohorts, prediction tasks, endpoints, preprocessing, and models. Paper: https://arxiv.org/abs/2306.05109

Updated 7 months ago

asreview-insights • Rank 12.3 • Science 67%

Tools such as plots and metrics to analyze (simulated) reviews for ASReview LAB

Updated 7 months ago

tiny_qa_benchmark_pp • Rank 2.1 • Science 77%

Tiny QA Benchmark++: a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultra-fast LLM smoke-testing and regression-catching.

Updated 7 months ago

proteinworkshop • Rank 11.6 • Science 67%

Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models and training/task utilities. (ICLR 2024)

Updated 7 months ago

SciMLBenchmarks • Rank 11.5 • Science 67%

Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, JAX), MATLAB, and R.

Updated 7 months ago

seb • Rank 11.3 • Science 67%

A Scandinavian Benchmark for sentence embeddings

Updated 7 months ago

babelstream • Rank 10.7 • Science 67%

STREAM, for lots of devices written in many programming models

Updated 7 months ago

benchexec • Rank 18.5 • Science 59%

BenchExec: A Framework for Reliable Benchmarking and Resource Measurement

Updated 7 months ago

amlb • Rank 13.2 • Science 64%

OpenML AutoML Benchmarking Framework

Updated 7 months ago

BenchmarkPlots • Rank 20.1 • Science 54%

A benchmarking framework for the Julia language

Updated 7 months ago

compression_benchmark • Rank 4.8 • Science 67%

Benchmarking FASTQ compression with 'mature' compression algorithms

Updated 7 months ago

benchmarks-acoustic-propagation • Rank 3.0 • Science 67%

Coupled model development for acoustic propagation through multilayer systems for particle-velocity sensors

Updated 7 months ago

pytorch-benchmark • Rank 11.9 • Science 57%

Easily benchmark PyTorch model FLOPs, latency, throughput, allocated GPU memory, and energy consumption

Updated 7 months ago

hyperfine • Rank 14.9 • Science 54%

A command-line benchmarking tool

Updated 7 months ago

logpai • Rank 18.1 • Science 49%

A machine learning toolkit for log parsing [ICSE'19, DSN'16]

Updated 7 months ago

py-torchbenchmark • Rank 12.5 • Science 54%

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Updated 7 months ago

benchmarl • Rank 8.2 • Science 54%

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). It lets users quickly compare different MARL algorithms, tasks, and models while staying grounded in its two core tenets: reproducibility and standardization.

Updated 7 months ago

h5bench • Rank 7.9 • Science 54%

A benchmark suite for measuring HDF5 performance.

Updated 7 months ago

qcd • Rank 4.0 • Science 54%

Quantum Circuit Designer: A gymnasium-based set of environments for benchmarking reinforcement learning for quantum circuit design.

Updated 7 months ago

muld • Rank 3.8 • Science 54%

The Multitask Long Document Benchmark

Updated 7 months ago

aptv2 • Rank 3.3 • Science 54%

The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K

Updated 7 months ago

jreferral • Rank 3.2 • Science 54%

An open-source tool that recommends the most energy-efficient JVM configuration for Java software

Updated 7 months ago

birdset • Rank 7.0 • Science 46%

A benchmark dataset collection for bird sound classification

Updated 7 months ago

are-we-fast-yet • Rank 8.2 • Science 44%

Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays

Updated 7 months ago

opencl-benchmark • Rank 5.5 • Science 44%

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Updated 7 months ago

https://github.com/beuth-erdelt/benchmark-experiment-host-manager • Rank 9.9 • Science 39%

This Python tool helps manage DBMS benchmarking experiments in a Kubernetes-based HPC cluster environment. It lets users configure hardware/software setups so that tests can be easily repeated across varying configurations.

Updated 7 months ago

polybench • Rank 7.5 • Science 36%

Multivariate polynomial arithmetic benchmark tests.

Updated 7 months ago

leakdb • Rank 5.2 • Science 36%

LeakDB (Leakage Diagnosis Benchmark) is a realistic leakage dataset for water distribution networks. The dataset comprises a large number of artificially created but realistic leakage scenarios on different water distribution networks under varying conditions. A MATLAB scoring algorithm is provided to evaluate the results of different algorithms.

Updated 7 months ago

hdnom • Rank 14.5 • Science 26%

🔮 Benchmarking and visualization toolkit for penalized Cox models

Updated 7 months ago

https://github.com/brucewlee/h-test • Rank 0.7 • Science 33%

[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language

Updated 7 months ago

https://github.com/cdjellen/otbench • Rank 5.0 • Science 26%

Effective Benchmarks for Optical Turbulence Modeling

Updated 7 months ago

sceneflow_from_blender • Rank 2.8 • Science 28%

Get 3D motion vectors / scene flow directly from Blender

Updated 7 months ago

spring_utils • Rank 1.4 • Science 28%

Additional utility code for the Spring dataset and benchmark

Updated 7 months ago

https://github.com/aim-uofa/geobench • Rank 4.8 • Science 23%

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

Updated 7 months ago

https://github.com/citiususc/blinkg • Rank 0.7 • Science 26%

BLINKG: Benchmark for LLM-Integrated Knowledge Graph Generation

Updated 7 months ago

https://github.com/ai-forever/ruscode • Rank 2.1 • Science 23%

Official repository for RusCode benchmark dataset (NAACL 2025)

Updated 7 months ago

https://github.com/jurgisp/memory-maze • Rank 12.3 • Science 10%

Evaluating long-term memory of reinforcement learning algorithms

Updated 7 months ago

https://github.com/crate/tsperf • Rank 4.4 • Science 13%

TSPERF Time Series Database Benchmark Suite. Framework for evaluating and comparing the performance of time series databases, in the spirit of TimescaleDB's TSBS.

Updated 7 months ago

https://github.com/yegor256/plum • Rank 3.8 • Science 13%

Programming language ultimate metrics (PLUM) collected automatically from GitHub, Google Scholar, Twitter, etc.

Updated 7 months ago

pglib-opf • Rank 5.9 • Science 10%

Benchmarks for the Optimal Power Flow Problem

Updated 7 months ago

cellbench • Rank 5.6 • Science 10%

R package for benchmarking single cell analysis methods

Updated 7 months ago

https://github.com/aliireza/ddio-bench • Rank 5.2 • Science 10%

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

Updated 7 months ago

champkit • Science 41%

Benchmarking toolkit for patch-based histopathology image classification.

Updated 7 months ago

benchmarks • Science 44%

A collection of micro benchmarks.

Updated 7 months ago

agentdojo • Science 36%

A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.

Updated 7 months ago

molscore • Science 49%

An automated scoring function to facilitate and standardize the evaluation of goal-directed generative models for de novo molecular design

Updated 7 months ago

tsb-uad • Science 26%

An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection

Updated 7 months ago

opfgym • Science 57%

A gymnasium-compatible framework to create reinforcement learning (RL) environments for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.

Updated 7 months ago

dpl • Science 44%

[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.

Updated 7 months ago

benchpark • Science 65%

An open collaborative repository for reproducible specifications of HPC benchmarks and cross-site benchmarking environments

Updated 7 months ago

maskedfacerepresentation • Science 39%

Masked face recognition focuses on identifying people using their facial features while they are wearing masks. We introduce benchmarks on face verification based on masked face images for the development of COVID-safe protocols in airports.

Updated 7 months ago

neteasecrowd-dataset • Science 54%

NetEaseCrowd dataset, a collection of data obtained from the You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.

Updated 7 months ago

symbolic-governed-mistral-artifact • Science 44%

Tier-10 sealed governance artifact for Mistral-7B with exact-match benchmarks and symbolic verifier.

Updated 7 months ago

ibp-sop-benchmarks-milp-cellphoneco • Science 26%

Benchmark instances modelling the supply chain of a fictitious company producing and selling cell phones and accessories of different types. The instances formulate typical mixed-integer linear optimization problems in the standard MPS format.

Updated 7 months ago

ember • Science 67%

Code and data for the paper "Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction" (IJCAI 2022)

Updated 7 months ago

krown • Science 49%

KROWN 👑: A Benchmark for RDF Graph Materialization

Updated 7 months ago

https://github.com/grrvlr/tsmd • Science 26%

The TSMD project brings together Motif Discovery methods for Time Series, aiming to compare their performance through well-defined research questions and to simplify their practical use. It provides both guidelines for selecting the most suitable methods based on the data, and accessible implementations of the most relevant approaches.

Updated 7 months ago

https://github.com/hyperledger-caliper/caliper-benchmarks • Science 13%

Sample benchmark files for Hyperledger Caliper https://wiki.hyperledger.org/display/caliper

Updated 7 months ago

https://github.com/axect/scientific_bench • Science 26%

Benchmark some scientific computations for various languages & libraries

Updated 7 months ago

small-object-detection-benchmark • Science 67%

ICIP 2022 paper: SAHI benchmark on VisDrone and xView datasets using FCOS, VFNet, and TOOD detectors

Updated 7 months ago

fast_frechet • Science 54%

Comparison of different (fast) discrete Fréchet distance implementations in C++ and CUDA.