Scientific Software
Updated 7 months ago

gym-electric-motor (GEM) — Peer-reviewed • Rank 14.2 • Science 98%

gym-electric-motor (GEM): A Python toolbox for the simulation of electric drive systems - Published in JOSS (2021)

Scientific Software
Updated 7 months ago

DeepBench — Peer-reviewed • Rank 8.6 • Science 95%

DeepBench: A simulation package for physical benchmarking data - Published in JOSS (2025)

Mathematics
Scientific Software · Peer-reviewed
Scientific Software
Updated 7 months ago

ctbench - compile-time benchmarking and analysis — Peer-reviewed • Rank 4.9 • Science 95%

ctbench - compile-time benchmarking and analysis - Published in JOSS (2023)

Updated 7 months ago

fastcrypto • Rank 22.3 • Science 67%

Common cryptographic library used in software at Mysten Labs.

Updated 7 months ago

yaib • Rank 10.1 • Science 77%

🧪Yet Another ICU Benchmark: a holistic framework for the standardization of clinical prediction model experiments. Provide custom datasets, cohorts, prediction tasks, endpoints, preprocessing, and models. Paper: https://arxiv.org/abs/2306.05109

Updated 7 months ago

asreview-insights • Rank 12.3 • Science 67%

Tools such as plots and metrics to analyze (simulated) reviews for ASReview LAB

Updated 7 months ago

tiny_qa_benchmark_pp • Rank 2.1 • Science 77%

Tiny QA Benchmark++ a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultra-fast LLM smoke-testing & regression-catching.

Updated 7 months ago

proteinworkshop • Rank 11.6 • Science 67%

Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models and training/task utilities. (ICLR 2024)

Updated 7 months ago

SciMLBenchmarks • Rank 11.5 • Science 67%

Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, Jax), MATLAB, R

Updated 7 months ago

seb • Rank 11.3 • Science 67%

A Scandinavian Benchmark for sentence embeddings

Updated 7 months ago

babelstream • Rank 10.7 • Science 67%

STREAM, for lots of devices written in many programming models

Updated 7 months ago

benchexec • Rank 18.5 • Science 59%

BenchExec: A Framework for Reliable Benchmarking and Resource Measurement

Updated 7 months ago

amlb • Rank 13.2 • Science 64%

OpenML AutoML Benchmarking Framework

Updated 7 months ago

BenchmarkPlots • Rank 20.1 • Science 54%

A benchmarking framework for the Julia language

Updated 7 months ago

compression_benchmark • Rank 4.8 • Science 67%

Benchmarking FASTQ compression with 'mature' compression algorithms

Updated 7 months ago

benchmarks-acoustic-propagation • Rank 3.0 • Science 67%

Coupled model development for acoustic propagation through multilayer systems for particle-velocity sensors

Updated 7 months ago

pytorch-benchmark • Rank 11.9 • Science 57%

Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption

Updated 7 months ago

hyperfine • Rank 14.9 • Science 54%

A command-line benchmarking tool

Updated 7 months ago

logpai • Rank 18.1 • Science 49%

A machine learning toolkit for log parsing [ICSE'19, DSN'16]

Updated 7 months ago

py-torchbenchmark • Rank 12.5 • Science 54%

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Updated 7 months ago

benchmarl • Rank 8.2 • Science 54%

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.

Updated 7 months ago

h5bench • Rank 7.9 • Science 54%

A benchmark suite for measuring HDF5 performance.

Updated 7 months ago

qcd • Rank 4.0 • Science 54%

Quantum Circuit Designer: A gymnasium-based set of environments for benchmarking reinforcement learning for quantum circuit design.

Updated 7 months ago

muld • Rank 3.8 • Science 54%

The Multitask Long Document Benchmark

Updated 7 months ago

aptv2 • Rank 3.3 • Science 54%

The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K

Updated 7 months ago

jreferral • Rank 3.2 • Science 54%

An open-source tool that recommends the most energy efficient JVM configuration for java software

Updated 7 months ago

birdset • Rank 7.0 • Science 46%

A benchmark dataset collection for bird sound classification

Updated 7 months ago

are-we-fast-yet • Rank 8.2 • Science 44%

Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays

Updated 7 months ago

opencl-benchmark • Rank 5.5 • Science 44%

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Updated 7 months ago

https://github.com/beuth-erdelt/benchmark-experiment-host-manager • Rank 9.9 • Science 39%

This python tool helps managing DBMS benchmarking experiments in a Kubernetes-based HPC cluster environment. It enables users to configure hardware / software setups for easily repeating tests over varying configurations.

Updated 7 months ago

polybench • Rank 7.5 • Science 36%

Multivariate polynomial arithmetic benchmark tests.

Updated 7 months ago

leakdb • Rank 5.2 • Science 36%

LeakDB (Leakage Diagnosis Benchmark) is a realistic leakage dataset for water distribution networks. The dataset is comprised of a large number of artificially created but realistic leakage scenarios, on different water distribution networks, under varying conditions. A scoring algorithm in MATLAB code is provided to evaluate the results of different algorithms.

Updated 7 months ago

hdnom • Rank 14.5 • Science 26%

🔮 Benchmarking and visualization toolkit for penalized Cox models

Updated 7 months ago

https://github.com/brucewlee/h-test • Rank 0.7 • Science 33%

[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language

Updated 7 months ago

https://github.com/cdjellen/otbench • Rank 5.0 • Science 26%

Effective Benchmarks for Optical Turbulence Modeling

Updated 7 months ago

sceneflow_from_blender • Rank 2.8 • Science 28%

Get 3D motion vectors / scene flow directly from Blender

Updated 7 months ago

spring_utils • Rank 1.4 • Science 28%

Additional utility code for the Spring dataset and benchmark

Updated 7 months ago

https://github.com/aim-uofa/geobench • Rank 4.8 • Science 23%

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

Updated 7 months ago

https://github.com/citiususc/blinkg • Rank 0.7 • Science 26%

BLINKG: Benchmark for LLM-Integrated Knowledge Graph Generation

Updated 7 months ago

https://github.com/ai-forever/ruscode • Rank 2.1 • Science 23%

Official repository for RusCode benchmark dataset (NAACL 2025)

Updated 7 months ago

https://github.com/jurgisp/memory-maze • Rank 12.3 • Science 10%

Evaluating long-term memory of reinforcement learning algorithms

Updated 7 months ago

https://github.com/crate/tsperf • Rank 4.4 • Science 13%

TSPERF Time Series Database Benchmark Suite. Framework for evaluating and comparing the performance of time series databases, in the spirit of TimescaleDB's TSBS.

Updated 7 months ago

https://github.com/yegor256/plum • Rank 3.8 • Science 13%

Programming language ultimate metrics (PLUM) collected automatically from GitHub, Google Scholar, Twitter, etc.

Updated 7 months ago

pglib-opf • Rank 5.9 • Science 10%

Benchmarks for the Optimal Power Flow Problem

Updated 7 months ago

cellbench • Rank 5.6 • Science 10%

R package for benchmarking single cell analysis methods

Updated 7 months ago

https://github.com/aliireza/ddio-bench • Rank 5.2 • Science 10%

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

Updated 7 months ago

imdd-task • Science 57%

Short-reach Optical Communication: A Real-world Task for Neuromorphic Hardware

Updated 7 months ago

molscore • Science 49%

An automated scoring function to facilitate and standardize the evaluation of goal-directed generative models for de novo molecular design

Updated 7 months ago

cbombench • Science 67%

A testbed for benchmarking Cryptographic Bills of Materials (CBOMs).

Updated 7 months ago

benchscofi • Science 67%

Package which contains implementations of published collaborative filtering-based algorithms for drug repurposing.

Updated 7 months ago

supergleber • Science 39%

German Language Understanding Evaluation Benchmark @NAACL24

Updated 7 months ago

https://github.com/becksteinlab/parallel-analysis-in-the-mdanalysis-library • Science 26%

Benchmarking MDAnalysis with Dask (and MPI). Supplementary Information for SciPy 2017 paper.

Updated 7 months ago

variantbenchmarking • Science 57%

Pipeline to evaluate and validate the accuracy of variant calling methods in genomic research

Updated 7 months ago

evalplus • Science 54%

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Updated 7 months ago

small-object-detection-benchmark • Science 67%

icip2022 paper: sahi benchmark on visdrone and xview datasets using fcos, vfnet and tood detectors

Updated 7 months ago

gocannon • Science 44%

:boom: Performance-focused HTTP load testing tool written in Go

Updated 7 months ago

https://github.com/compnet/signedbenchmark • Science 13%

Benchmark to study partitioning problems on signed graphs

Updated 7 months ago

https://github.com/birgitrijvers/copd-microbiome-shift-analysis • Science 13%

This is the repository for a bioinformatics project about identifying the optimal pipeline for identifying a potential microbiome shift in COPD patients after ceasing azithromycin treatment.

Updated 7 months ago

benchmark-privesc-linux • Science 54%

A comprehensive local Linux Privilege-Escalation Benchmark

Updated 7 months ago

https://github.com/ccao-data/report-model-benchmark • Science 13%

Benchmark of timing for CCAO models on different hardware

Updated 7 months ago

foundation-model-benchmarking-tool • Science 54%

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.

Updated 7 months ago

https://github.com/cosmaadrian/rocode • Science 10%

Official repository for "RoCode: A Dataset for Measuring Code Intelligence from Romanian Problem Definitions"

Updated 7 months ago

https://github.com/alan-turing-institute/tcpdbench • Science 23%

The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data

Updated 7 months ago

leaderboard • Science 44%

You can find the most recent KGQA benchmark numbers from publications here.

Updated 7 months ago

https://github.com/hyperledger-caliper/caliper-benchmarks • Science 13%

Sample benchmark files for Hyperledger Caliper https://wiki.hyperledger.org/display/caliper

Updated 7 months ago

https://github.com/grrvlr/tsmd • Science 26%

The TSMD project brings together Motif Discovery methods for Time Series, aiming to compare their performance through well-defined research questions and to simplify their practical use. It provides both guidelines for selecting the most suitable methods based on the data, and accessible implementations of the most relevant approaches.