Scientific Software
Updated 9 months ago

gym-electric-motor (GEM) — Peer-reviewed • Rank 14.2 • Science 98%

gym-electric-motor (GEM): A Python toolbox for the simulation of electric drive systems - Published in JOSS (2021)

Scientific Software
Updated 9 months ago

DeepBench — Peer-reviewed • Rank 8.6 • Science 95%

DeepBench: A simulation package for physical benchmarking data - Published in JOSS (2025)

Mathematics
Scientific Software · Peer-reviewed
Scientific Software
Updated 9 months ago

ctbench - compile-time benchmarking and analysis — Peer-reviewed • Rank 4.9 • Science 95%

ctbench - compile-time benchmarking and analysis - Published in JOSS (2023)

Updated 9 months ago

fastcrypto • Rank 22.3 • Science 67%

Common cryptographic library used in software at Mysten Labs.

Updated 9 months ago

yaib • Rank 10.1 • Science 77%

🧪Yet Another ICU Benchmark: a holistic framework for the standardization of clinical prediction model experiments. Provide custom datasets, cohorts, prediction tasks, endpoints, preprocessing, and models. Paper: https://arxiv.org/abs/2306.05109

Updated 9 months ago

asreview-insights • Rank 12.3 • Science 67%

Tools such as plots and metrics to analyze (simulated) reviews for ASReview LAB

Updated 9 months ago

tiny_qa_benchmark_pp • Rank 2.1 • Science 77%

Tiny QA Benchmark++ a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultra-fast LLM smoke-testing & regression-catching.

Updated 9 months ago

proteinworkshop • Rank 11.6 • Science 67%

Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models and training/task utilities. (ICLR 2024)

Updated 9 months ago

SciMLBenchmarks • Rank 11.5 • Science 67%

Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, Jax), MATLAB, R

Updated 9 months ago

seb • Rank 11.3 • Science 67%

A Scandinavian Benchmark for sentence embeddings

Updated 9 months ago

babelstream • Rank 10.7 • Science 67%

STREAM, for lots of devices written in many programming models

Updated 9 months ago

benchexec • Rank 18.5 • Science 59%

BenchExec: A Framework for Reliable Benchmarking and Resource Measurement

Updated 9 months ago

amlb • Rank 13.2 • Science 64%

OpenML AutoML Benchmarking Framework

Updated 9 months ago

BenchmarkPlots • Rank 20.1 • Science 54%

A benchmarking framework for the Julia language

Updated 9 months ago

compression_benchmark • Rank 4.8 • Science 67%

Benchmarking FASTQ compression with 'mature' compression algorithms

Updated 9 months ago

benchmarks-acoustic-propagation • Rank 3.0 • Science 67%

Coupled model development for acoustic propagation through multilayer systems for particle-velocity sensors

Updated 9 months ago

pytorch-benchmark • Rank 11.9 • Science 57%

Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption

Updated 9 months ago

hyperfine • Rank 14.9 • Science 54%

A command-line benchmarking tool

Updated 9 months ago

logpai • Rank 18.1 • Science 49%

A machine learning toolkit for log parsing [ICSE'19, DSN'16]

Updated 9 months ago

py-torchbenchmark • Rank 12.5 • Science 54%

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Updated 9 months ago

benchmarl • Rank 8.2 • Science 54%

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.

Updated 9 months ago

h5bench • Rank 7.9 • Science 54%

A benchmark suite for measuring HDF5 performance.

Updated 9 months ago

qcd • Rank 4.0 • Science 54%

Quantum Circuit Designer: A gymnasium-based set of environments for benchmarking reinforcement learning for quantum circuit design.

Updated 9 months ago

muld • Rank 3.8 • Science 54%

The Multitask Long Document Benchmark

Updated 9 months ago

aptv2 • Rank 3.3 • Science 54%

The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://github.com/pandorgan/APT-36K

Updated 9 months ago

jreferral • Rank 3.2 • Science 54%

An open-source tool that recommends the most energy efficient JVM configuration for java software

Updated 9 months ago

birdset • Rank 7.0 • Science 46%

A benchmark dataset collection for bird sound classification

Updated 9 months ago

are-we-fast-yet • Rank 8.2 • Science 44%

Are We Fast Yet? Comparing Language Implementations with Objects, Closures, and Arrays

Updated 9 months ago

opencl-benchmark • Rank 5.5 • Science 44%

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Updated 9 months ago

https://github.com/beuth-erdelt/benchmark-experiment-host-manager • Rank 9.9 • Science 39%

This python tool helps managing DBMS benchmarking experiments in a Kubernetes-based HPC cluster environment. It enables users to configure hardware / software setups for easily repeating tests over varying configurations.

Updated 9 months ago

polybench • Rank 7.5 • Science 36%

Multivariate polynomial arithmetic benchmark tests.

Updated 9 months ago

leakdb • Rank 5.2 • Science 36%

LeakDB (Leakage Diagnosis Benchmark) is a realistic leakage dataset for water distribution networks. The dataset is comprised of a large number of artificially created but realistic leakage scenarios, on different water distribution networks, under varying conditions. A scoring algorithm in MATLAB code is provided to evaluate the results of different algorithms.

Updated 9 months ago

hdnom • Rank 14.5 • Science 26%

🔮 Benchmarking and visualization toolkit for penalized Cox models

Updated 9 months ago

https://github.com/brucewlee/h-test • Rank 0.7 • Science 33%

[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language

Updated 9 months ago

https://github.com/cdjellen/otbench • Rank 5.0 • Science 26%

Effective Benchmarks for Optical Turbulence Modeling

Updated 9 months ago

sceneflow_from_blender • Rank 2.8 • Science 28%

Get 3D motion vectors / scene flow directly from Blender

Updated 9 months ago

spring_utils • Rank 1.4 • Science 28%

Additional utility code for the Spring dataset and benchmark

Updated 9 months ago

https://github.com/aim-uofa/geobench • Rank 4.8 • Science 23%

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

Updated 9 months ago

https://github.com/citiususc/blinkg • Rank 0.7 • Science 26%

BLINKG: Benchmark for LLM-Integrated Knowledge Graph Generation

Updated 9 months ago

https://github.com/ai-forever/ruscode • Rank 2.1 • Science 23%

Official repository for RusCode benchmark dataset (NAACL 2025)

Updated 9 months ago

https://github.com/jurgisp/memory-maze • Rank 12.3 • Science 10%

Evaluating long-term memory of reinforcement learning algorithms

Updated 9 months ago

https://github.com/crate/tsperf • Rank 4.4 • Science 13%

TSPERF Time Series Database Benchmark Suite. Framework for evaluating and comparing the performance of time series databases, in the spirit of TimescaleDB's TSBS.

Updated 9 months ago

https://github.com/yegor256/plum • Rank 3.8 • Science 13%

Programming language ultimate metrics (PLUM) collected automatically from GitHub, Google Scholar, Twitter, etc.

Updated 9 months ago

pglib-opf • Rank 5.9 • Science 10%

Benchmarks for the Optimal Power Flow Problem

Updated 9 months ago

cellbench • Rank 5.6 • Science 10%

R package for benchmarking single cell analysis methods

Updated 9 months ago

https://github.com/aliireza/ddio-bench • Rank 5.2 • Science 10%

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

Updated 9 months ago

foundation-model-benchmarking-tool • Science 54%

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.

Updated 9 months ago

https://github.com/aksw/frankgraphbench • Science 49%

The FranKGraphBench is a Framework to allow KG Aware RSs to be benchmarked in a reproducible and easy to implement manner. It was first created on Google Summer of Code 2023 for Data Integration between DBpedia and some standard RS datasets in a reproducible framework.

Updated 9 months ago

robustbench • Science 54%

RobustBench: a standardized adversarial robustness benchmark [NeurIPS 2021 Benchmarks and Datasets Track]

Updated 9 months ago

https://github.com/alan-turing-institute/tcpdbench • Science 23%

The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data

Updated 9 months ago

https://github.com/cosmaadrian/rocode • Science 10%

Official repository for "RoCode: A Dataset for Measuring Code Intelligence from Romanian Problem Definitions"

Updated 9 months ago

neteasecrowd-dataset • Science 54%

NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.

Updated 9 months ago

https://github.com/birgitrijvers/copd-microbiome-shift-analysis • Science 13%

This is the repository for a bioinformatics project about identifying the optimal pipeline for identifying a potential microbiome shift in COPD patients after ceasing azithromycin treatment.

Updated 9 months ago

https://github.com/hyperledger-caliper/caliper-benchmarks • Science 13%

Sample benchmark files for Hyperledger Caliper https://wiki.hyperledger.org/display/caliper

Updated 9 months ago

https://github.com/grrvlr/tsmd • Science 26%

The TSMD project brings together Motif Discovery methods for Time Series, aiming to compare their performance through well-defined research questions and to simplify their practical use. It provides both guidelines for selecting the most suitable methods based on the data, and accessible implementations of the most relevant approaches.

Updated 9 months ago

symbolic-governed-mistral-artifact • Science 44%

Tier-10 sealed governance artifact for Mistral-7B with exact-match benchmarks and symbolic verifier.

Updated 9 months ago

dpl • Science 44%

[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.

Updated 9 months ago

agentdojo • Science 36%

A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.

Updated 9 months ago

benchmarks • Science 44%

A collection of micro benchmarks.

Updated 9 months ago

optalcp-benchmarks • Science 26%

Benchmarks, demos and utilities for OptalCP solver

Updated 9 months ago

https://github.com/anibulus/benchmark.net • Science 26%

A console application to benchmark different ways to order. Just an experiment

Updated 9 months ago

web-framework-benchmark-thesis • Science 31%

Benchmark comparison of Leptos to Leading JavaScript Web Frameworks

Updated 9 months ago

disa-windows-server-2016 • Science 52%

This repository is part of the paper Automated Implementation of Windows-related Security-Configuration Guides presented at the 35th IEEE/ACM International Conference on Automated Software Engineering.