PyTupli: Enabling Collaboration in Offline Reinforcement Learning

PyTupli: Enabling Collaboration in Offline Reinforcement Learning - Published in JOSS (2026)

https://github.com/tumcps/pytupli

Science Score: 87.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in JOSS metadata
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
✓
JOSS paper metadata
Published in Journal of Open Source Software

Last synced: 4 days ago · JSON representation

Repository

Infrastructure for collaborative offline RL datasets and benchmarks

Basic Info

Host: GitHub
Owner: TUMcps
License: mit
Language: Python
Default Branch: main
Homepage: https://pytupli.readthedocs.io/
Size: 1.42 MB

Statistics

Stars: 2
Watchers: 2
Forks: 1
Open Issues: 1
Releases: 1

Created about 1 year ago · Last pushed about 1 month ago

Metadata Files

Readme Contributing License Code of conduct Security

PyTupli

Latest Release

PyTupli is a Python library for creating, storing, and sharing benchmark problems and datasets for offline reinforcement learning (RL). PyTupli includes a lightweight client library with defined interfaces for uploading and retrieving benchmarks and data. It supports fine-grained filtering at both the episode and tuple level, allowing researchers to curate high-quality, task-specific datasets. A containerized server component enables production-ready deployment with authentication, access control, and automated certificate provisioning for secure use. By addressing key barriers in dataset infrastructure, PyTupli facilitates more collaborative, reproducible, and scalable offline RL research.

By using PyTupli, you can:

✅ Create benchmarks from any Gymnasium-compatible environment
✅ Share environments without exposing sensitive implementation details
✅ Record episode data from interactions with the environment or store static datasets associated to a benchmark
✅ Download datasets and convert them into formats compatible with popular offline RL libraries such as d3rlpy
✅ Store and manage artifacts like trained models or time series data

Installation

You can install PyTupli using pip:

bash pip install pytupli

Or if you're using Poetry:

bash poetry add pytupli

Development Setup

For local development in editable mode, clone the repository, navigate to the package directory and run bash poetry install

Optional Dependencies

PyTupli has several optional dependency groups that can be installed based on your needs:

Machine Learning Components: To install PyTorch for advanced quality metrics: bash poetry install --with ml Note: QFunctionMetric requires PyTorch. GeneralizedBehavioralEntropyMetric only requires PyTorch if you provide an observation_encoder. If you try to use these without installing PyTorch, you'll get a helpful error message.

Server Components: To install dependencies for running the PyTupli server: bash poetry install --with server

Documentation: To build the documentation: bash poetry install --with docs

Testing: To run tests: bash poetry install --with tests

You can combine multiple groups: bash poetry install --with server,docs,tests

Deployment

For deployment instructions, please refer to the deployment documentation.

Access Management

For a detailed guide of PyTupli's access management, please refer to the access management documentation.

CLI Usage

PyTupli provides a command-line interface for the TupliAPIClient. After deployment, first log in to the server and specify the URL:

bash pytupli login --username your_username --password your_password --url http://your-server:port

The URL will then be remembered for all future interactions with the server. As an alternative to handing it over during login, you can call bash pytupli set_url --url http://your-server:port

For user management, you can create new users and change passwords (requires admin privileges): ```bash

Create a new user

pytupli signup --username newuser --password initialpassword

Change a user's password

pytupli changepassword --username targetuser --password new_password ```

Other useful utilities of the CLI are listing benchmarks or artifacts: ```bash

List available benchmarks

pytupli list_benchmarks

List episodes for a benchmark

pytupli list_artifacts ```

You can get detailed help on any command by using the --help flag: ```bash

Show all available commands

pytupli --help

Get help on a specific command

pytupli command_name --help ```

Note: If you have IPython installed, it must be version <8.4 for the help functionality to work properly. This is due to a known issue that has been fixed in the underlying Fire library but is not yet available in the latest release.

Basic Usage Example

PyTupli makes it easy to create and share reinforcement learning benchmarks and associated datasets for offline RL. Here's an example of how a collaborative offline RL project based on PyTupli might look like:

Organization A has developed an environment for their specific use case (e.g., an energy management system). They have some historix data that they want to use to train an offline RL baseline.
They wrap their environment using PyTupli's wrapper classes to standardize the interface
They can then store and publish the benchmark through the PyTupli API.
They upload the historic data as episodes associated to the newly-created benchmark, making it available to other organizations.
Organization B can access the benchmark and download the data.
Before training their algorithms on it, B can filter the dataset, for example, for data created during a specific time period.
Finally, trained agents can be uploaded as artifacts associated to the benchmark

Code example (abbreviated):

```python

Organization A: Instantiate API storage object

tuplistorage = TupliAPIClient() tuplistorage.set_url("https://company-a-server.com/api")

Instantiate gymnasium environment

custom_env = PowerSystemEnv()

Wrap environment

tuplienv = TupliEnvWrapper(env=customenv, storage=tupli_storage)

Store and publish the benchmark

tuplienv.store( name='EMSbenchmark', description="Energy management system control task" ) tupli_env.publish()

Load the historical data

historicepisodes = loadhistoric_data()

Record and publish the episodes

for eps in historicepisodes: epsitem = Episode( benchmarkid=tuplienv.id, metadata=eps.metadata, tuples=eps.tuples ) epsheader = tuplistorage.record(epsitem) tuplistorage.publish(eps_header.id)

Organization B: Instantiate API storage object

tuplistorage = TupliAPIClient() tuplistorage.set_url("https://company-a-server.com/api")

We assume that this is the id of the previously stored benchmark

stored_id = "dl345kn456mlkl230"

Download benchmark

loadedtuplienv = TupliEnvWrapper.load( storage=tuplistorage, benchmarkid=stored_id )

Create dataset containing all episodes recorded during the summer months

months = ["June", "July", "August"] filtersummer = FilterOR(filters=[FilterEQ(key="month", value=m) for m in months]) filterbenchmark = FilterEQ(key='id', value=storedid) datasetsummer = TupliDataset( storage=tuplistorage ).withbenchmarkfilter(filterbenchmark).withepisodefilter(filtersummer) datasetsummer.load()

Convert to d3rlpy dataset

obs, act, rew, term, trunc = datasetsummer.convertto_tensors() ``` For a comprehensive guide covering most of PyTupli's functionality, including recording episodes, managing artifacts, and creating datasets, please refer to the Introduction.ipynb tutorial in the docs/source/tutorials directory.

Documentation

A detailed documention of the client library as well as our tutorials are available on ReadtheDocs.

Reference

PyTupli is maintained by the Cyber-Physical Systems Group at the Chair for Robotics and Embedded Systems at Technical University of Munich.

If you use PyTupli, please include the following reference @article{markgraf2025pytupli, title={PyTupli: A Scalable Infrastructure for Collaborative Offline Reinforcement Learning Projects}, author={Markgraf, Hannah and Eichelbeck, Michael and Cappey, Daria and Demirt{\"u}rk, Selin and Schattschneider, Yara and Althoff, Matthias}, journal={arXiv preprint arXiv:2505.16754}, year={2025} }

Owner

Login: TUMcps
Kind: user

Repositories: 2
Profile: https://github.com/TUMcps

JOSS Publication

PyTupli: Enabling Collaboration in Offline Reinforcement Learning

Published

June 29, 2026

DOI

10.21105/joss.10144

Volume 11, Issue 122, Page 10144

Authors

Hannah Markgraf

Technical University of Munich, Germany

Michael Eichelbeck

Technical University of Munich, Germany

Daria Cappey
Technical University of Munich, Germany

Selin Demirtürk
Technical University of Munich, Germany

Yara Schattschneider
Technical University of Munich, Germany

Matthias Althoff

Technical University of Munich, Germany

Editor

Owen Lockwood

GitHub Events

Total

Member event: 1
Push event: 31
Create event: 11

Last Year

Push event: 23
Create event: 6

Dependencies

deployment/docker_compose/Dockerfile docker

python 3.13-slim build

pyproject.toml pypi

myst-parser ^4.0.1 docs
nbsphinx ^0.9.7 docs
sphinx ^8.1.0 docs
sphinx-autodoc-typehints ^3.0.0 docs
sphinx-autorun ^2.0.0 docs
sphinx-rtd-theme ^3.0.2 docs
fire ^0.7.0
gymnasium ^1.1.1
jsonpickle ^4.0.2
keyring ^25.6.0
pandas ^2.2.3
pydantic ^2.10.6
pyjwt ^2.10.1
python ^3.11
requests ^2.32.3
tabulate ^0.9.0
bcrypt ^4.3.0 server
fastapi ^0.115.11 server
motor ^3.7.0 server
passlib ^1.7.4 server
pymongo ^4.11.3 server
python-dotenv ^1.0.1 server
python-multipart ^0.0.20 server
uvicorn ^0.34.0 server
asgi-lifespan ^2.1.0 tests
httpx ^0.28.1 tests
pytest 8.3.5 tests
pytest-asyncio ^0.26.0 tests
pytest-cov ^6.0.0 tests
pytest-forked ^1.6.0 tests

PyTupli: Enabling Collaboration in Offline Reinforcement Learning

Science Score: 87.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

PyTupli

Installation

Development Setup

Optional Dependencies

Deployment

Access Management

CLI Usage

Create a new user

Change a user's password

List available benchmarks

List episodes for a benchmark

Show all available commands

Get help on a specific command

Basic Usage Example

Organization A: Instantiate API storage object

Instantiate gymnasium environment

Wrap environment

Store and publish the benchmark

Load the historical data

Record and publish the episodes

Organization B: Instantiate API storage object

We assume that this is the id of the previously stored benchmark

Download benchmark

Create dataset containing all episodes recorded during the summer months

Convert to d3rlpy dataset

Documentation

Reference

Owner

JOSS Publication

PyTupli: Enabling Collaboration in Offline Reinforcement Learning

Authors

Editor

Tags

GitHub Events

Total

Last Year

Dependencies