updated_diffusion_q_learning
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file found
- ✓ codemeta.json file found
- ✓ .zenodo.json file found
- ○ DOI references
- ✓ Academic publication links (links to arxiv.org)
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (14.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: RDG0818
- License: apache-2.0
- Language: Python
- Default Branch: main
- Size: 100 MB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Updated Diffusion Q Learning
This repository provides a modern implementation of the paper Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning. The goal of this project is to provide an easy-to-use and extensible benchmark for further research in offline reinforcement learning with diffusion models.
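The core idea behind Diffusion Q-Learning is that the policy produces an action by iteratively denoising Gaussian noise, with each reverse step guided by a learned noise-prediction network. A toy sketch of that reverse process in plain Python is shown below; the constant-beta schedule and the zero noise predictor are stand-ins for the learned components in the actual implementation:

```python
import math
import random

def sample_action(eps_model, n_steps=10, beta=0.1):
    """Toy DDPM-style reverse process for a 1-D action.

    eps_model(x, t) stands in for the learned noise-prediction
    network; the constant-beta schedule is a simplification of the
    variance schedules used in practice.
    """
    alpha = 1.0 - beta
    alpha_bar = [alpha ** (t + 1) for t in range(n_steps)]

    x = random.gauss(0.0, 1.0)  # start from pure noise
    for t in reversed(range(n_steps)):
        eps = eps_model(x, t)  # predicted noise at step t
        # posterior mean of the reverse transition (DDPM update rule)
        x = (x - beta / math.sqrt(1.0 - alpha_bar[t]) * eps) / math.sqrt(alpha)
        if t > 0:  # add noise on all but the final step
            x += math.sqrt(beta) * random.gauss(0.0, 1.0)
    return max(-1.0, min(1.0, x))  # clip to the action range

a = sample_action(lambda x, t: 0.0)
```

In Diffusion Q-Learning, `eps_model` is trained with both a behavior-cloning diffusion loss and a Q-value guidance term, so the denoised actions stay close to the dataset while improving return.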
Core Changes
- Natively handles the `hopper`, `walker2d`, `halfcheetah`, `antmaze`, `kitchen`, and `pen` environments from the Minari benchmarks.
- All hyperparameters and settings are managed with Hydra, allowing swappable configurations via the command line or `config.yaml`.
- All logging is done through Weights & Biases for automatic visualization, analysis, and hyperparameter tuning.
- The codebase is fully type-hinted and documented, and includes an `evaluation.py` script for performance tests on model checkpoints.
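Hydra resolves dotted `key=value` overrides against a nested config. The following plain-Python toy (not Hydra's actual implementation, which also handles config groups, interpolation, and multirun) illustrates just the dot-path override mechanic used on the command line:

```python
def apply_overrides(config, overrides):
    """Toy illustration of Hydra-style dotted overrides.

    Walks each dotted key down into the nested dict and replaces
    the leaf with the (naively typed) value.
    """
    for item in overrides:
        key, _, raw = item.partition("=")
        node = config
        parts = key.split(".")
        for part in parts[:-1]:
            node = node.setdefault(part, {})
        try:
            value = float(raw)  # numeric overrides become numbers
        except ValueError:
            value = raw.strip('"')  # everything else stays a string
        node[parts[-1]] = value
    return config

cfg = {"agent": {"lr": 3e-4, "eta": 1.0}, "env": "hopper_medium"}
apply_overrides(cfg, ["agent.lr=0.00003", "env=walker2d_medium_expert"])
```

After the call, `cfg["agent"]["lr"]` is `3e-05` and `cfg["env"]` is `"walker2d_medium_expert"`, mirroring what a `python main.py agent.lr=0.00003 env=walker2d_medium_expert` invocation does through Hydra.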
Installation
Clone the Repository:
```bash
git clone https://github.com/RDG0818/Updated_Diffusion_Q_Learning.git
cd Updated_Diffusion_Q_Learning
```
Create Conda Environment and Install Dependencies:
```bash
conda create -n diff_policy python=3.10
conda activate diff_policy
pip install -r requirements.txt
pip install -e .
```
Download Datasets:
Run the setup script to download the necessary Minari datasets. Note that this may take some time and requires at least 5-10 GB of free disk space.
```bash
python dataset_download.py
```
Usage
Training a Model
To start a training run, execute main.py. You can override any configuration setting from the command line.
```bash
python main.py
```
Train on a different environment, or override several settings at once:
```bash
python main.py env=walker2d_medium_expert
python main.py env=pen_human agent.lr=0.00003 agent.eta=0.15 reward_tune="normalize" agent.grad_norm=1.0 eval_episodes=50
```
Evaluating a Trained Model
Use the `evaluation.py` script to load a saved checkpoint from a completed run and evaluate its performance.
```bash
python evaluation.py --run_dir results/hopper_medium_expert-0
```
To change the environment or modify the hyperparameters, see `config/config.yaml`.
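An evaluation script of this kind typically rolls the policy out for a number of episodes and averages the undiscounted returns. The sketch below shows that generic loop with a stub environment (all names here are hypothetical, not the repository's actual API):

```python
class StubEnv:
    """Stands in for a Minari/Gymnasium environment."""
    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return 0.0  # observation

    def step(self, action):
        self.t += 1
        reward = 1.0  # constant reward for the stub
        done = self.t >= self.horizon
        return 0.0, reward, done

def evaluate(policy, env, episodes=10):
    """Average undiscounted return over `episodes` rollouts."""
    returns = []
    for _ in range(episodes):
        obs, done, total = env.reset(), False, 0.0
        while not done:
            obs, reward, done = env.step(policy(obs))
            total += reward
        returns.append(total)
    return sum(returns) / len(returns)

mean_return = evaluate(lambda obs: 0.0, StubEnv())  # → 5.0
```

In the real script, the policy would be restored from the checkpoint in `--run_dir` and the environment built from the run's saved config before this loop runs.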
Results
Below are the training curves from W&B.
Citation
If you use this code in your research, please consider citing it:
```bibtex
@software{Goodwin_2025_DiffusionQL,
  author = {Goodwin, Ryan},
  title = {{A Refactored Implementation of Diffusion Policies for Offline RL}},
  month = {6},
  year = {2025},
  url = {https://github.com/RDG0818/Updated_Diffusion_Q_Learning}
}
```
Acknowledgements
This project is a refactoring and extension of the original work by Zhendong Wang, et al. from the following paper and codebase: Diffusion Policies as an Expressive Policy Class For Offline Reinforcement Learning.
License
This project is licensed under the Apache 2.0 License.
Owner
- Login: RDG0818
- Kind: user
- Repositories: 1
- Profile: https://github.com/RDG0818
Citation (CITATION.cff)
```yaml
cff-version: 1.2.0
message: "If you use this software in your work, please cite it as below."
authors:
  - family-names: "Goodwin"
    given-names: "Ryan"
title: "A Refactored Implementation of Diffusion Policies for Offline RL"
version: 1.0.0
date-released: 2025-06-26
repository-code: "https://github.com/RDG0818/Updated_Diffusion_Q_Learning"
```
GitHub Events
Total
- Push event: 4
Last Year
- Push event: 4
