pbrl

Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation

https://github.com/asad-shahid/pbrl

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 6 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (14.3%) to scientific vocabulary

Last synced: 11 months ago · JSON representation ·

Repository

Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation

Basic Info

Host: GitHub
Owner: Asad-Shahid
Language: Python
Default Branch: main
Homepage:
Size: 64.5 MB

Statistics

Stars: 8
Watchers: 1
Forks: 2
Open Issues: 0
Releases: 0

Created over 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme Citation

PBRL - Population-Based Reinforcement Learning

Website | Paper | Videos

About this repository

This repository contains the code to train RL agents in Population-Based manner.

Installation

Create a new conda environment with:

sh conda create -n pbrl python=3.8 conda activate pbrl
Install IsaacGym (tested with Preview 4 Release). Follow the instructions to download the package.

```sh tar -xvf IsaacGymPreview4_Package.tar.gz cd isaacgym/python pip install -e .

Test IsaacGym installation

cd examples python joint_monkey.py ```
Install this repo:

sh git clone https://github.com/Asad-Shahid/PBRL.git cd PBRL pip install -e .

Getting Started

Navigate to the isaacgymenvs directory and run:

python python -m pbrl.main --task ShadowHand --num_envs 4096 --num_agents 4

Some key arguments are:

--task selects a task from isaacgymenvs. All tasks released in isaacgymenvs==1.5.1 are supported.
--num_envs selects the number of environments to run.
--num_agents chooses the number of agents to train in parallel for pbrl. Note: num_envs must divide num_agents.
--pbrl whether to use PBRL.\ Note: when True, num_agents must be a multiple of 4 (Top 25% of the agents are selected to replace/get replaced).
--algo which RL algorithm to use for training. Options are: ppo, sac, ddpg.
--pbrl_params name of .json file with hyperparameters to tune in cfg/pbrl. Only hyperparameters listed in corresponding .json files are currently supported. Initial values are sampled uniformly from the specified range. When training a single RL agent, specify inital values in the file.
--mut_scheme which mutation scheme to use for mutating hyperparameters.

All other arguments can be found in cfg/pbrl/_init__.py

Citing

Please cite this work as:

bibtex @article{Shahid2025pbrl, author = {Asad Ali Shahid and Yashraj Narang and Vincenzo Petrone and Enrico Ferrentino and Ankur Handa and Dieter Fox and Marco Pavone and Loris Roveda}, title = {Benchmarking Population-Based Reinforcement Learning across Robotics Tasks with GPU Accelerated Simulation}, journal = {arXiv preprint}, year = {2025}, doi = {10.48550/arXiv.2404.03336}, url = {https://doi.org/10.48550/arXiv.2404.03336}, }

If you reuse our code, you can site this repo as:

bibtex @software{Shahid2024pbrlrepo, author = {Shahid, Asad Ali and Narang, Yashraj and Petrone, Vincenzo and Ferrentino, Enrico and Handa, Ankur and Fox, Dieter and Pavone, Marco and Roveda, Loris}, doi = {10.5281/zenodo.TBD}, month = mar, title = {{Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation}}, url = {https://github.com/Asad-Shahid/PBRL}, version = {1.0.0}, year = {2024} }

Owner

Name: Asad A. Shahid
Login: Asad-Shahid
Kind: user

Repositories: 2
Profile: https://github.com/Asad-Shahid

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Shahid"
  given-names: "Asad Ali"
  orcid: "https://orcid.org/0009-0001-0312-9657"
- family-names: "Narang"
  given-names: "Yashraj"
  orcid: "https://orcid.org/0000-0001-5445-3759"
- family-names: "Petrone"
  given-names: "Vincenzo"
  orcid: "https://orcid.org/0000-0003-4777-1761"
- family-names: "Ferrentino"
  given-names: "Enrico"
  orcid: "https://orcid.org/0000-0003-0768-8541"
- family-names: "Handa"
  given-names: "Ankur"
  orcid: "https://orcid.org/0000-0000-0000-0000"
- family-names: "Fox"
  given-names: "Dieter"
  orcid: "https://orcid.org/0000-0000-0000-0000"
- family-names: "Pavone"
  given-names: "Marco"
  orcid: "https://orcid.org/0000-0002-0206-4337"
- family-names: "Roveda"
  given-names: "Loris"
  orcid: "https://orcid.org/0000-0002-4427-536X"
title: "Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation"
version: 1.0.0
doi: 10.5281/zenodo.1234
date-released: 2024-03-04
url: "https://github.com/Asad-Shahid/PBRL"

GitHub Events

Total

Watch event: 3
Push event: 3

Last Year

Watch event: 3
Push event: 3

Dependencies

setup.py pypi

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science