DSSE
DSSE: An environment for simulation of reinforcement learning-empowered drone swarm maritime search and rescue missions - Published in JOSS (2024)
sb3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
torchrl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
RL
RL: Generic reinforcement learning codebase in TensorFlow - Published in JOSS (2019)
morl-baselines
Multi-Objective Reinforcement Learning algorithms implementations.
benchmarl
BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.
https://github.com/google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
barl-simpleoptions
A Python package that provides a simple framework for working with Options in Reinforcement Learning.
https://github.com/aim-uofa/omni-r1
Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
https://github.com/arbit3rr/nn-control
Control Methods for Dynamic Systems based on Neural Networks
https://github.com/aim-uofa/active-o3
ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
opfgym
A gymnasium-compatible framework to create reinforcement learning (RL) environment for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
rl_task_independent_joint_control
Learning Task-independent Joint Control for Robotic Manipulators with Reinforcement Learning and Curriculum Learning
https://github.com/araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
https://github.com/araffin/pytorch_agents
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR). Python2 compatible (branch python2)
https://github.com/bytedance/raylink
Framework to build and train RL algorithms