gym-pybullet-drones
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file (found codemeta.json file)
- ✓ .zenodo.json file (found .zenodo.json file)
- ✓ DOI references (found 1 DOI reference(s) in README)
- ✓ Academic publication links (links to: arxiv.org)
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity (low similarity, 10.1%, to scientific vocabulary)
Repository
Basic Info
- Host: GitHub
- Owner: jackyoung96
- License: mit
- Language: Python
- Default Branch: master
- Size: 98.2 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
NOTE
This repository's master branch is actively developed; please git pull frequently, and feel free to open new issues for any undesired, unexpected, or (presumably) incorrect behavior. Thanks!
gym-pybullet-drones
Simple OpenAI Gym environment based on PyBullet for multi-agent reinforcement learning with quadrotors

- The default `DroneModel.CF2X` dynamics are based on Bitcraze's Crazyflie 2.x nano-quadrotor
- Everything after a `$` is entered on a terminal; everything after `>>>` is passed to a Python interpreter
- To better understand how the PyBullet back-end works, refer to its Quickstart Guide
- Suggestions and corrections are very welcome, in the form of issues and pull requests, respectively
Why Reinforcement Learning of Quadrotor Control
A lot of recent RL research for continuous actions has focused on policy gradient algorithms and actor-critic architectures. A quadrotor is (i) an easy-to-understand mobile robot platform whose (ii) control can be framed as a continuous states and actions problem but, beyond 1-dimension, (iii) it adds the complexity that many candidate policies lead to unrecoverable states, violating the assumption of the existence of a stationary state distribution on the entailed Markov chain.
Overview
| | gym-pybullet-drones | AirSim | Flightmare |
|---------------------------------: | :-------------------: | :-------------------------------------------: | :-------------------------------------------------: |
| Physics | PyBullet | FastPhysicsEngine/PhysX | Ad hoc/Gazebo |
| Rendering | PyBullet | Unreal Engine 4 | Unity |
| Language | Python | C++/C# | C++/Python |
| RGB/Depth/Segm. views | Yes | Yes | Yes |
| Multi-agent control | Yes | Yes | Yes |
| ROS interface | ROS2/Python | ROS/C++ | ROS/C++ |
| Hardware-In-The-Loop | No | Yes | No |
| Fully steppable physics | Yes | No | Yes |
| Aerodynamic effects | Drag, downwash, ground| Drag | Drag |
| OpenAI Gym interface | Yes | No | Yes |
| RLlib MultiAgentEnv interface | Yes | No | No |
Performance
Simulation speed-up with respect to the wall-clock when using:
- 240Hz (in simulation clock) PyBullet physics for EACH drone
- AND 48Hz (in simulation clock) PID control of EACH drone
- AND nearby obstacles AND a mildly complex background (see GIFs)
- AND 24FPS (in sim. clock), 64x48 pixel capture of 6 channels (RGBA, depth, segm.) on EACH drone
| | Lenovo P52 (i7-8850H/Quadro P2000) | 2020 MacBook Pro (i7-1068NG7) |
| --------------------------------: | :--------------------------------: | :---------------------------: |
| Rendering | OpenGL | CPU-based TinyRenderer |
| Single drone, no vision | 15.5x | 16.8x |
| Single drone with vision | 10.8x | 1.3x |
| Multi-drone (10), no vision | 2.1x | 2.3x |
| Multi-drone (5) with vision | 2.5x | 0.2x |
| 80 drones in 4 env, no vision | 0.8x | 0.95x |
Note:
- Use `gui=False` and `aggregate_phy_steps=int(SIM_HZ/CTRL_HZ)` for better performance
- While it is easy to cherry-pick statistics (consciously or not), ~5kHz PyBullet physics (CPU-only) is faster than AirSim (1kHz) and more accurate than Flightmare's 35kHz simple single-quadcopter dynamics
- Exploiting parallel computation, i.e., multiple (80) drones in multiple (4) environments (see script `parallelism.sh`), achieves PyBullet physics updates at ~20kHz
- Multi-agent 6-ch. video capture at ~750kB/s with CPU rendering (`(64*48)*(4+4+2)*24*5*0.2`) is comparable to Flightmare's 240 RGB frames/s (`(32*32)*3*240`), although in more complex Unity environments, and up to an order of magnitude faster on Ubuntu, with OpenGL rendering
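The two capture-bandwidth figures in the last bullet can be checked directly; both expressions evaluate to the same ~737 kB/s:

```python
# Multi-agent capture: 64x48 frames, (4+4+2) bytes/pixel (RGBA + depth + segmentation),
# 24 FPS in simulation clock, 5 drones, at a ~0.2x real-time factor
multi_agent_bytes_per_s = (64 * 48) * (4 + 4 + 2) * 24 * 5 * 0.2

# Flightmare's reference rate: 32x32 RGB frames at 240 FPS
flightmare_bytes_per_s = (32 * 32) * 3 * 240

print(multi_agent_bytes_per_s)  # 737280.0, i.e. ~737 kB/s
print(flightmare_bytes_per_s)   # 737280
```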
Requirements and Installation
The repo was written using Python 3.7 with conda on macOS 10.15 and tested on macOS 11 and Ubuntu 18.04
On macOS and Ubuntu
Major dependencies are gym, pybullet,
stable-baselines3, and rllib
$ pip3 install --upgrade numpy Pillow matplotlib cycler
$ pip3 install --upgrade gym pybullet stable_baselines3 'ray[rllib]'
Video recording requires ffmpeg to be installed; on macOS
$ brew install ffmpeg
On Ubuntu
$ sudo apt install ffmpeg
The repo is structured as a Gym Environment
and can be installed with pip install --editable
$ git clone https://github.com/utiasDSL/gym-pybullet-drones.git
$ cd gym-pybullet-drones/
$ pip3 install -e .
On Ubuntu and with a GPU available, optionally uncomment line 203 of BaseAviary.py to use the eglPlugin
On Windows
Check these step-by-step instructions written by Dr. Karime Pereida for Windows 10
Examples
There are 2 basic template scripts in examples/: fly.py and learn.py
`fly.py` runs an independent flight using PID control implemented in class `DSLPIDControl`

$ cd gym-pybullet-drones/examples/
$ python3 fly.py # Try 'python3 fly.py -h' to show the script's customizable parameters

> Tip: use the GUI's sliders and button `Use GUI RPM` to override the control with interactive inputs


`learn.py` is an RL example to learn take-off using `stable-baselines3`'s A2C or `rllib`'s PPO

$ cd gym-pybullet-drones/examples/
$ python3 learn.py # Try 'python3 learn.py -h' to show the script's customizable parameters

Other scripts in folder examples/ are
`downwash.py` is a flight script with only 2 drones, to test the downwash model

$ cd gym-pybullet-drones/examples/
$ python3 downwash.py # Try 'python3 downwash.py -h' to show the script's customizable parameters

`compare.py` replays and compares a flight to a trace saved in `example_trace.pkl`

$ cd gym-pybullet-drones/examples/
$ python3 compare.py # Try 'python3 compare.py -h' to show the script's customizable parameters

Experiments
Folder experiments/learning contains scripts with template learning pipelines
For single agent RL problems, using stable-baselines3, run the training script as
$ cd gym-pybullet-drones/experiments/learning/
$ python3 singleagent.py --env <env> --algo <alg> --obs <ObservationType> --act <ActionType> --cpu <cpu_num>
Run the replay script to visualize the best trained agent(s) as
$ python3 test_singleagent.py --exp ./results/save-<env>-<algo>-<obs>-<act>-<time-date>
For multi-agent RL problems, using rllib, run the training script as
$ cd gym-pybullet-drones/experiments/learning/
$ python3 multiagent.py --num_drones <num_drones> --env <env> --obs <ObservationType> --act <ActionType> --algo <alg> --num_workers <num_workers>
Run the replay script to visualize the best trained agent(s) as
$ python3 test_multiagent.py --exp ./results/save-<env>-<num_drones>-<algo>-<obs>-<act>-<date>
Class BaseAviary
A flight arena for one (or more) quadrotors can be created as a subclass of BaseAviary()
```python
env = BaseAviary(
    drone_model=DroneModel.CF2X,   # See DroneModel Enum class for other quadcopter models
    num_drones=1,                  # Number of drones
    neighbourhood_radius=np.inf,   # Distance at which drones are considered neighbors, only used for multiple drones
    initial_xyzs=None,             # Initial XYZ positions of the drones
    initial_rpys=None,             # Initial roll, pitch, and yaw of the drones in radians
    physics=Physics.PYB,           # Choice of (PyBullet) physics implementation
    freq=240,                      # Stepping frequency of the simulation
    aggregate_phy_steps=1,         # Number of physics updates within each call to BaseAviary.step()
    gui=True,                      # Whether to display PyBullet's GUI, only use this for debugging
    record=False,                  # Whether to save a .mp4 video (if gui=True) or .png frames (if gui=False) in gym-pybullet-drones/files/, see script /files/videos/ffmpeg_png2mp4.sh for encoding
    obstacles=False,               # Whether to add obstacles to the environment
    user_debug_gui=True)           # Whether to use addUserDebugLine and addUserDebugParameter calls (it can slow down the GUI)
```

An environment can also be instantiated with `gym.make()`, see [`learn.py`](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/examples/learn.py) for an example

```python
env = gym.make('rl-takeoff-aviary-v0')  # See learn.py
```

Then, the environment can be stepped with

```python
obs = env.reset()
for _ in range(10*240):
    obs, reward, done, info = env.step(env.action_space.sample())
    env.render()
    if done:
        obs = env.reset()
env.close()
```
Creating New Aviaries
A new RL problem can be created as a subclass of BaseAviary (i.e., class NewAviary(BaseAviary): ...) by implementing the following 7 abstract methods
```python
def _actionSpace(self):               # 1. e.g. return spaces.Box(low=np.zeros(4), high=np.ones(4), dtype=np.float32)

def _observationSpace(self):          # 2. e.g. return spaces.Box(low=np.zeros(20), high=np.ones(20), dtype=np.float32)

def _computeObs(self):                # 3. e.g. return self._getDroneStateVector(0)

def _preprocessAction(self, action):  # 4. e.g. return np.clip(action, 0, 1)

def _computeReward(self):             # 5. e.g. return -1

def _computeDone(self):               # 6. e.g. return False

def _computeInfo(self):               # 7. e.g. return {"answer": 42} # Calculated by the Deep Thought supercomputer in 7.5M years
```
See [CtrlAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/CtrlAviary.py), [VisionAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/VisionAviary.py), [HoverAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/single_agent_rl/HoverAviary.py), and [FlockAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/multi_agent_rl/FlockAviary.py) for examples
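For orientation, the seven methods can be sketched together in one hypothetical class. `NewAviary` below is shown standalone so it runs without the repo installed; in practice it would subclass `BaseAviary`, and the tuple-based spaces stand in for `gym.spaces` objects:

```python
import numpy as np

class NewAviary:  # in practice: class NewAviary(BaseAviary)
    """Hypothetical sketch of the 7 abstract methods a new RL problem implements."""

    def _actionSpace(self):                 # 1. normalized per-motor commands in [0, 1]
        return (np.zeros(4), np.ones(4))    #    stand-in for spaces.Box(low, high)

    def _observationSpace(self):            # 2. normalized 20-value state
        return (np.zeros(20), np.ones(20))

    def _computeObs(self):                  # 3. the drone's state vector
        return np.zeros(20)                 #    stand-in for self._getDroneStateVector(0)

    def _preprocessAction(self, action):    # 4. translate the action into motor commands
        return np.clip(action, 0, 1)

    def _computeReward(self):               # 5. constant penalty per step
        return -1

    def _computeDone(self):                 # 6. never terminate
        return False

    def _computeInfo(self):                 # 7. free-form diagnostics
        return {"answer": 42}
```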
Action Spaces Examples
Each subclass of BaseAviary must define its action space by implementing
```python
def _actionSpace(self):
    ...
```
In [CtrlAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/CtrlAviary.py) and [VisionAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/VisionAviary.py), it is a [`Dict()`](https://github.com/openai/gym/blob/master/gym/spaces/dict.py) of [`Box(4,)`](https://github.com/openai/gym/blob/master/gym/spaces/box.py) containing the drones' commanded RPMs
The dictionary's keys are "0", "1", .., "n", where n is the number of drones
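For example, a full action for such a `Dict()` space can be assembled as one entry per drone, keyed by the drone's index as a string (a minimal sketch; `MAX_RPM` is an illustrative bound, not the repo's exact constant):

```python
import numpy as np

NUM_DRONES = 3
MAX_RPM = 21702  # illustrative value; the real bound comes from the drone's URDF parameters

# One Box(4,)-shaped RPM command per drone, keyed by the drone's index as a string
action = {str(i): np.random.uniform(0, MAX_RPM, size=4) for i in range(NUM_DRONES)}
print(sorted(action.keys()))  # ['0', '1', '2']
```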
Each subclass of BaseAviary also needs to implement a preprocessing step translating actions into RPMs
```python
def _preprocessAction(self, action):
    ...
```
[CtrlAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/CtrlAviary.py), [VisionAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/VisionAviary.py), [HoverAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/single_agent_rl/HoverAviary.py), and [FlockAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/multi_agent_rl/FlockAviary.py) all simply clip the inputs to `MAX_RPM`
DynAviary's action input to DynAviary.step() is a Dict() of Box(4,) containing
- The desired thrust along the drone's z-axis
- The desired torque around the drone's x-axis
- The desired torque around the drone's y-axis
- The desired torque around the drone's z-axis
From these, desired RPMs are computed by DynAviary._preprocessAction()
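As an illustration of what such a preprocessing step computes, here is a generic X-configuration mixer that inverts the thrust/torque allocation to recover per-motor RPMs. This is a sketch under assumed Crazyflie-like constants (`kf`, `km`, `arm`), not DynAviary's exact implementation:

```python
import numpy as np

def thrust_torques_to_rpms(thrust, tau_x, tau_y, tau_z,
                           kf=3.16e-10, km=7.94e-12, arm=0.0397):
    """Illustrative X-configuration mixer mapping a desired z-thrust (N) and
    body torques (Nm) to per-motor RPMs, assuming thrust_i = kf * rpm_i**2 and
    drag torque_i = km * rpm_i**2. Constants are Crazyflie-like examples."""
    l = arm / np.sqrt(2.0)  # effective moment arm in an X configuration
    # Allocation matrix: columns are motors, rows are (thrust, tau_x, tau_y, tau_z)
    A = np.array([[ kf,    kf,    kf,    kf  ],
                  [ kf*l,  kf*l, -kf*l, -kf*l],
                  [-kf*l,  kf*l,  kf*l, -kf*l],
                  [-km,    km,   -km,    km  ]])
    rpm_sq = np.linalg.solve(A, np.array([thrust, tau_x, tau_y, tau_z]))
    rpm_sq = np.clip(rpm_sq, 0.0, None)  # motors cannot produce negative thrust
    return np.sqrt(rpm_sq)

# Hover: thrust equal to a ~27g quadrotor's weight, zero torques -> four equal RPMs
print(thrust_torques_to_rpms(0.027 * 9.8, 0.0, 0.0, 0.0))
```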
Observation Spaces Examples
The observation space's definition of an environment must be implemented by every subclass of BaseAviary
```python
def _observationSpace(self):
    ...
```
In [CtrlAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/CtrlAviary.py), it is a [`Dict()`](https://github.com/openai/gym/blob/master/gym/spaces/dict.py) of pairs `{"state": Box(20,), "neighbors": MultiBinary(num_drones)}`
The dictionary's keys are "0", "1", .., "n", where n is the number of drones
Each Box(20,) contains the drone's
- X, Y, Z position in WORLD_FRAME (in meters, 3 values)
- Quaternion orientation in WORLD_FRAME (4 values)
- Roll, pitch and yaw angles in WORLD_FRAME (in radians, 3 values)
- The velocity vector in WORLD_FRAME (in m/s, 3 values)
- Angular velocity in WORLD_FRAME (3 values)
- Motors' speeds (in RPMs, 4 values)
Each MultiBinary(num_drones) contains the drone's own row of the multi-robot system adjacency matrix
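The 20-value layout above can be unpacked by index; `unpack_state` is a hypothetical helper (not part of the repo) that follows the documented ordering:

```python
import numpy as np

def unpack_state(state):
    """Split a drone's 20-value state vector into named components, following
    the ordering documented above: position, quaternion, RPY, linear velocity,
    angular velocity, motor RPMs (3 + 4 + 3 + 3 + 3 + 4 = 20 values)."""
    state = np.asarray(state)
    assert state.shape == (20,)
    return {
        "pos":   state[0:3],    # X, Y, Z in WORLD_FRAME (m)
        "quat":  state[3:7],    # quaternion orientation
        "rpy":   state[7:10],   # roll, pitch, yaw (rad)
        "vel":   state[10:13],  # linear velocity (m/s)
        "ang_v": state[13:16],  # angular velocity
        "rpm":   state[16:20],  # motor speeds (RPM)
    }

s = unpack_state(np.arange(20.0))
print(s["rpm"])  # [16. 17. 18. 19.]
```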
The observation space of VisionAviary is the same as CtrlAviary's, but it also includes keys rgb, dep, and seg (in each drone's dictionary) for the matrices containing the drone's RGB, depth, and segmentation views
To fill/customize the content of obs, every subclass of BaseAviary needs to implement
```python
def _computeObs(self):
    ...
```
See [`BaseAviary._exportImage()`](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/BaseAviary.py) and its use in `VisionAviary._computeObs()` to save frames as PNGs
Obstacles
Objects can be added to an environment using loadURDF (or loadSDF, loadMJCF) in method _addObstacles()
```python
def _addObstacles(self):
    ...
    p.loadURDF("sphere2.urdf",
               [0, 0, 0],
               p.getQuaternionFromEuler([0, 0, 0]),
               physicsClientId=self.CLIENT)
```
Drag, Ground Effect, and Downwash Models
Simple drag, ground effect, and downwash models can be included in the simulation by initializing BaseAviary() with physics=Physics.PYB_GND_DRAG_DW. These are based on the system identification of Forster (2015) (Eq. 4.2), the analytical model used as a baseline for comparison by Shi et al. (2019) (Eq. 15), and DSL's experimental work
Check the implementations of _drag(), _groundEffect(), and _downwash() in BaseAviary for more detail
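For intuition on the ground-effect term, the classic Cheeseman-Bennett approximation predicts the in-ground-effect thrust ratio from rotor radius and altitude; this sketch illustrates the phenomenon and is not necessarily the exact model implemented in `_groundEffect()`:

```python
def ground_effect_ratio(rotor_radius, altitude):
    """Cheeseman-Bennett in-ground-effect thrust ratio:
    T_IGE / T_OGE = 1 / (1 - (R / 4z)^2), valid for altitudes z > R/4.
    The ratio exceeds 1 near the ground and tends to 1 far from it."""
    r = rotor_radius / (4.0 * altitude)
    assert r < 1.0, "model is only valid for altitudes above R/4"
    return 1.0 / (1.0 - r**2)

# Crazyflie-like rotor radius (~23 mm): a few percent extra thrust at 5 cm altitude
print(ground_effect_ratio(0.023, 0.05))
```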
RGB, Depth, and Segmentation Views

PID Control
Folder control contains the implementations of 2 PID controllers
DSLPIDControl (for DroneModel.CF2X/P) and SimplePIDControl (for DroneModel.HB) can be used as
```python
ctrl = [DSLPIDControl(drone_model=DroneModel.CF2X) for i in range(num_drones)]  # Initialize "num_drones" controllers
...
for i in range(num_drones):                                  # Compute control for each drone
    action[str(i)], _, _ = ctrl[i].computeControlFromState(  # Write the action in a dictionary
                                control_timestep=env.TIMESTEP,
                                state=obs[str(i)]["state"],
                                target_pos=TARGET_POS)
```
For high-level coordination, using a *velocity input*, [VelocityAviary](https://github.com/utiasDSL/gym-pybullet-drones/blob/master/gym_pybullet_drones/envs/VelocityAviary.py) integrates PID control within a `gym.Env`.
Method setPIDCoefficients can be used to change the coefficients of one of the given PID controllers and, for example, implement learning problems whose goal is parameter tuning (see TuneAviary).
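As a self-contained illustration of what such a tuning problem searches over, here is a minimal 1-D discrete PID update law (not `DSLPIDControl`'s implementation; the gains and rate below are arbitrary examples):

```python
class SimplePID:
    """Minimal 1-D discrete PID, for illustration only."""

    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error):
        """Return the control output for the current tracking error."""
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

# A tuning problem like TuneAviary's searches over gains such as (kp, ki, kd)
pid = SimplePID(kp=1.2, ki=0.1, kd=0.05, dt=1/48)  # 48Hz, matching the control rate above
u = pid.step(error=0.5)
```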
Logger
Class Logger contains helper functions to save and plot simulation data, as in this example
```python
logger = Logger(logging_freq_hz=freq, num_drones=num_drones)  # Initialize the logger
...
for i in range(num_drones):              # Log information for each drone
    logger.log(drone=i,
               timestamp=K/env.SIM_FREQ,
               state=obs[str(i)]["state"],
               control=np.hstack([TARGET_POS, np.zeros(9)]))
...
logger.save()  # Save data to file
logger.plot()  # Plot data
```
ROS2 Python Wrapper
Workspace ros2 contains two ROS2 Foxy Fitzroy Python nodes
- AviaryWrapper is a wrapper node for a single-drone CtrlAviary environment
- RandomControl reads AviaryWrapper's obs topic and publishes random RPMs on topic action
With ROS2 installed (on either macOS or Ubuntu, edit ros2_and_pkg_setups.(zsh/bash) accordingly), run
$ cd gym-pybullet-drones/ros2/
$ source ros2_and_pkg_setups.zsh # On macOS, on Ubuntu use $ source ros2_and_pkg_setups.bash
$ colcon build --packages-select ros2_gym_pybullet_drones
$ source ros2_and_pkg_setups.zsh # On macOS, on Ubuntu use $ source ros2_and_pkg_setups.bash
$ ros2 run ros2_gym_pybullet_drones aviary_wrapper
In a new terminal, run
$ cd gym-pybullet-drones/ros2/
$ source ros2_and_pkg_setups.zsh # On macOS, on Ubuntu use $ source ros2_and_pkg_setups.bash
$ ros2 run ros2_gym_pybullet_drones random_control
Desiderata/WIP
- Template scripts using PyMARL
- Google Colaboratory example
- Alternative multi-contribution downwash effect
Citation
If you wish, please cite our work (link) as
@INPROCEEDINGS{panerati2021learning,
title={Learning to Fly---a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control},
author={Jacopo Panerati and Hehui Zheng and SiQi Zhou and James Xu and Amanda Prorok and Angela P. Schoellig},
booktitle={2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
year={2021},
volume={},
number={},
pages={},
doi={}
}
References
- Nathan Michael, Daniel Mellinger, Quentin Lindsey, Vijay Kumar (2010) The GRASP Multiple Micro UAV Testbed
- Benoit Landry (2014) Planning and Control for Quadrotor Flight through Cluttered Environments
- Julian Forster (2015) System Identification of the Crazyflie 2.0 Nano Quadrocopter
- Carlos Luis and Jérôme Le Ny (2016) Design of a Trajectory Tracking Controller for a Nanoquadcopter
- Shital Shah, Debadeepta Dey, Chris Lovett, and Ashish Kapoor (2017) AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles
- Eric Liang, Richard Liaw, Philipp Moritz, Robert Nishihara, Roy Fox, Ken Goldberg, Joseph E. Gonzalez, Michael I. Jordan, and Ion Stoica (2018) RLlib: Abstractions for Distributed Reinforcement Learning
- Antonin Raffin, Ashley Hill, Maximilian Ernestus, Adam Gleave, Anssi Kanervisto, and Noah Dormann (2019) Stable Baselines3
- Guanya Shi, Xichen Shi, Michael O'Connell, Rose Yu, Kamyar Azizzadenesheli, Animashree Anandkumar, Yisong Yue, and Soon-Jo Chung (2019) Neural Lander: Stable Drone Landing Control Using Learned Dynamics
- Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob Foerster, and Shimon Whiteson (2019) The StarCraft Multi-Agent Challenge
- C. Karen Liu and Dan Negrut (2020) The Role of Physics-Based Simulators in Robotics
- Yunlong Song, Selim Naji, Elia Kaufmann, Antonio Loquercio, and Davide Scaramuzza (2020) Flightmare: A Flexible Quadrotor Simulator
Bonus GIF for scrolling this far

University of Toronto's Dynamic Systems Lab / Vector Institute / University of Cambridge's Prorok Lab / Mitacs
Owner
- Login: jackyoung96
- Kind: user
- Repositories: 13
- Profile: https://github.com/jackyoung96
Dependencies
- Deprecated ==1.2.13
- Markdown ==3.3.6
- Pillow ==9.0.1
- PyQt5 ==5.15.6
- PyQt5-Qt5 ==5.15.2
- PyQt5-sip ==12.9.1
- PyWavelets ==1.2.0
- PyYAML ==5.4.1
- Werkzeug ==2.0.3
- absl-py ==1.0.0
- appdirs ==1.4.4
- asyncqt ==0.8.0
- attrs ==21.4.0
- cachetools ==5.0.0
- certifi ==2021.10.8
- charset-normalizer ==2.0.12
- click ==8.0.3
- cloudpickle ==1.6.0
- cycler ==0.11.0
- dm-tree ==0.1.6
- filelock ==3.4.2
- fonttools ==4.29.1
- freetype-py ==2.2.0
- google-auth ==2.6.0
- google-auth-oauthlib ==0.4.6
- grpcio ==1.43.0
- gym ==0.19.0
- hsluv ==5.0.2
- idna ==3.3
- imageio ==2.15.0
- importlib-metadata ==4.11.0
- importlib-resources ==5.4.0
- jsonschema ==4.4.0
- kiwisolver ==1.3.2
- libusb-package ==1.0.25.0
- lz4 ==3.1.10
- matplotlib ==3.5.1
- msgpack ==1.0.3
- networkx ==2.6.3
- numpy ==1.21.5
- oauthlib ==3.2.0
- opencv-python-headless ==4.5.5.62
- packaging ==21.3
- pandas ==1.3.5
- protobuf ==3.19.4
- pyasn1 ==0.4.8
- pyasn1-modules ==0.2.8
- pybullet ==3.2.1
- pyparsing ==3.0.7
- pyqtgraph ==0.12.3
- pyrsistent ==0.18.1
- pyserial ==3.5
- python-dateutil ==2.8.2
- pytz ==2021.3
- pyusb ==1.2.1
- pyzmq ==22.3.0
- qtm ==2.1.1
- ray ==1.10.0
- redis ==4.1.3
- requests ==2.27.1
- requests-oauthlib ==1.3.1
- rsa ==4.8
- scikit-image ==0.19.1
- scipy ==1.7.3
- six ==1.16.0
- stable-baselines3 ==1.4.0
- tabulate ==0.8.9
- tensorboard ==2.8.0
- tensorboard-data-server ==0.6.1
- tensorboard-plugin-wit ==1.8.1
- tensorboardX ==2.4.1
- tifffile ==2021.11.2
- torch ==1.10.2
- torchsummary ==1.5.1
- typing_extensions ==4.0.1
- urllib3 ==1.26.8
- vispy ==0.9.6
- wrapt ==1.13.3
- zipp ==3.7.0
- setuptools *
- Pillow *
- cycler *
- gym *
- matplotlib *
- numpy *
- pybullet *
- ray *
- stable_baselines3 *