gym-marl-reconnaissance
Gym environment for cooperative multi-agent reinforcement learning in heterogeneous robot teams
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
1 of 1 committers (100.0%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.6%) to scientific vocabulary
Keywords
Repository
Gym environment for cooperative multi-agent reinforcement learning in heterogeneous robot teams
Basic Info
Statistics
- Stars: 46
- Watchers: 3
- Forks: 4
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
gym-marl-reconnaissance
Gym environments for heterogeneous multi-agent reinforcement learning in non-stationary worlds
This repository's
masterbranch is work in progress, pleasegit pullfrequently and feel free to open new issues for any undesired, unexpected, or (presumably) incorrect behavior. Thanks 🙏Also see how to programmatically control real RoboMaster hardware (S1 UGV, Tello Talent UAV) in Python here

Install on Ubuntu/macOS
(optional) Create and access a Python 3.7 environment using conda
$ conda create -n recon python=3.7 # Create environment (named 'recon' here)
$ conda activate recon # Activate environment 'recon'
Clone and install the gym-marl-reconnaissance repository
$ git clone https://github.com/JacopoPan/gym-marl-reconnaissance # Clone repository
$ cd gym-marl-reconnaissance # Enter the repository
$ pip install -e . # Install the repository
Configure
Set the parameters of the simulation environment
seed: -1
ctrl_freq: 2
pyb_freq: 30
gui: False
record: False
episode_length_sec: 30
action_type: 'task_assignment' # Alternatively, 'tracking'
obs_type: 'global'
reward_choice: 'reward_c'
adv_type: 'avoidant' # Alternatively, 'blind'
visibility_threshold: 12
setup:
edge: 10
obstacles: 0
tt: 1
s1: 1
adv: 2
neu: 1
debug: False

Use
Step an environment with random action inputs
$ python3 ./experiments/debug.py --random True
Step an environment with a greedy policy (only for task_assignment)
$ python3 ./experiments/debug.py
Learn using stable-baselines3
$ python3 ./experiments/train.py --algo <a2c | ppo> --yaml <filname in ./experiments/configurations/>
Replay a trained agent
$ python3 ./experiments/test.py --exp ./results/exp--<algo>--<config>--<date>_<time>
Results
Task assignment (1 UAV and 1 UGV vs 2 targets and 1 neutral)

Tracking (1 UAV or 1 UGV vs 1 target, with or without 1 neutral)
![]()
University of Toronto's Dynamic Systems Lab / Vector Institute / Mitacs
Owner
- Name: Jacopo Panerati
- Login: JacopoPan
- Kind: user
- Website: jacopopanerati.github.io
- Repositories: 1
- Profile: https://github.com/JacopoPan
Went to the desert—formerly @utiasDSL, @VectorInstitute, @proroklab, @MISTLab
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Panerati" given-names: "Jacopo" orcid: "https://orcid.org/0000-0003-2994-5422" title: "gym-marl-reconnaissance" version: 0.0.1 doi: 10.5281/zenodo.1234 date-released: 2021-09-04 url: "https://github.com/JacopoPan/gym-marl-reconnaissance"
GitHub Events
Total
- Watch event: 7
Last Year
- Watch event: 7
Committers
Last synced: 8 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Jacopo Panerati | j****i@u****a | 26 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 3
- Total pull requests: 0
- Average time to close issues: 8 days
- Average time to close pull requests: N/A
- Total issue authors: 2
- Total pull request authors: 0
- Average comments per issue: 1.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- Tie-1 (1)
- hit-JB (1)