gym-marl-reconnaissance

Gym environment for cooperative multi-agent reinforcement learning in heterogeneous robot teams

https://github.com/jacopopan/gym-marl-reconnaissance

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓ CITATION.cff file: Found CITATION.cff file
✓ codemeta.json file: Found codemeta.json file
✓ .zenodo.json file: Found .zenodo.json file
○ DOI references
○ Academic publication links
✓ Committers with academic emails: 1 of 1 committers (100.0%) from academic institutions
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (12.6%) to scientific vocabulary

Keywords

coordination gym heterogeneous multi-agent multi-robot non-stationary pybullet reinforcement-learning

Last synced: 6 months ago

Repository

Basic Info

Host: GitHub
Owner: JacopoPan
License: mit
Language: Python
Default Branch: main
Homepage:
Size: 24 MB

Statistics

Stars: 46
Watchers: 3
Forks: 4
Open Issues: 0
Releases: 0

Topics

coordination gym heterogeneous multi-agent multi-robot non-stationary pybullet reinforcement-learning

Created over 4 years ago · Last pushed about 4 years ago

Metadata Files

Readme License Citation

gym-marl-reconnaissance

Gym environments for heterogeneous multi-agent reinforcement learning in non-stationary worlds

This repository's master branch is a work in progress; please git pull frequently and feel free to open new issues for any undesired, unexpected, or (presumably) incorrect behavior. Thanks 🙏

Also see how to programmatically control real RoboMaster hardware (S1 UGV, Tello Talent UAV) in Python here

Install on Ubuntu/macOS

(optional) Create and access a Python 3.7 environment using conda

$ conda create -n recon python=3.7  # Create environment (named 'recon' here)
$ conda activate recon  # Activate environment 'recon'

Clone and install the gym-marl-reconnaissance repository

$ git clone https://github.com/JacopoPan/gym-marl-reconnaissance  # Clone repository
$ cd gym-marl-reconnaissance  # Enter the repository
$ pip install -e .  # Install the repository

Configure

Set the parameters of the simulation environment

seed: -1
ctrl_freq: 2
pyb_freq: 30
gui: False
record: False
episode_length_sec: 30
action_type: 'task_assignment'  # Alternatively, 'tracking'
obs_type: 'global'
reward_choice: 'reward_c'
adv_type: 'avoidant'  # Alternatively, 'blind'
visibility_threshold: 12
setup:
  edge: 10
  obstacles: 0
  tt: 1
  s1: 1
  adv: 2
  neu: 1
debug: False
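To get a feel for what these parameters imply, the following minimal sketch derives a few quantities from the values listed above. It is not code from the repository. It assumes that ctrl_freq and pyb_freq are rates in Hz (agent decisions and PyBullet physics steps per second, respectively), that pyb_freq should be an integer multiple of ctrl_freq, and that tt and s1 count the controlled Tello Talent UAVs and S1 UGVs. The helper name episode_budget is hypothetical.

```python
# Values copied from the configuration above; only the keys used here are included.
config = {
    "ctrl_freq": 2,            # ASSUMPTION: agent decision steps per second (Hz)
    "pyb_freq": 30,            # ASSUMPTION: PyBullet physics steps per second (Hz)
    "episode_length_sec": 30,
    "setup": {"edge": 10, "obstacles": 0, "tt": 1, "s1": 1, "adv": 2, "neu": 1},
}


def episode_budget(cfg):
    """Hypothetical helper: return (control steps per episode, physics steps per control step)."""
    # ASSUMPTION: the physics rate must be an integer multiple of the control rate.
    if cfg["pyb_freq"] % cfg["ctrl_freq"] != 0:
        raise ValueError("pyb_freq should be an integer multiple of ctrl_freq")
    control_steps = cfg["ctrl_freq"] * cfg["episode_length_sec"]
    physics_per_control = cfg["pyb_freq"] // cfg["ctrl_freq"]
    return control_steps, physics_per_control


control_steps, physics_per_control = episode_budget(config)

# ASSUMPTION: 'tt' and 's1' are the controlled robots (Tello Talent UAVs, S1 UGVs).
team_size = config["setup"]["tt"] + config["setup"]["s1"]

print(control_steps, physics_per_control, team_size)  # prints: 60 15 2
```

Under these assumptions, a 30-second episode gives each agent 60 decisions, with 15 physics steps simulated between consecutive decisions.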

Use

Step an environment with random action inputs

$ python3 ./experiments/debug.py --random True

Step an environment with a greedy policy (only for task_assignment)

$ python3 ./experiments/debug.py

Learn using stable-baselines3

$ python3 ./experiments/train.py --algo <a2c | ppo> --yaml <filename in ./experiments/configurations/>

Replay a trained agent

$ python3 ./experiments/test.py --exp ./results/exp--<algo>--<config>--<date>_<time>
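The replay command above points at a results folder named exp--<algo>--<config>--<date>_<time>. When comparing several runs, it can help to split such a name back into its fields. The sketch below does this with the standard library only. It is not part of the repository; the function name parse_experiment_dir is hypothetical, and it assumes that neither the algorithm nor the configuration name contains a double hyphen. The exact date and time formats are not stated in the README, so both are kept as opaque strings.

```python
from pathlib import PurePosixPath


def parse_experiment_dir(path):
    """Hypothetical helper: split 'exp--<algo>--<config>--<date>_<time>' into its fields."""
    name = PurePosixPath(path).name  # last path component, e.g. 'exp--ppo--...'
    parts = name.split("--")
    # ASSUMPTION: algo and config names never contain '--' themselves.
    if len(parts) != 4 or parts[0] != "exp":
        raise ValueError(f"unexpected experiment folder name: {name!r}")
    _, algo, config, stamp = parts
    # The date/time formats are unknown here, so split only on the first underscore.
    date, _, time = stamp.partition("_")
    return {"algo": algo, "config": config, "date": date, "time": time}


# Placeholder folder name for illustration only; not a real result from the repository.
info = parse_experiment_dir("./results/exp--ppo--example_config--DATE_TIME")
print(info["algo"], info["config"])  # prints: ppo example_config
```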

Results

Task assignment (1 UAV and 1 UGV vs 2 targets and 1 neutral)

Tracking (1 UAV or 1 UGV vs 1 target, with or without 1 neutral)

University of Toronto's Dynamic Systems Lab / Vector Institute / Mitacs

Owner

Name: Jacopo Panerati
Login: JacopoPan
Kind: user

Website: jacopopanerati.github.io
Repositories: 1
Profile: https://github.com/JacopoPan

Went to the desert—formerly @utiasDSL, @VectorInstitute, @proroklab, @MISTLab

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Panerati"
  given-names: "Jacopo"
  orcid: "https://orcid.org/0000-0003-2994-5422"
title: "gym-marl-reconnaissance"
version: 0.0.1
doi: 10.5281/zenodo.1234
date-released: 2021-09-04
url: "https://github.com/JacopoPan/gym-marl-reconnaissance"

GitHub Events

Total

Watch event: 7

Last Year

Watch event: 7

Committers

Last synced: 8 months ago

All Time

Total Commits: 26
Total Committers: 1
Avg Commits per committer: 26.0
Development Distribution Score (DDS): 0.0

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
Jacopo Panerati	j**i@u**a	26

Committer Domains (Top 20 + Academic)

utoronto.ca: 1

Issues and Pull Requests

Last synced: 8 months ago

All Time

Total issues: 3
Total pull requests: 0
Average time to close issues: 8 days
Average time to close pull requests: N/A
Total issue authors: 2
Total pull request authors: 0
Average comments per issue: 1.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0
