Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.9%) to scientific vocabulary
Last synced: 8 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: roger-creus
  • Language: Python
  • Default Branch: main
  • Size: 163 MB
Statistics
  • Stars: 7
  • Watchers: 2
  • Forks: 3
  • Open Issues: 0
  • Releases: 0
Created about 3 years ago · Last pushed almost 3 years ago
Metadata Files
  • Readme
  • Citation

README.md

Centralized control for multi-agent RL in a complex Real-Time-Strategy game

See the full article here

This repository contains the source code for the project "Centralized control for multi-agent RL in a complex Real-Time-Strategy game", which was submitted as the final project for the COMP579 - Reinforcement Learning course at McGill University, taught by Prof. Doina Precup in Winter 2023.

→ The main scripts are fully commented to make the code easy to follow. The PDF report and the code are presented in the following sections.

→ The full report of the project is available here.

→ The Weights & Biases logs of our experiments are available here.

Figures: PPO in Lux, during training.

Running the code

→ The two main scripts are src/envs_folder/custom_env.py (~1000 lines) and src/ppo_res_gridnet_multigpu.py (~900 lines).

→ The repository contains many variations of the gridnet scripts, but the simplest and fully commented one is src/ppo_res_gridnet_multigpu.py.

To train our gridnet in Lux:

1) Clone this repository

2) Install the requirements

3) Train Gridnet (example uses 1 GPU and 1 process)

cd src
torchrun --standalone --nproc_per_node 1 ppo_res_gridnet_multigpu.py --device-ids 0

The best agent was trained with the best parameters found in the hyperparameter sweep, using 16 processes across 8 GPUs:

```
torchrun --standalone --nproc_per_node 16 ppo_pixel_gridnet_multigpu.py --total-timesteps 1000000000 \
  --clip-coef=0.14334778465053272 --ent-coef=0.002408486638907176 --gae-lambda=0.9322312137190516 \
  --gamma=0.9945973988514306 --learning-rate=0.0016166261475302418 --max-grad-norm=0.28978755223510055 \
  --minibatch-size=128 --num-envs=256 --num-steps=64 --pool-size=5 --save-every=50 --update-epochs=7 \
  --vf-coef=0.2734614814048212 --device-ids 0 0 1 1 2 2 3 3 4 4 5 5 6 6 7 7
```

Video Presentation


Description

In this project we implement an RL agent to compete in the Lux AI v2 Kaggle competition. Lux is a 1-vs-1 real-time-strategy game in which players compete for resources and grow lichen on Mars. Lux is a multi-agent environment because each player controls a variable-sized fleet of units of different types (e.g. light robots, heavy robots, and factories). The full specifications of the Lux environment are available here.

Our approach

We propose a pixel-to-pixel architecture that we train with Proximal Policy Optimization (PPO). The encoder is a stack of Residual Blocks with Squeeze-and-Excitation layers and ReLU activations, and the decoders are stacks of Transposed Convolutions with ReLU activations. The critic uses an AveragePool layer followed by 2 fully connected layers with a ReLU activation.
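For a rough picture of what such an architecture looks like, here is a minimal PyTorch sketch (illustrative only: the module names SEResidualBlock and GridnetActorCritic, the channel counts, the number of blocks, and the per-cell action dimension are assumptions, not the repository's actual values; see src/ppo_res_gridnet_multigpu.py for the real implementation):

```python
# Minimal sketch of a pixel-to-pixel actor-critic (illustrative assumptions only).
import torch
import torch.nn as nn

class SEResidualBlock(nn.Module):
    """Residual block with a Squeeze-and-Excitation gate."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)
        self.se = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),                                # squeeze: global spatial average
            nn.Conv2d(channels, channels // reduction, 1), nn.ReLU(),
            nn.Conv2d(channels // reduction, channels, 1), nn.Sigmoid(),
        )

    def forward(self, x):
        h = torch.relu(self.conv1(x))
        h = self.conv2(h)
        h = h * self.se(h)                                          # excitation: channel-wise re-weighting
        return torch.relu(x + h)

class GridnetActorCritic(nn.Module):
    def __init__(self, in_channels=30, hidden=64, actions_per_cell=12):
        super().__init__()
        # Encoder: stack of SE residual blocks over the map grid.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, hidden, 3, padding=1), nn.ReLU(),
            *[SEResidualBlock(hidden) for _ in range(4)],
        )
        # Decoder: transposed convolutions producing per-cell action logits (gridnet).
        self.actor = nn.Sequential(
            nn.ConvTranspose2d(hidden, hidden, 3, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(hidden, actions_per_cell, 3, padding=1),
        )
        # Critic: average-pool the encoding, then two fully connected layers.
        self.critic = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(hidden, 64), nn.ReLU(), nn.Linear(64, 1),
        )

    def forward(self, obs):
        z = self.encoder(obs)
        return self.actor(z), self.critic(z)                        # per-cell logits, state value

# Example: a batch of 48x48 Lux-like observations with 30 feature planes.
model = GridnetActorCritic()
logits, value = model(torch.randn(2, 30, 48, 48))
print(logits.shape, value.shape)  # torch.Size([2, 12, 48, 48]) torch.Size([2, 1])
```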

The centralized agent

Results

Training curves for the final configuration

Citation

If you use this code, please cite it as below:

@article{castanyer2023centralized,
  title={Centralized control for multi-agent RL in a complex Real-Time-Strategy game},
  author={Castanyer, Roger Creus},
  journal={arXiv preprint arXiv:2304.13004},
  year={2023}
}

Owner

  • Name: Roger Creus
  • Login: roger-creus
  • Kind: user
  • Location: Montréal, Québec, Canada.
  • Company: Mila Québec

Research MSc @mila-iqia @montrealrobotics. Deep Reinforcement Learning

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this code, please cite it as below."
authors:
- family-names: "Creus Castanyer"
  given-names: "Roger"
  orcid: "https://orcid.org/0000-0003-1952-3357"
title: "Centralized Control for Multi-Agent RL in a complex Real-Time-Strategy game"
version: 1.0.0
date-released: 2023-04-28
url: "https://github.com/roger-creus/lux-ai-rl"

GitHub Events

Total
  • Watch event: 1
  • Fork event: 2
Last Year
  • Watch event: 1
  • Fork event: 2

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0