https://github.com/cptanalatriste/horna-cuevas-rally
Two deep-reinforcement learning agents that play tennis.
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.5%) to scientific vocabulary
Keywords
Repository
Two deep-reinforcement learning agents that play tennis.
Basic Info
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
horna-cuevas-rally
Two deep-reinforcement learning agents that play tennis, trained using the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm.
Project Details:
Horna and Cuevas are deep-reinforcement learning agents designed for a tailored version of the Tennis environment, from the Unity ML-Agents Toolkit.

Each agent perceives a state represented via a vector of 24 elements. This vector contains information of the position and velocity of the ball and their racket. Their actions are composed by vectors of 2 real-valued elements between -1 and 1. These values represent horizontal and vertical movements.
Each agent is rewarded with +0.1 points every time they pass the ball over the net. If the ball does not cross the net or falls outside the court, the offending agent is penalised with -0.01 points. We consider the environment solved when the maximum score among both agents reaches 0.5 on average, over 100 episodes.
Getting Started
Before running the agents, be sure to accomplish this first:
- Clone this repository.
- Download the Tennis environment appropriate to your operating system (available here ).
- Place the environment file in the cloned repository folder.
- Setup an appropriate Python environment. Instructions available here.
Instructions
You can start running and training the agents by exploring Tennis.ipynb. Also available in the repository:
tennis_agent.pycontains the agents' code.tennis_manager.pyhas the code for training the agents.
Owner
- Name: Carlos Gavidia-Calderon
- Login: cptanalatriste
- Kind: user
- Location: London, United Kingdom
- Company: @alan-turing-institute
- Website: https://carlos.gavidia.me/
- Twitter: cptan_alatriste
- Repositories: 74
- Profile: https://github.com/cptanalatriste
Systems engineer by training, software developer by trade. Research Software Engineer at @alan-turing-institute .
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0