https://github.com/cptanalatriste/banana-hunter
A deep-reinforcement learning agent that loves bananas.
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: nature.com -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.1%) to scientific vocabulary
Keywords
Repository
A deep-reinforcement learning agent that loves bananas.
Basic Info
- Host: GitHub
- Owner: cptanalatriste
- Language: Jupyter Notebook
- Default Branch: master
- Size: 325 KB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
banana-hunter
A deep-reinforcement learning agent that loves bananas, trained using Deep Q-Networks (DQN).
Project Details:
Banana-Hunter is a deep-reinforcement learning agent designed for the Banana Collectors environment from the Unity ML-Agents Toolkit.

The state is represented via a vector of 37 elements, corresponding to the agent's perception of the objects (i.e. bananas) around him. Our agent has four possible actions:
- Move forward, represented by 0.
- Move backwards, represented by 1.
- Turn left, represented by 2.
- Turn right, represented by 3.
We considered our agent has mastered the task when he reached an average score of 13, over 100 episodes.
Getting Started
Before running your agent, be sure to accomplish this first: 1. Clone this repository. 1. Download the banana collector environment appropriate to your operating system (available here). 1. Place the environment file in the cloned repository folder. 1. Setup an appropriate Python environment. Instructions available here.
Instructions
You can start running and training the agent by exploring Navigation.ipynb. Also available in the repository:
banana_hunter.pycontains the agent code.banana_manager.pyhas the code for training the agent.
Owner
- Name: Carlos Gavidia-Calderon
- Login: cptanalatriste
- Kind: user
- Location: London, United Kingdom
- Company: @alan-turing-institute
- Website: https://carlos.gavidia.me/
- Twitter: cptan_alatriste
- Repositories: 74
- Profile: https://github.com/cptanalatriste
Systems engineer by training, software developer by trade. Research Software Engineer at @alan-turing-institute .
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: 12 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0