https://github.com/chychen/agents
Efficient Batched Reinforcement Learning in TensorFlow
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.5%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
Efficient Batched Reinforcement Learning in TensorFlow
Basic Info
Statistics
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
- Releases: 0
Fork of google-research/batch-ppo
Created about 8 years ago
· Last pushed about 8 years ago
https://github.com/chychen/agents/blob/master/
TensorFlow Agents ================= This project provides optimized infrastructure for reinforcement learning. It extends the [OpenAI gym interface][post-gym] to multiple parallel environments and allows agents to be implemented in TensorFlow and perform batched computation. As a starting point, we provide BatchPPO, an optimized implementation of [Proximal Policy Optimization][post-ppo]. Please cite the [TensorFlow Agents paper][paper-agents] if you use code from this project in your research: ```bibtex @article{hafner2017agents, title={TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow}, author={Hafner, Danijar and Davidson, James and Vanhoucke, Vincent}, journal={arXiv preprint arXiv:1709.02878}, year={2017} } ``` Dependencies: Python 2/3, TensorFlow 1.3+, Gym, ruamel.yaml [paper-agents]: https://arxiv.org/pdf/1709.02878.pdf [post-gym]: https://blog.openai.com/openai-gym-beta/ [post-ppo]: https://blog.openai.com/openai-baselines-ppo/ Instructions ------------ Clone the repository and run the PPO algorithm by typing: ```shell python3 -m agents.scripts.train --logdir=/path/to/logdir --config=pendulum ``` The algorithm to use is defined in the configuration and `pendulum` started here uses the included PPO implementation. Check out more pre-defined configurations in `agents/scripts/configs.py`. If you want to resume a previously started run, add the `--timestamp=
Owner
- Name: Jay Chen
- Login: chychen
- Kind: user
- Location: Taiwan
- Repositories: 3
- Profile: https://github.com/chychen
TensorFlow Agents
=================
This project provides optimized infrastructure for reinforcement learning. It
extends the [OpenAI gym interface][post-gym] to multiple parallel environments
and allows agents to be implemented in TensorFlow and perform batched
computation. As a starting point, we provide BatchPPO, an optimized
implementation of [Proximal Policy Optimization][post-ppo].
Please cite the [TensorFlow Agents paper][paper-agents] if you use code from
this project in your research:
```bibtex
@article{hafner2017agents,
title={TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow},
author={Hafner, Danijar and Davidson, James and Vanhoucke, Vincent},
journal={arXiv preprint arXiv:1709.02878},
year={2017}
}
```
Dependencies: Python 2/3, TensorFlow 1.3+, Gym, ruamel.yaml
[paper-agents]: https://arxiv.org/pdf/1709.02878.pdf
[post-gym]: https://blog.openai.com/openai-gym-beta/
[post-ppo]: https://blog.openai.com/openai-baselines-ppo/
Instructions
------------
Clone the repository and run the PPO algorithm by typing:
```shell
python3 -m agents.scripts.train --logdir=/path/to/logdir --config=pendulum
```
The algorithm to use is defined in the configuration and `pendulum` started
here uses the included PPO implementation. Check out more pre-defined
configurations in `agents/scripts/configs.py`.
If you want to resume a previously started run, add the `--timestamp=