Updated 6 months ago

https://github.com/araffin/pytorch_agents • Science 10%

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR). Python2 compatible (branch python2)