This is a tensorflow implementation of proximal policy optimization (PPO) algorithm for continuous action
- python 3.5
- tensorflow 1.1.0
- openAI
For Training Run:
$ python3 trainer.py
For Demo Run:
$ python3 play.py
Reference Project
This is a tensorflow implementation of proximal policy optimization (PPO) algorithm for continuous action
For Training Run:
$ python3 trainer.py
For Demo Run:
$ python3 play.py
Reference Project