Pytorch implementation of the Persistent Advantage reinforcement learning operator proposed in paper 'Increasing the Action Gap: New Operators for Reinforcement Learning'
reinforcement-learning atari2600 deep-reinforcement-learning dqn persistent-advantage-learning advantage-learning al-algorithm pal-algorithm
-
Updated
Dec 8, 2018 - Python