Atari OnPolicy Memory #164

lgraesser · 2018-09-07T07:50:29Z

Adds OnPolicyAtariReplay memory so that policy based algorithms can be applied to the Atari suite.

Also fixes n-step returns

lgraesser added 4 commits September 7, 2018 00:02

Removing unnecessary if

f9698f7

Changing reward clipping

bd236db

OnPolicyAtariReplay memory

78e3698

Todo to fix for new memory

cc5978c

lgraesser and others added 5 commits September 8, 2018 14:59

Merge branch 'master' into ppo-fix

0706e9e

Ensure last_state has the right shape

ea4bdb5

Fix V shift for nstep returns

17d798a

Update ppo beamrider spec

b229ad4

Syntax fix

c1288dc

kengz merged commit 5626385 into master Sep 8, 2018

kengz deleted the ppo-fix branch September 8, 2018 23:37

Provide feedback