Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Atari OnPolicy Memory #164

Merged
merged 9 commits into from
Sep 8, 2018
Merged

Atari OnPolicy Memory #164

merged 9 commits into from
Sep 8, 2018

Conversation

lgraesser
Copy link
Collaborator

@lgraesser lgraesser commented Sep 7, 2018

Adds OnPolicyAtariReplay memory so that policy based algorithms can be applied to the Atari suite.

  • Fixed crashing ppo_conv_shared_beamrider

Also fixes n-step returns

  • V(s) was always being applied for V(s_t+1), now V(s_t+n) as it should be

@kengz kengz merged commit 5626385 into master Sep 8, 2018
@kengz kengz deleted the ppo-fix branch September 8, 2018 23:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants