ActorCritic and DDPG #118

Merged
merged 49 commits into from
Apr 19, 2017

@lgraesser lgraesser commented Apr 8, 2017

New Algorithms

ActorCritic

  • add ActorCritic agent
  • add its policies: ArgmaxPolicy and SoftmaxPolicy (discrete); BoundedPolicy and GaussianPolicy (continuous)
  • add basic specs; they solve CartPole-v0 and CartPole-v1, the other environments are not solved yet
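For readers unfamiliar with the four policy names above, here is a minimal standard-library sketch of the general ideas behind them. It is not the code in this PR: the function names, signatures, and the reading of "bounded" as clipping to the action range are all assumptions made for illustration.

```python
import math
import random


def argmax_action(scores):
    """Greedy discrete choice: index of the highest score."""
    return max(range(len(scores)), key=lambda i: scores[i])


def softmax_probs(scores):
    """Turn scores into probabilities (shifted by the max for stability)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_action(scores, rng=random):
    """Stochastic discrete choice: sample an index by softmax probability."""
    probs = softmax_probs(scores)
    return rng.choices(range(len(scores)), weights=probs, k=1)[0]


def bounded_action(value, low, high):
    """Continuous choice, ASSUMED here to mean clipping into [low, high]."""
    return min(max(value, low), high)


def gaussian_action(mean, std, rng=random):
    """Continuous choice: sample around the actor's mean output."""
    return rng.gauss(mean, std)


# Hypothetical actor outputs for a two-action task such as CartPole.
scores = [0.2, 1.5]
greedy = argmax_action(scores)    # always picks index 1
sampled = softmax_action(scores)  # picks index 1 most of the time
```

The greedy variant never explores, while the sampled variants trade some reward for exploration; which one suits a given environment is an empirical question.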

DDPG

  • add DDPG agent with custom TensorFlow ops
  • add its policies (continuous only for now): NoNoisePolicy, LinearNoisePolicy, GaussianWhiteNoisePolicy, OUNoisePolicy
  • add basic specs; they solve Pendulum-v0
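The noise policies listed above add exploration noise to a deterministic actor's output. As a rough illustration, the sketch below implements a discretized Ornstein-Uhlenbeck process, the temporally correlated noise commonly paired with DDPG. The class name, parameter defaults, and the clipping helper are assumptions for this example and are not taken from the PR's implementation.

```python
import math
import random


class OUNoise:
    """Discretized Ornstein-Uhlenbeck process (illustrative sketch).

    Update rule: x <- x + theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
    """

    def __init__(self, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, x0=0.0, seed=None):
        self.mu = mu
        self.theta = theta
        self.sigma = sigma
        self.dt = dt
        self.x = x0
        self.rng = random.Random(seed)

    def sample(self):
        """Advance the process one step and return the new noise value."""
        drift = self.theta * (self.mu - self.x) * self.dt
        diffusion = self.sigma * math.sqrt(self.dt) * self.rng.gauss(0.0, 1.0)
        self.x += drift + diffusion
        return self.x


def noisy_action(action, noise, low, high):
    """Add noise to an action, then clip it back into the valid range."""
    return min(max(action + noise, low), high)


# Pendulum-v0 accepts torques in [-2, 2]; 0.3 is a made-up actor output.
noise = OUNoise(seed=0)
action = noisy_action(0.3, noise.sample(), -2.0, 2.0)
```

Because each sample depends on the previous one, successive perturbations push in a similar direction, unlike white Gaussian noise, where every step is independent.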

Improvements/Bug Fixes

  • use logger.warn instead of raising an error when component locks are violated
  • fix the matplotlib backend setting issue (matplotlib backend #114, add OS X matplotlib backend #115); a single trial now live-plots and renders
  • mute DoubleDQN because it breaks; revert to the single-model recompile from DQN instead
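The first item above swaps a hard failure for a logged warning. A minimal sketch of that pattern using the standard `logging` module follows; the function name, arguments, and message format are hypothetical, not the PR's actual lock-checking code. It calls `logger.warning`, since `logger.warn` is a deprecated alias in the standard library.

```python
import logging

logger = logging.getLogger("component_lock_sketch")


def check_component_lock(component, allowed, chosen):
    """Return True if `chosen` is allowed for `component`.

    On a violation, log a warning and return False instead of raising,
    so the experiment keeps running with the questionable combination.
    """
    if chosen in allowed:
        return True
    logger.warning(
        "%s lock violated: %r is not one of %r",
        component, chosen, sorted(allowed),
    )
    return False


# Hypothetical lock: a continuous-action agent paired with a discrete policy.
ok = check_component_lock(
    "policy", {"BoundedPolicy", "GaussianPolicy"}, "ArgmaxPolicy"
)
```

Warning rather than raising lets exploratory runs proceed, at the cost that an incompatible combination can fail later in a less obvious place.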

lgraesser and others added 30 commits April 8, 2017 17:17
@kengz kengz mentioned this pull request Apr 18, 2017
5 tasks
@kengz kengz changed the title from "Working discrete actor critic model" to "ActorCritic and DDPG" Apr 18, 2017
@kengz kengz merged commit 29bd213 into master Apr 19, 2017
Development

Successfully merging this pull request may close these issues.

matplotlib backend
2 participants