Skip to content

ActorCritic, DDPG

Compare
Choose a tag to compare
@kengz kengz released this 19 Apr 04:12
· 73 commits to master since this release

New Algorithms

ActorCritic

PR: #118

  • add ActorCritic agent
  • add its policies, Discrete: ArgmaxPolicy, SoftmaxPolicy; Continuous: BoundedPolicy, GaussianPolicy
  • add basic specs, solve Cartpole-v0, Cartpole-v1, yet to solve the others

DDPG

PR: #118

  • add DDPG agent with custom tensorflow ops
  • add its policies (only Continuous now): NoNoisePolicy, LinearNoisePolicy, GaussianWhiteNoisePolicy, OUNoisePolicy
  • add basic specs, solve Pendulum-v0

Improvements/Bug Fixes

PR: #118

  • use logger.warn instead of raise error when component locks are violated
  • fix #114, #115 matplotlib backend setting issue. now single trial will live-plot and render
  • mute DoubleDQN as it breaks; instead revert to the single-model recompile from DQN