ActorCritic, DDPG
New Algorithms
ActorCritic
PR: #118
- add
ActorCritic
agent - add its policies, Discrete:
ArgmaxPolicy, SoftmaxPolicy
; Continuous:BoundedPolicy, GaussianPolicy
- add basic specs, solve
Cartpole-v0
,Cartpole-v1
, yet to solve the others
DDPG
PR: #118
- add
DDPG
agent with custom tensorflow ops - add its policies (only Continuous now):
NoNoisePolicy, LinearNoisePolicy, GaussianWhiteNoisePolicy, OUNoisePolicy
- add basic specs, solve
Pendulum-v0
Improvements/Bug Fixes
PR: #118