Fix Boltzmann, refactor RENDER

@kengz released this 05 Apr 12:36
· 160 commits to master since this release

Bug Fixes

BoltzmannPolicy

PR: #109

  • fix state reshaping for states with more than one dimension by using np.expand_dims
  • guard against underflow by applying np.clip before np.exp
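
The two fixes above can be illustrated with a minimal sketch. This is not the code from the PR: the function names `batch_state` and `boltzmann_probs`, the temperature parameter `tau`, and the clip bounds of ±500 are all assumptions chosen for illustration.

```python
import numpy as np


def batch_state(state):
    """Add a leading batch axis to a state of any dimensionality.

    np.expand_dims keeps the original shape intact, so a (4, 3) state
    becomes (1, 4, 3) instead of being flattened.
    """
    return np.expand_dims(np.asarray(state), axis=0)


def boltzmann_probs(q_values, tau=1.0, clip_min=-500.0, clip_max=500.0):
    """Turn Q-values into action probabilities with a Boltzmann distribution.

    The clip bounds are hypothetical; they only need to keep np.exp
    within the representable float64 range.
    """
    q = np.asarray(q_values, dtype=np.float64)
    # Clip the scaled Q-values first so np.exp stays finite and nonzero.
    scaled = np.clip(q / tau, clip_min, clip_max)
    exp_q = np.exp(scaled)
    return exp_q / exp_q.sum()


if __name__ == "__main__":
    print(batch_state(np.zeros((4, 3))).shape)
    print(boltzmann_probs([1.0, 2.0, 3.0]))
```

Without the clip, extreme Q-values can push np.exp to 0 or inf, which makes the normalization step produce nan probabilities.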

Misc

  • rename class DoubleDQNPolicy to DoubleDQNEpsilonGreedyPolicy for clarity
  • move the unneeded RENDER key out of rl/spec/problems.json and into rl/experiment.py