Skip to content

Fix PER breakage

Compare
Choose a tag to compare
@kengz kengz released this 04 Apr 12:41
· 173 commits to master since this release

Bug Fixes

PER

PR: #108

  • fix PER breakage on negative error = reward by adding a bump min_priority = abs(10 * SOLVED_MEAN_REWARD)
  • add a positive min_priority for all problems since they may have negative rewards. We cannot do error = abs(reward) because it is sign sensitive for priority calculation
  • add assert guard to ensure priority is not nan