Add Learning Rate Annealing to PPO #22

awjuliani · 2017-09-21T16:38:34Z

Current implementation of PPO uses fixed learning rate for duration of training process. This can produce degenerate models later in training, when a smaller learning rate is necessary.

Learning rate should be annealed over time to 0.

awjuliani · 2017-09-22T16:23:14Z

Addressed in 77b04d1

lock · 2020-01-05T00:29:34Z

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

awjuliani added the enhancement label Sep 21, 2017

awjuliani self-assigned this Sep 21, 2017

awjuliani closed this as completed Sep 22, 2017

lock bot locked as resolved and limited conversation to collaborators Jan 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Learning Rate Annealing to PPO #22

Add Learning Rate Annealing to PPO #22

awjuliani commented Sep 21, 2017

awjuliani commented Sep 22, 2017

lock bot commented Jan 5, 2020

Add Learning Rate Annealing to PPO #22

Add Learning Rate Annealing to PPO #22

Comments

awjuliani commented Sep 21, 2017

awjuliani commented Sep 22, 2017

lock bot commented Jan 5, 2020