File tree 2 files changed +3
-1
lines changed
2 files changed +3
-1
lines changed Original file line number Diff line number Diff line change @@ -18,6 +18,7 @@ New Features:
18
18
Bug Fixes:
19
19
^^^^^^^^^^
20
20
- Fixed a bug where the environment was reset twice when using ``evaluate_policy ``
21
+ - Fix logging of ``clip_fraction `` in PPO (@diditforlulz273)
21
22
22
23
Deprecations:
23
24
^^^^^^^^^^^^^
@@ -398,3 +399,4 @@ And all the contributors:
398
399
@MarvineGothic @jdossgollin @SyllogismRXS @rusu24edward @jbulow @Antymon @seheevic @justinkterry @edbeeching
399
400
@flodorner @KuKuXia @NeoExtended @PartiallyTyped @mmcenta @richardwu @kinalmehta @rolandgvc @tkelestemur @mloo3
400
401
@tirafesi @blurLake @koulakis @joeljosephjin @shwang @rk37 @andyshih12 @RaphaelWag @xicocaio
402
+ @diditforlulz273
Original file line number Diff line number Diff line change @@ -228,7 +228,7 @@ def train(self) -> None:
228
228
logger .record ("train/policy_gradient_loss" , np .mean (pg_losses ))
229
229
logger .record ("train/value_loss" , np .mean (value_losses ))
230
230
logger .record ("train/approx_kl" , np .mean (approx_kl_divs ))
231
- logger .record ("train/clip_fraction" , np .mean (clip_fraction ))
231
+ logger .record ("train/clip_fraction" , np .mean (clip_fractions ))
232
232
logger .record ("train/loss" , loss .item ())
233
233
logger .record ("train/explained_variance" , explained_var )
234
234
if hasattr (self .policy , "log_std" ):
You can’t perform that action at this time.
0 commit comments