[Bug]: evaluate_policy called multiple times for vectorized environments #1912
Labels
documentation
Improvements or additions to documentation
help wanted
Help from contributors is welcomed
🐛 Bug
When calling `evaluate_policy` with a callback and a vecenv, the callback gets executed separately for each of the environments. However, the locals dict contains the aggregated results for all environments. You therefore have to check manually which environment the callback was called for, or only execute it on every `n_envs`-th call.
To Reproduce
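A minimal sketch of the workaround described above, not the original reproduction. It assumes the locals dict passed to the callback exposes the current environment index as `i` and the batched rewards as `rewards`; those key names, and the `fake_vec_step_loop` helper standing in for the evaluation loop, are assumptions for illustration only.

```python
from typing import Any, Callable, Dict, List, Sequence, Tuple

Callback = Callable[[Dict[str, Any], Dict[str, Any]], None]


def make_per_env_callback(records: List[Tuple[int, float]]) -> Callback:
    """Build a callback that records only the entry of the env it was called for."""

    def callback(locals_: Dict[str, Any], globals_: Dict[str, Any]) -> None:
        # Assumption: "i" is the index of the env currently being processed,
        # and "rewards" holds one value per env (the aggregated results).
        i = locals_["i"]
        records.append((i, float(locals_["rewards"][i])))

    return callback


def fake_vec_step_loop(batched_rewards: Sequence[Sequence[float]], callback: Callback) -> None:
    """Hypothetical stand-in for the evaluation loop: one callback call per env per step."""
    for rewards in batched_rewards:
        n_envs = len(rewards)
        for i in range(n_envs):
            # The callback sees the whole batch in locals(), plus the index i.
            callback(locals(), globals())


records: List[Tuple[int, float]] = []
fake_vec_step_loop([[1.0, 2.0], [3.0, 4.0]], make_per_env_callback(records))
print(records)
```

With a real vecenv, the same callback would be passed as `evaluate_policy(model, env, callback=...)`, provided the locals dict actually contains those keys in the installed version.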
Relevant log output / Error message
No response
System Info
Checklist