[RLlib] Duplicate custom metrics #24731
Labels
enhancement
Request for new feature and/or capability
P2
Important issue, but not time-critical
rllib
RLlib related issues
rllib-logging
This problem is related to logging metrics
What happened + What you expected to happen
What happened?
I constructed my custom callbacks to collect some metrics, run my experimnt with
tune
and found them twice in TensorBoard. The first time undertune/custom_metrics
and the second undertune/sampler_results/custom_metrics
.The reason for this behavior is that in the
Trainer
class theresults
dictionary gets provided with the custom metrics in_compile_step_results()
twice:results["custom_metrics"]
summarize_episodes()
assummarize_episodes()
collects them here.This somehow blows up the TensorBoard metrics (many pages) and might also use more disk space then needed.
What did you expect to happen?
I expected to find my custom metrics only once. ither in
tune/custom_metrics
or intune/sampler_results/custom_metrics
.Easy solution
An easy solution would probably be to remove the line for collecting the
custom_metrics
Versions / Dependencies
Linux Fedora 35
Python 3.9.0
ray dev2.0.0
Reproduction script
Issue Severity
Low: It annoys or frustrates me.
The text was updated successfully, but these errors were encountered: