[Bug Report] TensorBoard logging issues with `rl_games` library

### Problem

I want to use TensorBoard to log rewards and other metrics while training a custom task with the `rl_games` library. The task involves a UR5e robot arm with a processing tool mounted on its flange, which must reach multiple waypoints. I copied [rl_games_ppo_cfg.yaml](https://github.com/isaac-sim/IsaacLab/blob/main/source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/reach/config/ur_10/agents/rl_games_ppo_cfg.yaml) from the [Isaac-Reach-UR10-v0](https://github.com/isaac-sim/IsaacLab/blob/main/source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/reach/config/ur_10/joint_pos_env_cfg.py) task. Training runs successfully, but the metrics are logged at an unexpected frequency when using the `rl_games` PPO configuration. The same problem does not occur with the `rsl_rl` or `skrl` libraries.

### Description

These are the logs from one training run with `rl_games` PPO configuration:

<img width="1462" height="766" alt="Image" src="https://github.com/user-attachments/assets/eac8f1f7-0018-4064-94ce-8709e2f563dd" />

I changed the following parameters to `save_best_after: 10` and `save_frequency: 10` (the linked file sets them to `save_best_after: 200` and `save_frequency: 100`):

https://github.com/isaac-sim/IsaacLab/blob/75b67154b96ab839a8ee9352b19115719e8cfb84/source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/reach/config/ur_10/agents/rl_games_ppo_cfg.yaml#L70-L71

For some reason, the logged metrics change only periodically, at epoch **313**. The terminal output around that change is shown below. The best reward appears to remain constant until epoch **313** is reached:

<img width="856" height="713" alt="Image" src="https://github.com/user-attachments/assets/42446a3e-88ba-40ac-810f-459d0fb90287" />

The following video also shows this change, at timestep **626**:

https://github.com/user-attachments/assets/64c9d19a-a19e-4955-9b9f-449102d1fc70

### Attempts to fix this error

I also checked whether this logging issue occurs with the `rsl_rl` library, using [rsl_rl_ppo_cfg.py](https://github.com/isaac-sim/IsaacLab/blob/main/source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/reach/config/ur_10/agents/rsl_rl_ppo_cfg.py) with `max_iterations=500`. There, logging works as intended:

<img width="1463" height="690" alt="Image" src="https://github.com/user-attachments/assets/364f8ecc-e388-4b68-b817-dbd2bedea442" />

The same holds for the `skrl` library with [skrl_ppo_cfg.yaml](https://github.com/isaac-sim/IsaacLab/blob/main/source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/reach/config/ur_10/agents/skrl_ppo_cfg.yaml):

<img width="1468" height="692" alt="Image" src="https://github.com/user-attachments/assets/8677a5f6-858a-4421-b228-00f947b62d3f" />

### UR10 Reach task

This is the TensorBoard logging output from the [Isaac-Reach-UR10-v0](https://github.com/isaac-sim/IsaacLab/blob/main/source/isaaclab_tasks/isaaclab_tasks/manager_based/manipulation/reach/config/ur_10/joint_pos_env_cfg.py) task, showing two metrics. If I zoom in on one logged metric, I can see that it updates only every **15** epochs:

<img width="730" height="434" alt="Image" src="https://github.com/user-attachments/assets/322aaaa1-cb3a-43a6-9c91-24f6e6b6ef33" />
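To measure the update interval instead of reading it off a zoomed plot, the sketch below lists the steps at which a logged scalar actually changes and the gaps between them. It is not part of `rl_games` or Isaac Lab. The helper names (`change_steps`, `change_intervals`, `load_scalars`) are made up for illustration, and the demo data is synthetic. Only `load_scalars` needs the `tensorboard` package, which it uses through its `EventAccumulator` reader.

```python
from typing import List, Sequence, Tuple


def change_steps(scalars: Sequence[Tuple[int, float]], tol: float = 0.0) -> List[int]:
    """Return the steps at which the logged value differs from the previous point."""
    steps = []
    for (_, prev_value), (step, value) in zip(scalars, scalars[1:]):
        if abs(value - prev_value) > tol:
            steps.append(step)
    return steps


def change_intervals(steps: Sequence[int]) -> List[int]:
    """Return the gaps between consecutive change steps."""
    return [b - a for a, b in zip(steps, steps[1:])]


def load_scalars(logdir: str, tag: str) -> List[Tuple[int, float]]:
    """Read (step, value) pairs for one tag from a TensorBoard event directory."""
    # Imported lazily so the helpers above work without TensorBoard installed.
    from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

    accumulator = EventAccumulator(logdir)
    accumulator.Reload()
    return [(event.step, event.value) for event in accumulator.Scalars(tag)]


if __name__ == "__main__":
    # Synthetic data only: a value that is held constant for 15 steps at a time.
    synthetic = [(step, float(step // 15)) for step in range(46)]
    steps = change_steps(synthetic)
    print(steps, change_intervals(steps))
```

For a real run, `load_scalars(logdir, tag)` would be called with the run's log directory and one of the scalar tags shown in TensorBoard. Both values depend on the local setup and are not specified here.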


### System Info

- Commit: 75b67154b
- Isaac Sim Version: 5.0.0
- OS: Ubuntu 22.04.5 LTS
- GPU: Quadro RTX 6000
- CUDA: 13.0
- GPU Driver: 580.65.0

### Checklist

- [x] I have checked that there is no similar issue in the repo
- [x] I have checked that the issue is not in running Isaac Sim itself and is related to the repo
