So I think the intended usage is that we are supposed to adjust `rewards/time` so that its x-axis is Wall Time, and `rewards/step` so that it uses `global_step`? (Somewhat confusingly, `rewards/iter` seems fine with the normal Step, though it is clear in the code that `iter` is supposed to refer to an epoch.)
It would be nice if there were a way to automatically set all three plots so that they use the appropriate x-axis from the start. I'm not sure if such a feature is available.
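One possible way to get this, sketched here under assumptions: W&B's `define_metric` accepts a `step_metric` argument that names the logged column a plot should use as its x-axis. The column names `x/frame`, `x/epoch`, and `x/seconds`, and both helper functions, are hypothetical and not part of rl_games. The sketch logs through `wandb.log` directly; whether the same setup works with statistics synced from a tensorboardX writer is not verified here.

```python
# Hypothetical column names for the x-axis values; not taken from rl_games.
AXES = {
    "rewards/step": "x/frame",
    "rewards/iter": "x/epoch",
    "rewards/time": "x/seconds",
}


def configure_axes(wb):
    """Tell W&B which logged column each reward plot should use as its x-axis.

    `wb` is expected to be the imported `wandb` module, after `wandb.init(...)`.
    """
    for metric, axis in AXES.items():
        wb.define_metric(axis)                      # register the x-axis column
        wb.define_metric(metric, step_metric=axis)  # plot `metric` against it


def log_reward_to_wandb(wb, mean_reward, frame, epoch_num, total_time):
    """Log the reward under three names, together with the three x-axis columns."""
    wb.log({
        "rewards/step": mean_reward, "x/frame": frame,
        "rewards/iter": mean_reward, "x/epoch": epoch_num,
        "rewards/time": mean_reward, "x/seconds": total_time,
    })
```

With the real library this would be called as `configure_axes(wandb)` once after `wandb.init(...)`, then `log_reward_to_wandb(wandb, ...)` once per epoch.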
If this is the intended usage, feel free to close this issue report. Thanks!
I am running PPO with wandb integration, but the statistics seem to not be recorded as intended.
I am testing this with Isaac Gym environments but I am unsure if this issue is specific to Isaac Gym.
Steps to reproduce: after installing following the IsaacGymEnvs instructions, run a command like this in the `isaacgymenvs/` directory:
Where you can replace `danieltakeshi` with your username, and change `isaac-gym` to your project.
After I run this, the reward goes up (good) but I also see this on wandb:
The code is recording the reward as a function of `iter`, `step`, and `time`. It stores it in `rl_games` here:
rl_games/rl_games/common/a2c_common.py
Lines 947 to 955 in d8645b2
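A minimal sketch of the logging pattern described here, not the actual lines from `a2c_common.py`: the same reward is written under three tags, each with a different x value. `RecordingWriter` is a hypothetical in-memory stand-in for `tensorboardX.SummaryWriter`, and the argument names `frame` and `total_time` are assumptions.

```python
class RecordingWriter:
    """Hypothetical stand-in for tensorboardX.SummaryWriter; keeps scalars in memory."""

    def __init__(self):
        self.scalars = {}

    def add_scalar(self, tag, scalar_value, global_step=None):
        # Mirrors the leading arguments of SummaryWriter.add_scalar.
        self.scalars.setdefault(tag, []).append((global_step, scalar_value))


def log_reward(writer, mean_reward, frame, epoch_num, total_time):
    """Write one reward value under three tags, each with its own x value."""
    writer.add_scalar("rewards/step", mean_reward, frame)
    writer.add_scalar("rewards/iter", mean_reward, epoch_num)
    writer.add_scalar("rewards/time", mean_reward, total_time)


writer = RecordingWriter()
log_reward(writer, mean_reward=1.5, frame=8192, epoch_num=1, total_time=3.2)
log_reward(writer, mean_reward=2.0, frame=16384, epoch_num=2, total_time=6.1)
```

Each tag ends up with its own sequence of x values, so a dashboard that plots all three against a single shared step counter would lose the step and time axes.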
The code is storing the statistics with respect to different quantities (epoch, step, and time) to the `self.writer`, which is a `tensorboardX.SummaryWriter` (link to docs). But the statistics on wandb seem to only show the x-axis as "`iter`" (which is the same as `epoch_num` here) and they don't show performance as a function of the step or time. Is there a way to address such an issue here?
(Also posting on the Isaac Gym repo: isaac-sim/IsaacGymEnvs#87)