[Bug] Use of gym.make() stops "rollout/" data from being printed #232

pstansell · 2020-11-19T01:58:46Z

🐛 Bug

If gym.make is used to define the environment the rollout data is not printed.

To Reproduce

Example where gym.make() is used:

import gym
from stable_baselines3 import SAC
env = gym.make('Pendulum-v0')
model = SAC('MlpPolicy', env, verbose = 1)
model.learn(200, log_interval = 1)

Output is missing the rollout/ data:

Using cpu device
Wrapping the env in a DummyVecEnv.
---------------------------------
| time/              |          |
|    episodes        | 1        |
|    fps             | 71       |
|    time_elapsed    | 2        |
|    total timesteps | 200      |
| train/             |          |
|    actor_loss      | 7.2      |
|    critic_loss     | 2.08     |
|    ent_coef        | 0.971    |
|    ent_coef_loss   | -0.0491  |
|    learning_rate   | 0.0003   |
|    n_updates       | 99       |
---------------------------------

Expected behavior

Example where gym.make() is not used:

import gym
from stable_baselines3 import SAC
model = SAC('MlpPolicy', 'Pendulum-v0', verbose = 1)
model.learn(200, log_interval = 1)

Output includes the rollout/ data:

Using cpu device
Creating environment from the given name 'Pendulum-v0'
Wrapping the env in a DummyVecEnv.
----------------------------------
| rollout/           |           |
|    ep_len_mean     | 200       |
|    ep_rew_mean     | -1.33e+03 |
| time/              |           |
|    episodes        | 1         |
|    fps             | 57        |
|    time_elapsed    | 3         |
|    total timesteps | 200       |
| train/             |           |
|    actor_loss      | 7.34      |
|    critic_loss     | 1.02      |
|    ent_coef        | 0.971     |
|    ent_coef_loss   | -0.0488   |
|    learning_rate   | 0.0003    |
|    n_updates       | 99        |
----------------------------------

System Info

Describe the characteristic of your environment:

pip install stable-baselines3
Python 3.6.8
PyTorch version 1.7.0
Gym version 0.17.3

The text was updated successfully, but these errors were encountered:

araffin · 2020-11-19T08:40:48Z

Hello,

This is not a bug, you need to wrap your environment using a Monitor wrapper.
See Documentation and hill-a/stable-baselines#339

pstansell · 2020-11-19T12:14:40Z

Thank you very much for your quick reply. I'm sorry I missed the need for the monitor wrapper.

The example at hill-a/stable-baselines#24 it very useful to show how it is applied.

My example above does what I want if I use:

import gym
from stable_baselines3 import SAC
from stable_baselines3.common.monitor import Monitor
env = gym.make('Pendulum-v0')
env = Monitor(env)
model = SAC('MlpPolicy', env, verbose = 1)
model.learn(200, log_interval = 1)

The output is now:

Using cpu device
Wrapping the env in a DummyVecEnv.
---------------------------------
| rollout/           |          |
|    ep_len_mean     | 200      |
|    ep_rew_mean     | -973     |
| time/              |          |
|    episodes        | 1        |
|    fps             | 169      |
|    time_elapsed    | 1        |
|    total timesteps | 200      |
| train/             |          |
|    actor_loss      | 6.08     |
|    critic_loss     | 2.26     |
|    ent_coef        | 0.971    |
|    ent_coef_loss   | -0.049   |
|    learning_rate   | 0.0003   |
|    n_updates       | 99       |
---------------------------------

It appears that a number of people have tripped up on the same thing. If there had been a message along the lines of

Wrapping the env in a Monitor.

I would probably have worked it out myself and not taken your time by submitting this issue as a bug.

Miffyli · 2020-11-19T12:18:12Z

It appears that a number of people have tripped up on the same thing. If there had there been a message along the lines of
Wrapping the env in a Monitor.

I think this is a good suggestion that could be included: Monitor is indeed a bit of a quirk but heavily depended on by SB3, so any clarity of its use would be nice to see.

araffin · 2020-11-19T12:19:48Z

It appears that a number of people have tripped up on the same thing. If there had there been a message along the lines of

fair enough, I think we can in fact wrap it automatically, since we have is_wrapped helper since #220 .

pstansell added the bug Something isn't working label Nov 19, 2020

araffin added RTFM Answer is the documentation and removed bug Something isn't working labels Nov 19, 2020

araffin added the enhancement New feature or request label Nov 19, 2020

araffin mentioned this issue Nov 20, 2020

Automatically wrap with a Monitor when possible #237

Merged

14 tasks

Miffyli closed this as completed in #237 Nov 20, 2020

This was referenced Nov 23, 2020

No tensorboard log is created during training #242

Closed

[Question] total_episode_reward_logger is wrongly handled due to the way of storing dones hill-a/stable-baselines#1049

Closed

araffin mentioned this issue Dec 4, 2020

[Question] SAC Tensorboard Logging Reward #251

Closed

araffin mentioned this issue Dec 16, 2021

Integrating minigrid #689

Closed

icbarbu mentioned this issue Jan 3, 2022

[Question] Should SB3 write the event files when Im using the Tensorboard Integration? #718

Closed

2 tasks

araffin mentioned this issue Sep 12, 2022

[Question] No reward info during parallel training? #1064

Closed

araffin mentioned this issue Aug 24, 2023

[Question] Difference between make_vec_env and SubprocVecEnv for multiprocessing #1654

Closed

4 tasks

Elcoid mentioned this issue Jan 12, 2024

rollout/ep_len_mean and rollout/ep_rew_mean not shown despite env wrapped in Monitor #1806

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Use of gym.make() stops "rollout/" data from being printed #232

[Bug] Use of gym.make() stops "rollout/" data from being printed #232

pstansell commented Nov 19, 2020 •

edited

Loading

araffin commented Nov 19, 2020

pstansell commented Nov 19, 2020 •

edited

Loading

Miffyli commented Nov 19, 2020

araffin commented Nov 19, 2020

[Bug] Use of gym.make() stops "rollout/" data from being printed #232

[Bug] Use of gym.make() stops "rollout/" data from being printed #232

Comments

pstansell commented Nov 19, 2020 • edited Loading

🐛 Bug

To Reproduce

Expected behavior

System Info

araffin commented Nov 19, 2020

pstansell commented Nov 19, 2020 • edited Loading

Miffyli commented Nov 19, 2020

araffin commented Nov 19, 2020

pstansell commented Nov 19, 2020 •

edited

Loading

pstansell commented Nov 19, 2020 •

edited

Loading