[BUG] Incompatible with latest gym normalize wrappers #185

vwxyzjn · 2022-08-10T15:04:17Z

Describe the bug

A clear and concise description of what the bug is.

To Reproduce

Steps to reproduce the behavior.

Please try to provide a minimal example to reproduce the bug. Error messages and stack traces are also helpful.

Please use the markdown code blocks for both code and stack traces.

import numpy as np
import envpool
import gym

envs = envpool.make(
    "HalfCheetah-v4",
    env_type="gym",
    num_envs=4,
)
envs.num_envs = 4
envs.single_action_space = envs.action_space
envs.single_observation_space = envs.observation_space
envs.is_vector_env = True
envs = gym.wrappers.ClipAction(envs)
envs = gym.wrappers.NormalizeObservation(envs)
envs = gym.wrappers.TransformObservation(envs, lambda obs: np.clip(obs, -10, 10))
envs = gym.wrappers.NormalizeReward(envs)
envs = gym.wrappers.TransformReward(envs, lambda reward: np.clip(reward, -10, 10))
obs = envs.reset()
envs.step(np.array([envs.action_space.sample() for _ in range(envs.num_envs)]))

Traceback (most recent call last):
  File "/home/costa/Documents/go/src/github.com/vwxyzjn/envpool-cleanrl/bug.py", line 22, in <module>
    envs.step(np.array([envs.action_space.sample() for _ in range(envs.num_envs)]))
  File "/home/costa/.cache/pypoetry/virtualenvs/envpool-cleanrl-uAHoRI5J-py3.9/lib/python3.9/site-packages/gym/core.py", line 532, in step
    step_returns = self.env.step(action)
  File "/home/costa/.cache/pypoetry/virtualenvs/envpool-cleanrl-uAHoRI5J-py3.9/lib/python3.9/site-packages/gym/wrappers/normalize.py", line 149, in step
    self.env.step(action), True, self.is_vector_env
  File "/home/costa/.cache/pypoetry/virtualenvs/envpool-cleanrl-uAHoRI5J-py3.9/lib/python3.9/site-packages/gym/core.py", line 493, in step
    step_returns = self.env.step(action)
  File "/home/costa/.cache/pypoetry/virtualenvs/envpool-cleanrl-uAHoRI5J-py3.9/lib/python3.9/site-packages/gym/wrappers/normalize.py", line 77, in step
    obs, rews, terminateds, truncateds, infos = step_api_compatibility(
  File "/home/costa/.cache/pypoetry/virtualenvs/envpool-cleanrl-uAHoRI5J-py3.9/lib/python3.9/site-packages/gym/utils/step_api_compatibility.py", line 178, in step_api_compatibility
    return step_to_new_api(step_returns, is_vector_env)
  File "/home/costa/.cache/pypoetry/virtualenvs/envpool-cleanrl-uAHoRI5J-py3.9/lib/python3.9/site-packages/gym/utils/step_api_compatibility.py", line 59, in step_to_new_api
    and not infos["_TimeLimit.truncated"][i]
KeyError: '_TimeLimit.truncated'

Expected behavior

It would be great if envpool is compatible with the gym normalize wrappers or the other way around.

System info

Describe the characteristic of your environment:

Describe how the library was installed (pip, source, ...)
Python version
Versions of any other relevant libraries

import envpool, numpy, sys
print(envpool.__version__, numpy.__version__, sys.version, sys.platform)

0.6.3.post1 1.22.4 3.9.5 (default, Jul 19 2021, 13:27:26) 
[GCC 10.3.0] linux

Reason and Possible fixes

I think the reason is the new gym.wrappers.NormalizeReward wrapper expects to see something like _TimeLimit.truncated in the info section...

Checklist

I have checked that there is no similar issue in the repo (required)
I have read the documentation (required)
I have provided a minimal working example to reproduce the bug (required)

The text was updated successfully, but these errors were encountered:

Trinkle23897 · 2022-08-11T03:23:16Z

Maybe it's a bug from gym instead of envpool? I can only find 3 places that use _TimeLimit.truncated in gym repo main branch. The buggy one is from step_to_new_api while the other two are from step_to_old_api. It seems like step_to_new_api refers to an unknown key before assigning actual value to info dict, however, step_to_old_api doesn't have this issue.

vwxyzjn · 2022-08-12T18:56:56Z

I think the reason of the bug is openai/gym#3021 (comment). They expect a _TimeLimit.truncated to indicate if the key actually exists.

vwxyzjn assigned Trinkle23897 Aug 10, 2022

vwxyzjn mentioned this issue Aug 11, 2022

[Bug Report] gym normalize wrappers are incompatible with envpool openai/gym#3021

Closed

1 task

Trinkle23897 added the bug Something isn't working label Aug 13, 2022

Trinkle23897 closed this as completed Nov 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Incompatible with latest gym normalize wrappers #185

[BUG] Incompatible with latest gym normalize wrappers #185

vwxyzjn commented Aug 10, 2022

Trinkle23897 commented Aug 11, 2022

vwxyzjn commented Aug 12, 2022

[BUG] Incompatible with latest gym normalize wrappers #185

[BUG] Incompatible with latest gym normalize wrappers #185

Comments

vwxyzjn commented Aug 10, 2022

Describe the bug

To Reproduce

Expected behavior

System info

Reason and Possible fixes

Checklist

Trinkle23897 commented Aug 11, 2022

vwxyzjn commented Aug 12, 2022