[Wrapper]: add NormalizeObservation and NormalizeReward #1635

Closed
wants to merge 6 commits

Conversation

zuoxingdong
Contributor

No description provided.
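Since the PR provides no description, here is a minimal sketch of what wrappers with these names typically do: keep a running mean and variance of observations and rescale each observation with them. The class and attribute names below are assumptions for illustration, not necessarily the PR's actual code.

```python
import numpy as np
import gym


class RunningMeanStd:
    """Tracks a running mean and variance with incremental batch updates."""

    def __init__(self, shape):
        self.mean = np.zeros(shape, dtype=np.float64)
        self.var = np.ones(shape, dtype=np.float64)
        self.count = 1e-4  # small prior count to avoid division by zero

    def update(self, x):
        batch_mean = np.mean(x, axis=0)
        batch_var = np.var(x, axis=0)
        batch_count = x.shape[0]

        delta = batch_mean - self.mean
        total = self.count + batch_count
        # Combine the existing statistics with the new batch (parallel variance formula).
        self.mean = self.mean + delta * batch_count / total
        m_a = self.var * self.count
        m_b = batch_var * batch_count
        m2 = m_a + m_b + np.square(delta) * self.count * batch_count / total
        self.var = m2 / total
        self.count = total


class NormalizeObservation(gym.ObservationWrapper):
    """Illustrative sketch: normalize observations with running statistics."""

    def __init__(self, env, epsilon=1e-8):
        super().__init__(env)
        self.rms = RunningMeanStd(shape=env.observation_space.shape)
        self.epsilon = epsilon

    def observation(self, obs):
        # Update the running statistics with the current observation, then rescale it.
        self.rms.update(np.asarray([obs]))
        return (obs - self.rms.mean) / np.sqrt(self.rms.var + self.epsilon)
```

A NormalizeReward wrapper would follow the same pattern, tracking running statistics of (discounted) returns and dividing rewards by their standard deviation.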

@pzhokhov
Collaborator

observation / reward normalization is a bit of a contested topic - on one hand, it can greatly help learning across a variety of environments; on the other hand, it makes the environment stateful (i.e. reset does not return the environment to a clean state), agents become hard to serialize (because one has to save the state of the environment along with the state of the agent), and training on environments with sparse rewards can be completely messed up. As such, I don't think this should be a part of gym.
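To illustrate the statefulness concern: with a running-statistics wrapper like the sketch above (hypothetical names, not this PR's API), the statistics survive reset(), so a "fresh" episode is not actually fresh, and a saved agent only reproduces its behaviour if the wrapper's statistics are saved and restored with it. The old 4-tuple step API is assumed here.

```python
# Sketch only: demonstrates that the wrapper's running statistics persist across reset().
env = NormalizeObservation(gym.make("CartPole-v1"))

obs = env.reset()
for _ in range(1000):
    obs, reward, done, info = env.step(env.action_space.sample())
    if done:
        obs = env.reset()

# reset() does NOT clear env.rms, so two "fresh" episodes can see differently
# scaled observations; serializing an agent therefore also requires saving
# env.rms.mean, env.rms.var, and env.rms.count.
print(env.rms.mean, env.rms.var, env.rms.count)
```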
