Observation and reward normalization #1

51616 · 2022-01-17T11:10:50Z

I have a question regarding the FACMAC implementation.
Did you use any wrapper such as observation/reward normalization or action clipping/rescaling? 'Cause in the original single-agent mode, the implementation usually use normalization and clipping wrapper for Mujoco tasks. I cannot find any wrapper in this repo so I wonder did you just use raw observation and reward to train the agents?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Observation and reward normalization #1

Observation and reward normalization #1

51616 commented Jan 17, 2022

Observation and reward normalization #1

Observation and reward normalization #1

Comments

51616 commented Jan 17, 2022