
Allow PPO to turn off advantage normalization #61

Merged (7 commits) on Feb 23, 2022

Conversation

@vwxyzjn (Contributor) commented on Feb 22, 2022

Description

Allow PPO to turn off advantage normalization. Follow-up to DLR-RM/stable-baselines3#763 and #60.
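For context, advantage normalization in SB3's PPO rescales each minibatch of advantages to zero mean and unit variance before the policy loss is computed; this PR puts that step behind a normalize_advantage flag. A minimal sketch of the guarded step, assuming SB3's usual variable names in the training loop (not the exact diff):

# Per-minibatch advantage normalization, now guarded by the new flag
advantages = rollout_data.advantages
if self.normalize_advantage:
    # Rescale to zero mean and unit variance; the epsilon guards against division by zero
    advantages = (advantages - advantages.mean()) / (advantages.std() + 1e-8)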

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)

Checklist:

  • I've read the CONTRIBUTION guide (required)
  • The functionality/performance matches that of the source (required for new training algorithms or training-related features).
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have included an example of using the feature (required for new features).
  • I have included baseline results (required for new training algorithms or training-related features).
  • I have updated the documentation accordingly.
  • I have updated the changelog accordingly (required).
  • I have reformatted the code using make format (required)
  • I have checked the codestyle using make check-codestyle and make lint (required)
  • I have ensured make pytest and make type both pass. (required)

Note: we are using a maximum length of 127 characters per line

@araffin (Member) left a comment

Please update the changelog too, otherwise LGTM ;) (and you don't need an additional issue here if you already have one opened in the SB3 repo)


import pytest
from sb3_contrib import MaskablePPO

# Only `normalize_advantage` is parametrized, so the unused `model_class` argument is dropped
@pytest.mark.parametrize("normalize_advantage", [False, True])
def test_advantage_normalization(normalize_advantage):
    model = MaskablePPO("MlpPolicy", "CartPole-v1", n_steps=64, normalize_advantage=normalize_advantage)
@araffin (Member):

not sure if it works with CartPole, let's see...

@vwxyzjn (Contributor, Author):

Hey, you may need to approve the workflow run ;)

@araffin (Member) commented on Feb 22, 2022:

I did, and there are some failures already ;)
But it's best to quickly run the tests locally using pytest's -k argument.
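For example, to run only the new test locally, a minimal sketch (the shell equivalent is pytest -k test_advantage_normalization):

import pytest

# Select tests whose names match the -k expression, same as the CLI flag
pytest.main(["-k", "test_advantage_normalization"])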

@vwxyzjn (Contributor, Author) commented on Feb 22, 2022:

I had the following error, which seems related to DLR-RM/stable-baselines3#782:

ImportError: cannot import name 'GoalEnv' from 'gym' (/home/costa/.cache/pypoetry/virtualenvs/cleanrl-ghSZGHE3-py3.9/lib/python3.9/site-packages/gym/__init__.py)
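For background (an inference, not stated in the thread): gym 0.22, released shortly before this PR, removed GoalEnv from the core gym package, which is exactly the import that fails above. Until the dependency was pinned or updated, code like the following would raise:

# Fails under gym >= 0.22, where GoalEnv was removed from the core package
from gym import GoalEnv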

@vwxyzjn (Contributor, Author) commented on Feb 22, 2022:

OK, should be good now.

@araffin merged commit f5c1aaa into Stable-Baselines-Team:master on Feb 23, 2022