[RLlib] Attention Net prep PR #1: Smaller cleanups. #12447

sven1977 · 2020-11-26T13:26:44Z

The current attention net trajectory view PR (#11729) is too large (>1000 lines added).
Therefore, I'm moving the smaller preparatory and cleanup changes into ~2 pre-PRs. This is the first of these.

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

sven1977 · 2020-11-26T13:29:18Z

rllib/agents/ddpg/ddpg_torch_policy.py

    validate_spaces=validate_spaces,
    before_init=before_init_fn,
-    after_init=setup_late_mixins,
+    before_loss_init=setup_late_mixins,


This is the more accurate kwarg to use (torch did not have a loss init step before, so this is new). The old after_init still works exactly the same, so this does not break the API.
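To illustrate the backward-compatibility claim, here is a minimal sketch of how a policy template builder could accept both kwargs. `build_policy_sketch`, `SketchPolicy`, and the one-argument hook signatures are hypothetical simplifications for illustration, not RLlib's actual implementation; the only assumption taken from the comment above is that the old `after_init` behaves the same as the new `before_loss_init`.

```python
def build_policy_sketch(*, before_init=None, before_loss_init=None,
                        after_init=None):
    """Hypothetical stand-in for a policy template builder (not RLlib code)."""
    # Assumption: the old kwarg acts as an alias when the new one is unset.
    late_hook = before_loss_init if before_loss_init is not None else after_init

    class SketchPolicy:
        def __init__(self):
            self.events = []
            if before_init is not None:
                before_init(self)
            # Late mixins are set up right before the loss is initialized.
            if late_hook is not None:
                late_hook(self)
            self.events.append("loss_init")

    return SketchPolicy


def before_init_fn(policy):
    policy.events.append("before_init")


def setup_late_mixins(policy):
    policy.events.append("late_mixins")


new_style = build_policy_sketch(
    before_init=before_init_fn, before_loss_init=setup_late_mixins)()
old_style = build_policy_sketch(
    before_init=before_init_fn, after_init=setup_late_mixins)()
```

In this sketch both spellings run the hook at the same point, so existing callers that pass `after_init` would see no change in ordering.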

sven1977 · 2020-11-26T13:29:42Z

rllib/agents/ppo/ppo_tf_policy.py

    # RNN case: Mask away 0-padded chunks at end of time axis.
    if state:
-        max_seq_len = tf.reduce_max(train_batch["seq_lens"])
+        # Derive max_seq_len from the data itself, not from the seq_lens


Prep for attention nets, where dynamically taking the max over the given sequences is not allowed.
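The idea of deriving the padded length from the data rather than from `seq_lens` can be sketched in plain Python. This assumes the batch is flattened to `B * T` rows, with every sequence zero-padded to the same length `T`; the helper names are hypothetical and the diff above is truncated, so this is not necessarily the exact code in the PR.

```python
def padded_max_seq_len(num_rows, seq_lens):
    """Infer the padded time dimension T from a flattened [B*T, ...] batch."""
    batch_size = len(seq_lens)
    if batch_size == 0 or num_rows % batch_size != 0:
        raise ValueError("num_rows must be a positive multiple of len(seq_lens)")
    return num_rows // batch_size


def flat_valid_mask(num_rows, seq_lens):
    """Boolean mask over all B*T rows; False marks zero-padded timesteps."""
    max_seq_len = padded_max_seq_len(num_rows, seq_lens)
    return [t < n for n in seq_lens for t in range(max_seq_len)]


# Two sequences of true lengths 2 and 3, each padded to 4 timesteps.
seq_lens = [2, 3]
num_rows = 8
inferred = padded_max_seq_len(num_rows, seq_lens)
mask = flat_valid_mask(num_rows, seq_lens)
```

Here `max(seq_lens)` is 3 while the padded length inferred from the data is 4, so a mask built from the former would not line up with the flattened batch.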

sven1977 · 2020-11-26T13:30:14Z

rllib/evaluation/sampler.py

            episode._set_last_observation(agent_id, filtered_obs)
            episode._set_last_raw_obs(agent_id, raw_obs)
-            episode._set_last_info(agent_id, infos[env_id].get(agent_id, {}))
+            # Infos from the environment.


Adding "infos" to the collectors, if required.

…ntion_nets_prep_0

WIP.

b5a4bc1

sven1977 requested a review from ericl November 26, 2020 13:26

sven1977 assigned ericl Nov 26, 2020

Fix.

0437680

sven1977 commented Nov 26, 2020

sven1977 mentioned this pull request Nov 26, 2020

[RLlib] Attention Net prep PR #2: Smaller cleanups. #12449

Merged

6 tasks

sven1977 added 2 commits November 26, 2020 16:37

Fix.

5e269c4

Fix.

787810d

sven1977 added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Nov 26, 2020

Merge branch 'master' of https://github.com/ray-project/ray into atte…

866180e

…ntion_nets_prep_0

ericl approved these changes Nov 28, 2020

ericl added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Nov 28, 2020

ericl merged commit 0df55a1 into ray-project:master Nov 28, 2020

sven1977 deleted the attention_nets_prep_0 branch March 27, 2021 11:39
