[rllib] Add squash_to_range model option by ericl · Pull Request #2239 · ray-project/ray

ericl · 2018-06-12T01:33:40Z

What do these changes do?

PPO / A3C / PG currently do not respect Box action space low/high values, and will emit values beyond that range. This uses tf.sigmoid to squash to [0, 1] and then rescale to the right range.

need to test for performance regressions

Related issue number

#1862

AmplabJenkins · 2018-06-12T02:42:24Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/6013/
Test PASSed.

AmplabJenkins · 2018-06-19T00:07:29Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/6112/
Test FAILed.

AmplabJenkins · 2018-06-19T00:29:23Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/6111/
Test PASSed.

AmplabJenkins · 2018-06-19T00:43:56Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/6113/
Test FAILed.

richardliaw · 2018-06-19T20:20:12Z

Can you check for regressions on pendulum-ppo?

ericl · 2018-06-19T20:26:37Z

It's already done -- works fine.

…

On Tue, Jun 19, 2018 at 1:20 PM Richard Liaw ***@***.***> wrote: Can you check for regressions on pendulum-ppo? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#2239 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAA6SjAaIWuSUq26Wevy7u20eMLfmI3yks5t-V0DgaJpZM4Ujo5X> .

AmplabJenkins · 2018-06-20T02:31:19Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/6144/
Test PASSed.

* 'master' of https://github.com/ray-project/ray: (157 commits) Fix build failure while using make -j1. Issue 2257 (ray-project#2279) Cast locator with index type (ray-project#2274) fixing zero length partitions (ray-project#2237) Make actor handles work in Python mode. (ray-project#2283) [xray] Add error table and push error messages to driver through node manager. (ray-project#2256) addressing comments (ray-project#2210) Re-enable some actor tests. (ray-project#2276) Experimental: enable automatic GCS flushing with configurable policy. (ray-project#2266) [xray] Sets good object manager defaults. (ray-project#2255) [tune] Update Trainable doc to expose interface (ray-project#2272) [rllib] Add a simple REST policy server and client example (ray-project#2232) [asv] Pushing to s3 (ray-project#2246) [rllib] Remove need to pass around registry (ray-project#2250) Support multiple availability zones in AWS (fix ray-project#2177) (ray-project#2254) [rllib] Add squash_to_range model option (ray-project#2239) Mitigate randomly building failure: adding gen_local_scheduler_fbs to raylet lib. (ray-project#2271) [rllib] Refactor Multi-GPU for PPO (ray-project#1646) [rllib] Envs for vectorized execution, async execution, and policy serving (ray-project#2170) [Dataframe] Change pandas and ray.dataframe imports (ray-project#1942) [Java] Replace binary rewrite with Remote Lambda Cache (SerdeLambda) (ray-project#2245) ...

sigmoid

54bff0a

ericl added 4 commits June 18, 2018 13:04

Merge remote-tracking branch 'upstream/master' into squash-sigmoid

a27cea5

squash

94f33b1

squash true

75b10f4

git push

3807713

ericl changed the title ~~[WIP] [rllib] Squash continuous actions to specified range using tf.sigmoid~~ [rllib] Add squash_to_range model option Jun 18, 2018

ericl requested a review from richardliaw June 19, 2018 19:52

ericl assigned richardliaw Jun 19, 2018

richardliaw approved these changes Jun 19, 2018

View reviewed changes

Update catalog.py

90064f7

ericl merged commit 46cc51c into ray-project:master Jun 20, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rllib] Add squash_to_range model option#2239

[rllib] Add squash_to_range model option#2239
ericl merged 6 commits intoray-project:masterfrom
ericl:squash-sigmoid

ericl commented Jun 12, 2018 •

edited

Loading

Uh oh!

AmplabJenkins commented Jun 12, 2018

Uh oh!

AmplabJenkins commented Jun 19, 2018

Uh oh!

AmplabJenkins commented Jun 19, 2018

Uh oh!

AmplabJenkins commented Jun 19, 2018

Uh oh!

richardliaw commented Jun 19, 2018

Uh oh!

ericl commented Jun 19, 2018 via email

Uh oh!

AmplabJenkins commented Jun 20, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ericl commented Jun 12, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What do these changes do?

Related issue number

Uh oh!

AmplabJenkins commented Jun 12, 2018

Uh oh!

AmplabJenkins commented Jun 19, 2018

Uh oh!

AmplabJenkins commented Jun 19, 2018

Uh oh!

AmplabJenkins commented Jun 19, 2018

Uh oh!

richardliaw commented Jun 19, 2018

Uh oh!

ericl commented Jun 19, 2018 via email

Uh oh!

AmplabJenkins commented Jun 20, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ericl commented Jun 12, 2018 •

edited

Loading