Twrl-rebased #14

SeanNaren · 2016-11-30T11:57:32Z

Based on #8, merged v1 as well into branch. Let me know if there are any remaining issues!

Fix terminal state condition

Update Minecraft.lua

# Conflicts: # README.md # rlenvs/Minecraft.lua

Kaixhin · 2016-12-02T18:59:22Z

Sorry am trying to finish off a paper due mid-NIPS before I head off to NIPS, so will have to put this on hold for a week. Have started taking a look but definitely need to have a thorough look through the whole thing before merging to master.

SeanNaren · 2016-12-05T09:49:43Z

@Kaixhin sounds good, enjoy NIPS :)

Kaixhin · 2016-12-13T15:52:04Z

Remind what the conclusion on timeStepLimit and maxSteps was? Currently the env will run to the minimum of these, rather than maxSteps overriding timeStepLimit.

Also. in gym, what's reward_threshold? Looking at MountainCar, it seems like a scaled version of rlenvs' rewardSpace, but only limited one way.

SeanNaren · 2016-12-14T17:20:51Z

Currently the default maximum time step per episode is 1000. This can be changed by the environment via timeStepLimit in the Env.lua class, and by maxSteps set by the user in the options. However if maxSteps is greater than timeStepLimit (and not null) it is set to timeStepLimit

Does that make sense? Any changes you would make?

Kaixhin · 2016-12-14T17:43:12Z

Seems a little counterintuitive to have timeStepLimit overwrite maxSteps, but if that's what gym does then let's stick to that for consistency. If so then go ahead and merge this pull request!

I see that reward_threshold is used to determine if an environment is "solved", but that's not of concern to this library immediately.

SeanNaren · 2016-12-14T17:51:18Z

I think as kory said this was so that each episode is a maximum fixed size for their leaderboards! And sounds good, will just do one quick pass through and merge :)

SeanNaren · 2016-12-15T10:09:18Z

Thanks for your help @Kaixhin, was great stuff!

Kaixhin · 2016-12-15T11:43:09Z

Thanks a lot @SeanNaren! I'll check if anything needs to be backported to v1 myself, so good luck with twrl integration.

petrosgk and others added 8 commits November 14, 2016 19:10

Update Minecraft.lua

3fe0520

Merge pull request #12 from petrosgk/patch-1

6f0a9dd

Fix terminal state condition

Update Minecraft.lua

ea6b14d

Merge pull request #13 from petrosgk/patch-2

8a3aebc

Update Minecraft.lua

Fix Minecraft LUA_CPATH

f330a01

API modified to follow gym conventions

1bcbb6d

Test uses 2 space indentation

ee4a6b0

Merge remote-tracking branch 'origin/v1' into twrl-rebase

b7ab244

# Conflicts: # README.md # rlenvs/Minecraft.lua

SeanNaren mentioned this pull request Nov 30, 2016

Modify envs to be compatible with twrl #8

Closed

Kaixhin added 2 commits December 13, 2016 15:40

Tidy up README and code

aab044c

Add more details on API

d653ede

Add ref to OpenAI Gym

fe57638

SeanNaren merged commit 82a5084 into master Dec 15, 2016

SeanNaren deleted the twrl-rebase branch December 15, 2016 10:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Twrl-rebased #14

Twrl-rebased #14

SeanNaren commented Nov 30, 2016

Kaixhin commented Dec 2, 2016

SeanNaren commented Dec 5, 2016

Kaixhin commented Dec 13, 2016

SeanNaren commented Dec 14, 2016 •

edited

Loading

Kaixhin commented Dec 14, 2016

SeanNaren commented Dec 14, 2016

SeanNaren commented Dec 15, 2016

Kaixhin commented Dec 15, 2016

Twrl-rebased #14

Twrl-rebased #14

Conversation

SeanNaren commented Nov 30, 2016

Kaixhin commented Dec 2, 2016

SeanNaren commented Dec 5, 2016

Kaixhin commented Dec 13, 2016

SeanNaren commented Dec 14, 2016 • edited Loading

Kaixhin commented Dec 14, 2016

SeanNaren commented Dec 14, 2016

SeanNaren commented Dec 15, 2016

Kaixhin commented Dec 15, 2016

SeanNaren commented Dec 14, 2016 •

edited

Loading