-
Notifications
You must be signed in to change notification settings - Fork 34
Custom rollout depth #91
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
d0f593b
to
2ea8b4b
Compare
Hi Honza! Thanks for a quick attempt at this! I have a few comments. First, I like the approach of putting the depth as a parameter of the rollout estimator. I think this is elegant and easy to follow. Though as a consequence, it will be difficult to use the current depth in the tree in the rollouts. At least for me, this is not a big issue and it's simpler this way, but maybe in some use cases, this is needed. What do you think @zsunberg? This could be addressed by still passing the depth to the
Regarding the code, I tried to add commits in your branch that will let the tests run, I haven't figured out how to add them here (do you know how to do that?).
Also, you can test your code quickly from REPL. Some tests are failing right now, and this is how you run them:
Alternatively, you can run the I think as the next step, you can have a look at the test set, but let's wait for @zsunberg before fixing them. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These changes allow for runtest.jl to run.
Co-authored-by: Jan <[email protected]>
@WhiffleFish can you please review these changes and let me know what you think. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall, I think the general approach looks good. However, in my opinion, it still requires a few more changes before we merge.
As a final note, some tests failed with the changes. Line 26 of test/options.jl
errored because it still included depth as an argument in the value estimation call, so that should be rectified possibly along with others that may fail once this one doesn't error.
Thank you for the review! @BoZenKhaa @WhiffleFish
|
@BoZenKhaa I have added your suggestions. I have not found any inconsistencies in the documenttation. |
I meant this I very much like the state of the changes! |
Hi all, sorry for the delayed response - I have been quite busy here. The changes are quite nice. Thank you for being so thorough. One question to consider here: In this PR you have reduced the arguments for Any other thoughts? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In the docstrings for the solvers, please change the documentation for the depth
argument to reflect that it no longer applies to the rollouts.
@WhiffleFish are the changes you requested complete? |
We discussed this with @WhiffleFish in the review. The outcome was that making the new behavior default is a breaking change, keeping the legacy functionality would not be generally useful and would unnecessarily complicate things. If you agree, I think this is ready :-D |
Hi all, sorry that this fell stale - this semester was very busy for me, but it just ended. I think it is useful to allow the old behavior, so I will go through and add the depth argument back in and fix the other minor issues and merge this tomorrow. |
Thanks a bunch @jancervenka @BoZenKhaa ! I am sorry that this took so long, but it is a useful addition! |
Draft resolving #88