Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sampled MuZero #87

Closed
carlosgmartin opened this issue Jan 9, 2024 · 4 comments
Closed

Sampled MuZero #87

carlosgmartin opened this issue Jan 9, 2024 · 4 comments

Comments

@carlosgmartin
Copy link
Contributor

Would you consider adding support for Sampled MuZero?

@fidlej
Copy link
Collaborator

fidlej commented Jan 15, 2024

Maybe MCTS is not well suited for a large number of actions or for continuous actions.
You can use policy gradients or a simpler search there.

@puyuan1996
Copy link

Hello, thank you to the contributors for their outstanding work on this repository. Regarding the issue here, you might be interested in the project "LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios". This repository not only supports the AlphaZero algorithm but also extends support to MuZero and a series of related algorithms and environments (including SampledMuZero variants), which might meet your requirements. Best wishes.

@carlosgmartin
Copy link
Contributor Author

@puyuan1996 Does LightZero plan to support JAX? I previously asked about that here a while ago.

@puyuan1996
Copy link

Hello, given that the developers of this repository do not intend to support Sampled MuZero, we plan to optimize the existing pytorch+cpp version of the Sampled MuZero algorithm variant in LightZero in the near future. After the optimization is completed, considering that some users may not be familiar with cpp, we may provide a jax version to facilitate their use and research. At the same time, we are very welcoming and hope that community contributors can provide help in migration and discussion during this process, for which we will be extremely grateful. For related discussions and progress, you can follow this link (opendilab/LightZero#210). Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants