Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I dream of agent playing 2048 using this library~ #41

Closed
LinXueyuanStdio opened this issue Jun 5, 2023 · 5 comments
Closed

I dream of agent playing 2048 using this library~ #41

LinXueyuanStdio opened this issue Jun 5, 2023 · 5 comments
Labels
algorithm New algorithm config New or improved configuration enhancement New feature or request environment New or improved environment good first issue Good for newcomers

Comments

@LinXueyuanStdio
Copy link

May it come true?

@puyuan1996
Copy link
Collaborator

Hello,

Thank you very much for your attention. Yes, we are confident that this can be achieved. The Stochastic MuZero algorithm is designed to solve environments with stochasticity in dynamics, such as 2048. We are currently developing the 2048+Stochastic MuZero algorithm, and have completed about 80% of the work. We will submit the related PR in a few weeks, so please stay tuned. Thank you for your patience.

Best wishes.

@puyuan1996 puyuan1996 added enhancement New feature or request good first issue Good for newcomers labels Jun 7, 2023
@PaParaZz1
Copy link
Member

We are working on the implementation of Stochastic MuZero with 2048 env in #64. You can keep track of it.

@LinXueyuanStdio
Copy link
Author

Yeah, I have been following the PR for days.

@PaParaZz1
Copy link
Member

#77 shows some experiment results on 2048 environment.

@puyuan1996
Copy link
Collaborator

puyuan1996 commented Sep 12, 2023

  • Hello, we have comprehensively optimized the 2048 environment and the related configurations for Bot, MuZero, and Stochatic MuZero in this PR. We are now providing the benchmark results in the 2048 environment as shown in the following image:
  • Furthermore, we've provide the render video to illustrate the performance of Random action, Bot action, MuZero, and Stochastic MuZero agents in the 2048 environment.

    • Random action (~1000):
      game_2048_random

    • Bot action (~30000-70000):
      game_2048_bot_small

    • MuZero sim=100 (~27956):
      game_2048_muzero_ns100_s0

    • Stochastic MuZero sim=100 (~37356):
      game_2048_stochastic_muzero_ns100_s0_small

    • Where the term sim represents the number of simulations used for evaluation in MCTS, and the score within brackets corresponds to the score procured by each respective agent.

Thank you for your attention.

@puyuan1996 puyuan1996 added environment New or improved environment algorithm New algorithm config New or improved configuration labels Sep 12, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
algorithm New algorithm config New or improved configuration enhancement New feature or request environment New or improved environment good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

3 participants