MCTS

Monte Carlo Tree Search implementation for a receding horizon control setting.

Samples

CartPole-v1

intersection-v0 from https://github.com/eleurent/highway-env

Features

Supports sparse/dense rewards
Supports replanning for receding horizon control.
Simple and Stand-alone.

Tested on

CartPole-v1, FrozenLake8x8, intersection-v0 (from highway_env), Taxi-v0

Hyperparameters

gamma (discount factor): 0.99 should work across most.
replanning_horizon: number of timesteps after which MCTS is queried again for a new plan. It is set based on how long a sucessful episode could last and granularity of control required. For example, in FrozenLake8x8-v0, a replannig horizon of 5 could be appropriate, but in cartpole, a horizon of 50 works fine.
max_tree_depth: maximum depth to which the tree is expanded. needs to be finite to support continuous control tasks.
num_iterations: number of rollouts performed per MCTS query.

TODO

Reset tree statistics/visits on stepping the tree.
Parallelize implementation.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
samples		samples
LICENSE		LICENSE
README.md		README.md
mcts_receding_horizon.py		mcts_receding_horizon.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MCTS

Samples

Features

Tested on

Hyperparameters

TODO

References

About

Releases

Packages

Languages

License

sritee/Monte-Carlo-Tree-Search

Folders and files

Latest commit

History

Repository files navigation

MCTS

Samples

Features

Tested on

Hyperparameters

TODO

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages