Puddle world environment for OpenAI Gym with continuous state space and discrete action space.
Run this line in the project directory.
pip3 install -e .
import gym
import gym_puddle # Don't forget this extra line!
env = gym.make('PuddleWorld-v0')
Rendering is available. The program runs much faster without it, but it's good for debugging.
Run this line after each time step to see the puddles and agent location.
env.render()
Off-Policy Actor-Critic. Thomas Degris, Martha White, Richard S. Sutton. In Proceedings of the Twenty-Ninth International Conference on Machine Learning (ICML), 2012.
http://rlpark.github.io/ (Java implementation)
https://github.com/samindaa/RLLib (C++ implementation)