A Gymnasium implementation of the grid-world environment with a continuous state space and a discrete action space described by Thomas Degris, Martha White, and Richard S. Sutton in "Off-policy actor-critic", arXiv preprint arXiv:1205.4839 (2012).
The gym-puddle package is managed by uv. To install the package (in editable mode by default) together with all its extra dependencies, run:

```bash
uv sync --all-extras
```

This will install the project and its dependencies in a virtual environment under `./.venv`.
To run the pytest tests, simply run:

```bash
pytest tests/
```
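If the virtual environment is not activated, the tests can also be run through uv itself; a minimal sketch, assuming pytest is among the synced dependencies:

```bash
uv run pytest tests/
```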
Below is a simple example of using a random policy for a maximum of 1000 time-steps.
```python
import gymnasium as gym

import gym_puddle  # registers the PuddleWorld environments


def main() -> None:
    seed = 43
    env = gym.make("PuddleWorld-v0", render_mode="human", goal=[0.96, 0.96])
    observation, _ = env.reset(seed=seed)
    env.action_space.seed(seed=seed)
    for _ in range(1000):
        action = env.action_space.sample()  # random policy
        observation, reward, terminated, truncated, _ = env.step(action)
        env.render()
        if terminated or truncated:
            env.reset()
            break
    env.close()


if __name__ == "__main__":
    main()
```
Notes:
- In the above example, the agent-environment interaction is rendered visually on a canvas (since we've set `render_mode="human"`). To disable rendering, remove that argument.
- To truncate episodes after a given number of time steps, pass `max_episode_steps` to the input arguments of `make()`. Note that the caller needs to reset the environment immediately after truncation or termination (see the example above and the sketch after this list).
- Rendering is fast, but disabling it makes the code even faster and is highly recommended when training agents.
- Off-Policy Actor-Critic. Thomas Degris, Martha White, Richard S. Sutton. In Proceedings of the Twenty-Ninth International Conference on Machine Learning (ICML), 2012.
- The code is based on and forked from EhsanEI's implementation.
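Below is a minimal sketch of the two notes above: making the environment without rendering and with automatic truncation via `max_episode_steps`. The 500-step limit and the seed are illustrative values, not defaults of the package:

```python
import gymnasium as gym

import gym_puddle  # registers the PuddleWorld environments

# Illustrative values (not package defaults): 500-step truncation, seed 43.
env = gym.make("PuddleWorld-v0", max_episode_steps=500)  # no render_mode, so nothing is drawn
observation, _ = env.reset(seed=43)
env.action_space.seed(seed=43)

done = False
while not done:
    action = env.action_space.sample()
    observation, reward, terminated, truncated, _ = env.step(action)
    done = terminated or truncated

env.reset()  # reset immediately after termination or truncation
env.close()
```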