Update docstrings for improved documentation #160

axb2035 · 2022-11-26T20:19:44Z

Description

Updates the toy_text docstrings for better consistency and more information, as discussed on Discord. No dependencies required.

No related issue.

Type of change

Please delete options that are not relevant.

This change requires a documentation update

Screenshots

Please attach before and after screenshots of the change if applicable.

n/a

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

pseudo-rnd-thoughts · 2022-11-26T22:12:15Z

gymnasium/envs/toy_text/cliffwalking.py

+    - 3: Move left
+
+    ## Observation Space
+    There are 3 x 12 + 1 possible states. The player cannot be at the cliff, nor at


This is outside the scope of this PR, but should we consider adding an output with the coordinates of the agent? This could be enabled by an additional constructor argument and would change the output, but it would be easier to debug, and a better demonstration for new users, than an integer representing the state.

I agree it would be easier to understand and remove a layer of complexity for new users. It took me a while to figure out how it works. FYI: taxi and frozen lake use the same approach.

OK, I think this can be done in a future PR, possibly in the same PR as the transition probability PR.
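
As a rough illustration of the coordinate idea discussed in this thread, the integer observation can be unpacked by hand. This is a minimal sketch, assuming the 4 x 12 Cliff Walking grid and a row-major encoding (state = row * 12 + col). `state_to_coords` and `coords_to_state` are hypothetical helpers, not part of Gymnasium.

```python
# Hypothetical helpers for illustration only; not part of Gymnasium.
# Assumption: a 4 x 12 grid encoded row-major, i.e. state = row * N_COLS + col.
N_ROWS, N_COLS = 4, 12


def state_to_coords(state: int) -> tuple[int, int]:
    """Convert an integer state into (row, col) grid coordinates."""
    if not 0 <= state < N_ROWS * N_COLS:
        raise ValueError(f"state {state} is outside the {N_ROWS} x {N_COLS} grid")
    # divmod returns (quotient, remainder), which is (row, col) here.
    return divmod(state, N_COLS)


def coords_to_state(row: int, col: int) -> int:
    """Convert (row, col) grid coordinates back into an integer state."""
    if not (0 <= row < N_ROWS and 0 <= col < N_COLS):
        raise ValueError(f"({row}, {col}) is outside the {N_ROWS} x {N_COLS} grid")
    return row * N_COLS + col
```

Under these assumptions, the bottom-left cell (row 3, column 0) corresponds to state 36, which is the kind of mapping a new user currently has to work out on their own.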

pseudo-rnd-thoughts · 2022-11-26T22:13:17Z

gymnasium/envs/toy_text/cliffwalking.py

+    ## Information
+
+    `step()` and `reset()` return a dict with the following keys:
+    - "p" - transition proability for the state.


Given that the transition probability is always 1.0, what information does the transition probability actually help with?

Agree, not a lot. It's there for completeness. Taxi, frozen lake and cliff walking have probability returned in their information, though only frozen lake actually has it implemented (see next comment regarding taxi).

For cliff walking, we could implement transitional probabilities similar to frozen lake / taxi. If we do not, then I suggest we change the info returned to {}. Though that may break some people's code...

I think it is more interesting to add the transition probability; we can do this in a future PR.
Could you make an issue outlining the problem and proposed solution?
For the taxi environment, I would only apply randomness on the movement actions and not on pick up and drop off.

Issue #161 created. It doesn't include the suggestion to output coordinates of the agent for Cliff Walking, Frozen Lake and Taxi to keep the changes discrete. Will add the output co-ordinate enhancement proposal at a later date.
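
For readers following the proposal, here is a minimal sketch of what a frozen-lake-style stochastic move could look like on the cliff walking grid. It is not Gymnasium's implementation and not the design settled in issue #161. It assumes a 4 x 12 grid, an action order of 0 = up, 1 = right, 2 = down, 3 = left, and that the intended direction and its two perpendicular directions are each taken with probability 1/3. The names `move` and `slippery_transitions` are hypothetical, and the cliff reset is deliberately left out.

```python
# Hypothetical sketch for illustration only; not Gymnasium's implementation.
# Assumptions: 4 x 12 grid, actions 0 = up, 1 = right, 2 = down, 3 = left,
# and a frozen-lake-style slip: intended direction and both perpendicular
# directions each occur with probability 1/3. Cliff handling is omitted.
N_ROWS, N_COLS = 4, 12
DELTAS = {0: (-1, 0), 1: (0, 1), 2: (1, 0), 3: (0, -1)}


def move(row: int, col: int, action: int) -> tuple[int, int]:
    """Apply one deterministic move, clipping at the grid boundary."""
    d_row, d_col = DELTAS[action]
    new_row = min(max(row + d_row, 0), N_ROWS - 1)
    new_col = min(max(col + d_col, 0), N_COLS - 1)
    return new_row, new_col


def slippery_transitions(row: int, col: int, action: int) -> list[tuple[float, tuple[int, int]]]:
    """Return (probability, next_position) pairs for a stochastic move."""
    outcomes: dict[tuple[int, int], float] = {}
    # (action - 1) % 4 and (action + 1) % 4 are the two perpendicular directions.
    for actual in ((action - 1) % 4, action, (action + 1) % 4):
        target = move(row, col, actual)
        # Merge outcomes that land on the same cell (e.g. against a wall).
        outcomes[target] = outcomes.get(target, 0.0) + 1.0 / 3.0
    return [(prob, target) for target, prob in outcomes.items()]
```

With a table like this, the probability returned in the info dict would carry real information instead of always being 1.0. For taxi, the suggestion above would apply such a slip only to the movement actions, leaving pick up and drop off deterministic.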

pseudo-rnd-thoughts · 2022-11-26T22:16:52Z

gymnasium/envs/toy_text/taxi.py

-    As the steps are deterministic, "p" represents the probability of the transition which is always 1.0
+    As taxi is not stochastic, the transition probability is always 1.0. Implementing
+    a transitional probability in line with the Dietterich paper ('The fickle taxi task')
+    is a TODO.


Should this TODO be here? Do we need someone to implement this?

I've put my hand up to implement transitional probabilities in taxi once I finish the doc review. Replaces the "bugged will be fixed soon" comment :)

Update docstrings for improved documentation

b545e48

pseudo-rnd-thoughts reviewed Nov 26, 2022

pseudo-rnd-thoughts approved these changes Nov 29, 2022

Change reference anchors to be unique

7195685

Sphinx tracks anchors globally. Each has to have a unique name to avoid linking to wrong anchor.

pseudo-rnd-thoughts merged commit ae75ad2 into Farama-Foundation:main Nov 29, 2022
