Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ex 4.1 #78

Open
StoyanVenDimitrov opened this issue Feb 11, 2021 · 1 comment
Open

Ex 4.1 #78

StoyanVenDimitrov opened this issue Feb 11, 2021 · 1 comment

Comments

@StoyanVenDimitrov
Copy link

Hi,

how do you come to value of state 11 being -14?

@Kin-Zhang
Copy link

In example 4.1 This is an undiscounted episodic task, the reward is -1 on all transitions until the terminal state is reached

Since state 11 is not the terminal state so it's reward is -14

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants