We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi,
how do you come to value of state 11 being -14?
The text was updated successfully, but these errors were encountered:
In example 4.1 This is an undiscounted episodic task, the reward is -1 on all transitions until the terminal state is reached
Since state 11 is not the terminal state so it's reward is -14
Sorry, something went wrong.
No branches or pull requests
Hi,
how do you come to value of state 11 being -14?
The text was updated successfully, but these errors were encountered: