-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Do something about the actor-critic Coursera assignment #398
Comments
@dniku, Have you done anything so far about this problem? Besides, I am facing issues in the assignment of week08 related to this issue. I am trying to fix both now; the policy loss and the reward are not correct in both studies (week08 and week06), although I have done everything that I can be done to fix both of them. I want to know if it is something regards the |
We haven't done anything about this assignment; but this issue is about the Coursera assignment specifically, and not the ones in the Your screenshots of plots seem to indicate that your agent isn't learning anything at all, and is behaving randomly. I'd guess that the reason is some bug in your code, e.g. a |
Hello @dniku, |
Currently,
week5_policy_based/practice_a3c.ipynb
has numerous problems.master
(it is a heavily modified version of master/week08/practice_pomdp which was never originally intended to be an actor-critic assignment).The difficulty is fixing this is that the videos that lead up to this assignment talk about A3C a lot.
The text was updated successfully, but these errors were encountered: