We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
It seems that you update critic before actor.
As far as I know, the actor_loss is calculated through critic network, so the backward of actor_loss will influence the grad of critic parameters.
Should we update actor first, and then update critic using both actor_loss and critic_loss?
The text was updated successfully, but these errors were encountered:
In the original paper, they update critic first tho.
Sorry, something went wrong.
No branches or pull requests
It seems that you update critic before actor.
As far as I know, the actor_loss is calculated through critic network, so the backward of actor_loss will influence the grad of critic parameters.
Should we update actor first, and then update critic using both actor_loss and critic_loss?
The text was updated successfully, but these errors were encountered: