You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @tessavdheiden,
I believe that in one call of update you want to update actor just for one actor (namely for actor attach to agent_i). That is the reason why you send one action sample with gradient attached (representing the action of agent agent_i) and others without.
Hi Shariq,
In your code you update the value function with actions computed by:
As far as I know, 1) has the gradient attached, while 2) does not.
Why did you implemented it this way?
The text was updated successfully, but these errors were encountered: