Update value function with different action types, why? #32

tessavdheiden · 2020-09-28T15:45:33Z

Hi Shariq,

In your code you update the value function with actions computed by:

As far as I know, 1) has the gradient attached, while 2) does not.

Why did you implement it this way?

uhlajs · 2020-12-29T09:29:14Z

Hi @tessavdheiden,
I believe that in one call of update you want to update the actor of just one agent (namely, the actor attached to agent_i). That is why you pass one action sample with the gradient attached (the action of agent_i) and the others without.
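
To make that concrete, here is a minimal dependency-free sketch, not the repository's code. Everything in it is an assumption made up for illustration: the quadratic `critic`, the scalar policy parameters in `theta`, and the helper names. A finite difference stands in for backpropagation. The other agents' actions are computed once and held fixed, which plays the same role as passing them without a gradient attached.

```python
# Toy sketch with hypothetical names; it only illustrates the idea above.
# Each agent k has a scalar policy parameter theta[k]; its action is theta[k] * obs[k].


def critic(actions):
    """Hypothetical centralized critic Q(a_0, a_1) over the joint action."""
    a0, a1 = actions
    return -(a0 - 1.0) ** 2 - (a1 + 2.0) ** 2 + a0 * a1


def actor_gradient(theta, obs, agent_i, eps=1e-6):
    """Gradient of Q with respect to theta[agent_i] only."""
    # Actions of all agents, computed once. For k != agent_i they stay
    # constant below, i.e. they carry no gradient.
    actions = [t * o for t, o in zip(theta, obs)]

    def q_of_theta_i(t_i):
        joint = list(actions)
        joint[agent_i] = t_i * obs[agent_i]  # only agent_i's action varies
        return critic(joint)

    # Central finite difference stands in for backpropagation.
    t = theta[agent_i]
    return (q_of_theta_i(t + eps) - q_of_theta_i(t - eps)) / (2 * eps)


def update_actor(theta, obs, agent_i, lr=0.1):
    """One gradient-ascent step on Q for agent_i; other parameters are untouched."""
    new_theta = list(theta)
    new_theta[agent_i] += lr * actor_gradient(theta, obs, agent_i)
    return new_theta


theta = [0.5, -0.5]
obs = [1.0, 1.0]
updated = update_actor(theta, obs, agent_i=0)
print(updated)
```

Only `theta[0]` changes in this call. Updating the second agent would be a separate call with `agent_i=1`, in which the first agent's action is the one held fixed.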
