Does the DQN fall in the paradigm of decentralized training and decentralized execution? #78

lml519 · 2020-04-26T03:15:59Z

Does the DQN fall under the paradigm of decentralized training and decentralized execution (DTDE)? I think it is an algorithm that combines parallel computing with DTDE, but I'm not sure whether my understanding is right.

merrymercy · 2020-04-28T00:57:55Z

I believe our DQN is in the paradigm of centralized training and decentralized execution.
During training, we collect all trajectories and train a single shared model, so the training is centralized.
During inference, we feed in different observations and agent embeddings, so the execution is decentralized.
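
To make the distinction concrete, here is a minimal sketch of the pattern described above. It is not this repository's code: a tiny linear Q-function stands in for the DQN, a one-hot agent id stands in for the agent embedding, and every name (`SharedLinearQ`, `features`, the toy rewards) is hypothetical. Transitions from all agents go into one buffer that updates one set of weights (centralized training), and each agent then picks its action from only its own observation and id (decentralized execution).

```python
N_AGENTS = 2
N_ACTIONS = 2
OBS_DIM = 2


def features(obs, agent_id):
    """Concatenate the local observation with a one-hot agent id (toy 'embedding')."""
    one_hot = [1.0 if i == agent_id else 0.0 for i in range(N_AGENTS)]
    return list(obs) + one_hot


class SharedLinearQ:
    """A single linear Q-function whose weights are shared by every agent."""

    def __init__(self, lr=0.1, gamma=0.9):
        dim = OBS_DIM + N_AGENTS
        self.w = [[0.0] * dim for _ in range(N_ACTIONS)]
        self.lr = lr
        self.gamma = gamma

    def q_values(self, obs, agent_id):
        x = features(obs, agent_id)
        return [sum(wi * xi for wi, xi in zip(w_a, x)) for w_a in self.w]

    def act(self, obs, agent_id):
        # Decentralized execution: only this agent's observation and id are used.
        q = self.q_values(obs, agent_id)
        return max(range(N_ACTIONS), key=q.__getitem__)

    def train_step(self, batch):
        # Centralized training: the batch mixes transitions from all agents.
        for obs, agent_id, action, reward, next_obs, done in batch:
            if done:
                target = reward
            else:
                target = reward + self.gamma * max(self.q_values(next_obs, agent_id))
            x = features(obs, agent_id)
            td_error = target - self.q_values(obs, agent_id)[action]
            for i, xi in enumerate(x):
                self.w[action][i] += self.lr * td_error * xi


# Toy one-step task (an assumption for illustration): agent 0 is rewarded
# for action 0, agent 1 for action 1, and both see the same observation.
obs = (1.0, 0.0)
buffer = [
    (obs, agent_id, action, 1.0 if action == agent_id else 0.0, obs, True)
    for agent_id in range(N_AGENTS)
    for action in range(N_ACTIONS)
]

policy = SharedLinearQ()
for _ in range(200):
    policy.train_step(buffer)

# Each agent queries the shared model independently at execution time.
actions = [policy.act(obs, agent_id) for agent_id in range(N_AGENTS)]
print(actions)
```

Because the agent id is part of the input, the shared weights can still produce different behavior for different agents. In a fully decentralized training setup (DTDE), each agent would instead keep its own model and train only on its own trajectories.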
