After some steps, all the NNs always output same action #75

Eify666666 · 2021-02-27T02:47:05Z

I'm training a A3C these days, but the NN always take the same action, after some steps.
The game I train for is similar to playing Go. There will be few reward in the short term. So it hard to learn something useful for the NN form the game. Maybe that is where the problem is. I tried ' torch.nn.utils.clip_grad_norm(lnet.parameters(), 50) ', and used relu as activate function. But it doesn't work.

RuoyuG · 2022-11-04T03:19:45Z

I meet same problem, it looks like stuck in a local optimal. Do you solve it?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

After some steps, all the NNs always output same action #75

After some steps, all the NNs always output same action #75

Eify666666 commented Feb 27, 2021

RuoyuG commented Nov 4, 2022

After some steps, all the NNs always output same action #75

After some steps, all the NNs always output same action #75

Comments

Eify666666 commented Feb 27, 2021

RuoyuG commented Nov 4, 2022