Cannot reproduce results #24

Nicolas99-9 · 2019-08-06T07:37:07Z

I tried to run mario_a2c.py, mario_ppo.py and mario_curio.py but for non of them I cannot improve the reward.
Did you use the same hyper-parameters as in the files to conduct the evaluation? (i.e. number of workers, learning rate)
Which version of the libraries did you use ?

For instance, A2C without ICM: (after 3M time-steps)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot reproduce results #24

Cannot reproduce results #24

Nicolas99-9 commented Aug 6, 2019 •

edited

Loading

Cannot reproduce results #24

Cannot reproduce results #24

Comments

Nicolas99-9 commented Aug 6, 2019 • edited Loading

Nicolas99-9 commented Aug 6, 2019 •

edited

Loading