Can not reproduce SMAC results. #222

ZiyiLiubird · 2023-02-14T12:55:55Z

Hello, I can not reproduce the smac experimental results, e.g, corridor and 3m, illustrated in "https://github.com/Denys88/rl_games/blob/master/docs/SMAC.md" with the same hyper-parameters.

It seems that IPPO can't learn meaningful policies even in 3m (and corridor) after 10M steps, and the win rate is lower than 0.2 from beginning to end.

Denys88 · 2023-02-14T14:13:57Z

Hi,
It sounds strange. Btw original paper was using TF implementation. I'll try to find that checkpoint.
But 3m should be solved in less than 2 minute with much less steps.

Denys88 · 2023-02-15T01:34:32Z

https://github.com/Denys88/rl_games/tree/0871084d8d95954fa165dbe93eadb54773b7a36a this commit was used in paper ( tensorflow 1 implementation) You should be able to reproduce it.

For the pytorh Ill install smac envs on the weekends and will take a look what is wrong.

ZiyiLiubird · 2023-02-15T02:21:26Z

Thanks a lot! I'm looking forward to hearing back from you.

Denys88 · 2023-02-16T04:43:38Z

@ZiyiLiubird everything is fixed.
There were a small issue with reporting rewards. Thanks!
Please take a look at different configs. BUT if you want to get exact same results as in paper you need to use old implementation with TensorFlow. I used conv1d. In this tests I tried to use mlp + lstm and different env configurations.
It would be really nice if someone find good configurations in pytorch with central value too.

ZiyiLiubird · 2023-02-16T05:11:37Z

Hi @Denys88 Thank you for your patience and advice, and for bringing such an excellent work!

Denys88 mentioned this issue Feb 16, 2023

Fixed MA env reporting Including SC2 #224

Merged

ZiyiLiubird closed this as completed Feb 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can not reproduce SMAC results. #222

Can not reproduce SMAC results. #222

ZiyiLiubird commented Feb 14, 2023 •

edited

Loading

Denys88 commented Feb 14, 2023

Denys88 commented Feb 15, 2023 •

edited

Loading

ZiyiLiubird commented Feb 15, 2023

Denys88 commented Feb 16, 2023 •

edited

Loading

ZiyiLiubird commented Feb 16, 2023

Can not reproduce SMAC results. #222

Can not reproduce SMAC results. #222

Comments

ZiyiLiubird commented Feb 14, 2023 • edited Loading

Denys88 commented Feb 14, 2023

Denys88 commented Feb 15, 2023 • edited Loading

ZiyiLiubird commented Feb 15, 2023

Denys88 commented Feb 16, 2023 • edited Loading

ZiyiLiubird commented Feb 16, 2023

ZiyiLiubird commented Feb 14, 2023 •

edited

Loading

Denys88 commented Feb 15, 2023 •

edited

Loading

Denys88 commented Feb 16, 2023 •

edited

Loading