-
Notifications
You must be signed in to change notification settings - Fork 167
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can not reproduce SMAC results. #222
Comments
Hi, |
https://github.com/Denys88/rl_games/tree/0871084d8d95954fa165dbe93eadb54773b7a36a this commit was used in paper ( tensorflow 1 implementation) You should be able to reproduce it. For the pytorh Ill install smac envs on the weekends and will take a look what is wrong. |
Thanks a lot! I'm looking forward to hearing back from you. |
@ZiyiLiubird everything is fixed. |
Hi @Denys88 Thank you for your patience and advice, and for bringing such an excellent work! |
Hello, I can not reproduce the smac experimental results, e.g, corridor and 3m, illustrated in "https://github.com/Denys88/rl_games/blob/master/docs/SMAC.md" with the same hyper-parameters.
It seems that IPPO can't learn meaningful policies even in 3m (and corridor) after 10M steps, and the win rate is lower than 0.2 from beginning to end.
The text was updated successfully, but these errors were encountered: