Regarding pretrained models #14
Comments
Hi @LostXine, interesting. Do you mind sharing your code for evaluating the pretrained models?
Hi @cheng-chi, thanks a lot for your response. I tried two versions:
Both behave the same way: only the model I trained myself works. I could also check the hash sum of the model I downloaded if you think that would be helpful. Thank you so much!
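For readers who want to run the same kind of check, the hash sum of a downloaded checkpoint could be computed roughly as follows. This is a minimal sketch using only the Python standard library; the function name `sha256_of_file` and the example path are hypothetical and are not part of the project.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks.

    Chunked reading keeps memory use low for large checkpoint files.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns empty bytes
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Hypothetical usage: run on both machines and compare the printed digests.
# print(sha256_of_file("path/to/downloaded_checkpoint.ckpt"))
```

Matching digests only show that the two files are byte-identical; they say nothing about whether the checkpoint is loaded or evaluated correctly.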
Hi @LostXine, I'm not sure exactly what the problem is in your script, but I have just created a script (fairly similar to yours) that can evaluate all provided checkpoints.
Hi @cheng-chi, thank you so much for your effort. I'll check it and get back to you soon.
Hi @cheng-chi
@LostXine Oh great! Good to know! I will probably add sorting for the keys in the future. Depending on YAML ordering is indeed a bit problematic.
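To illustrate why depending on key ordering can be fragile, here is a minimal sketch. It assumes that per-key observation features are concatenated into one input vector; the thread does not show the project's actual code, so the key names `agent_pos` and `image_feat` and the helper `concat_obs` are hypothetical.

```python
def concat_obs(obs, keys=None):
    """Concatenate per-key feature lists in a deterministic key order.

    If no explicit key order is given, keys are sorted so the result does
    not depend on the order in which a config file happened to list them.
    """
    ordered = sorted(obs) if keys is None else list(keys)
    out = []
    for key in ordered:
        out.extend(obs[key])
    return out


# Same observations, listed in two different orders (hypothetical keys).
obs_a = {"agent_pos": [0.1, 0.2], "image_feat": [1.0, 2.0, 3.0]}
obs_b = {"image_feat": [1.0, 2.0, 3.0], "agent_pos": [0.1, 0.2]}

# Naive concatenation follows insertion order, so the two vectors differ.
naive_a = [x for key in obs_a for x in obs_a[key]]
naive_b = [x for key in obs_b for x in obs_b[key]]

# Sorted concatenation yields the same vector for both orderings.
sorted_a = concat_obs(obs_a)
sorted_b = concat_obs(obs_b)
```

A network trained with one feature layout and evaluated with another would receive valid-looking but permuted inputs, which would be consistent with plausible actions that still fail the task.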
Hello,
Thanks again for this great project. It would be great if you could help diagnose this issue regarding the pretrained models.
When I evaluate your pretrained models for the hybrid CNN setting, they do not work properly on Push-T, Transport ph, and Transport mh. There could be more, but I haven't tried them yet.
Basically, the action trajectory looks relatively reasonable (not random noisy actions), but the agent just cannot finish the task (Push-T mean score: 0.09, Transport mean score: 0).
However, when I train a model from scratch and evaluate it, it works fine, which suggests that the evaluation code is correct.
I tested the models on two machines at different locations, and all of them were downloaded directly from your website. I also performed an integrity check and confirmed that the two copies of the models on the two machines are identical. The training code loads the model file properly and the number of epochs matches the filename, but the model just does not generate the correct actions. After days of debugging, I could not find any further directions to look into.
So could you please share some insights on what may cause this issue?
Thank you so much!
Best regards,