Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

have you ever try to train swin large? #12

Open
Dongshengjiang opened this issue Oct 12, 2021 · 3 comments
Open

have you ever try to train swin large? #12

Dongshengjiang opened this issue Oct 12, 2021 · 3 comments

Comments

@Dongshengjiang
Copy link

I found swin_large_patch4_window7_224.yaml config file in your code. Here is an interesting question that how about the performance for larger mode?

@shallowtoil
Copy link

shallowtoil commented Oct 14, 2021

Hi @ChunyuanLI @jwyang, I'm also wondering about the linear probe accuracy of DINO/EsViT with Swin-L. Have you ever run any related experiments?

@ChunyuanLI
Copy link
Contributor

I ran the experiments on EsViT (Swin-L) once, but did not get better results than the best number 81.3% reported in our paper.

@Dongshengjiang
Copy link
Author

can you provide the final results of swin-L for 300 epoch, I got knn 74.6 at epoch 180 of 300 which is lower than msg_small at same epoch. I suspected there could be two reason, one is the 1k data is not enough for the big model to coverage; second, maybe swin_large need bigger drop path?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants