Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

re-implement Stand-Alone Self-Attention model #6

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

d-li14
Copy link

@d-li14 d-li14 commented Sep 6, 2020

Hi, @csrhddlam
As we discussed before, I am trying to re-implement the baseline "Conv-stem+Attention" in Stand-Alone Self-Attention in Vision Models, which is referred in your paper.
Could you please help check the correctness? It will be better if you could provide further optimization of this implementation. Thanks!

@csrhddlam
Copy link
Owner

Hi, @d-li14
Thanks for contributing. It looks correct to me, but the unfolding implementation could take a lot of memory. Could you check if the model really runs on 224x224 images and if it can reproduce the results in the paper?
Thanks!

@d-li14
Copy link
Author

d-li14 commented Sep 6, 2020

Yes, it is very memory-consuming, a simple test shows more than 7G memory is used with 8 images per GPU.
I will try to verify the accuracy of this model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants