re-implement Stand-Alone Self-Attention model #6

d-li14 · 2020-09-06T13:33:10Z

Hi, @csrhddlam
As we discussed before, I am trying to re-implement the baseline "Conv-stem+Attention" in Stand-Alone Self-Attention in Vision Models, which is referred in your paper.
Could you please help check the correctness? It will be better if you could provide further optimization of this implementation. Thanks!

csrhddlam · 2020-09-06T15:03:01Z

Hi, @d-li14
Thanks for contributing. It looks correct to me, but the unfolding implementation could take a lot of memory. Could you check if the model really runs on 224x224 images and if it can reproduce the results in the paper?
Thanks!

d-li14 · 2020-09-06T16:12:45Z

Yes, it is very memory-consuming, a simple test shows more than 7G memory is used with 8 images per GPU.
I will try to verify the accuracy of this model.

add Stand-Alone Self-Attention model

b677312

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

re-implement Stand-Alone Self-Attention model #6

re-implement Stand-Alone Self-Attention model #6

d-li14 commented Sep 6, 2020

csrhddlam commented Sep 6, 2020

d-li14 commented Sep 6, 2020

re-implement Stand-Alone Self-Attention model #6

Are you sure you want to change the base?

re-implement Stand-Alone Self-Attention model #6

Conversation

d-li14 commented Sep 6, 2020

csrhddlam commented Sep 6, 2020

d-li14 commented Sep 6, 2020