Skip to content

Enable offloading multi-query attention by Flash Attention#990

Merged
vinx13 merged 6 commits intomlc-ai:mainfrom
masahi:flash-attn-mqa
Oct 4, 2023
Merged

Enable offloading multi-query attention by Flash Attention#990
vinx13 merged 6 commits intomlc-ai:mainfrom
masahi:flash-attn-mqa

Commits

Commits on Sep 26, 2023

Commits on Sep 27, 2023

Commits on Sep 28, 2023