Enable offloading multi-query attention by Flash Attention #990
Merged
vinx13 merged 6 commits into mlc-ai:main on Oct 4, 2023