[Performance] Support MQA/GQA in decode phase by using FlashAttention#2744
Closed
zhaoyang-star wants to merge 11 commits intovllm-project:mainfrom
Closed
[Performance] Support MQA/GQA in decode phase by using FlashAttention#2744zhaoyang-star wants to merge 11 commits intovllm-project:mainfrom
zhaoyang-star wants to merge 11 commits intovllm-project:mainfrom
Commits
Commits on Jan 17, 2024
- authored andcommitted

Commits on Jan 23, 2024
- committed
zhaoyang-star - committed
zhaoyang-star - committed
zhaoyang-star
Commits on Feb 2, 2024
Commits on Feb 4, 2024
- committed
- committed
- committed
- committed
- committed
Commits on Feb 5, 2024
- committed