Skip to content

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support#27246

Merged
tianleiwu merged 8 commits into
mainfrom
tlwu/gqa_xqa_quantized_kv_cache
Feb 11, 2026
Merged

[CUDA] GroupQueryAttention with XQA and Quantized KV Cache Support#27246
tianleiwu merged 8 commits into
mainfrom
tlwu/gqa_xqa_quantized_kv_cache

review feedback

4ac7cfd
Select commit
Loading
Failed to load commit list.
GitHub Advanced Security / CodeQL completed Feb 9, 2026 in 1m 54s

1 configuration not found

Warning: Code scanning cannot determine the alerts introduced by this pull request, because 1 configuration present on refs/heads/main was not found:

API upload

  • ❓  <default>

View all branch alerts.