[CUDA] Support FP8 (E4M3) KV Cache for Group Query Attention#27321

Merged
tianleiwu merged 12 commits into main from tlwu/20260211/gqa_fp8_kv_cache
Feb 18, 2026

fix build

2a25780
GitHub Advanced Security / CodeQL completed Feb 14, 2026 in 4s

1 configuration not found

Warning: Code scanning cannot determine the alerts introduced by this pull request, because 1 configuration present on refs/heads/main was not found:

API upload

  • ❓  <default>
