[CUDA] PagedAttention: add SM<80 fp16 fallback via memory-efficient attention#28200
Merged
tianleiwu merged 10 commits intoApr 28, 2026
Merged
GitHub Advanced Security / lintrunner
succeeded
Apr 25, 2026 in 1s
No new alerts in code changed by this pull request
Loading