[CUDA] PagedAttention: add SM<80 fp16 fallback via memory-efficient attention #28200

Merged
tianleiwu merged 10 commits into microsoft:main from elwhyjay:feature/paged-attention-mea-fallback
Apr 28, 2026

Commits

Commits on Apr 23, 2026

Commits on Apr 24, 2026