Skip to content

[Bugfix] Revert "Zero-init MLA attention output buffers to prevent NaN from CUDA graph padding"#38359

Merged
tlrmchlsmth merged 3 commits intovllm-project:mainfrom
elvircrn:revert-mla-zero-init
Apr 1, 2026
Merged

[Bugfix] Revert "Zero-init MLA attention output buffers to prevent NaN from CUDA graph padding"#38359
tlrmchlsmth merged 3 commits intovllm-project:mainfrom
elvircrn:revert-mla-zero-init

Commits

Commits on Mar 27, 2026

Commits on Mar 28, 2026

Commits on Mar 30, 2026