[Hopper] optimize decoding performance for headdim 128 fp8 #96

Merged
LucasWilkinson merged 1 commit into vllm-project:main from jmkuebler:headdim128_fp8_optim
Sep 29, 2025

Commits

Commits on Sep 29, 2025