[ROCm][DSv4] Share AITER decode dequant + fp8-cast buffers across layers (rebased, stacked on #902) #903

Draft
ChuanLi1101 wants to merge 9 commits into ROCm:hexwang/dsv4_adapt_upstream from
ChuanLi1101:chuali/aiter-mla-dsv4-decode-cudagraph-workspace-rebased

Commits

Commits on Apr 28, 2026