[ROCM] Optimized deepseek-r1 fp8 model with + triton_gemm_a8w8 + batch_gemm_a8w8 + fused set_mla_kv_buffer kernel#13617
Merged
HaiShaw merged 6 commits intosgl-project:mainfrom Nov 20, 2025
Commits
Commits on Nov 19, 2025
- committed
root
Commits on Nov 20, 2025
- authored andcommitted

- committed
- committed
- committed
- authored