[TRTLLM-6994][feat] FP8 Context MLA integration (Cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/6059 from release/1.1.0rc2) #7610

Merged
yuxianq merged 4 commits into NVIDIA:main from yuxianq:fp8-context-mla-main on Sep 19, 2025

Commits

Commits on Sep 16, 2025

Commits on Sep 17, 2025

Commits on Sep 18, 2025