perf: Update trtllm-gen batched GEMM kernels - faster, more NVFP4 tile dims, MXFP8 with relu2 act #2667

Merged
aleozlx merged 6 commits into flashinfer-ai:main from amitz-nv:update-batched-gemm-faster-gemms
Mar 3, 2026