[TRTLLM-Gen Fmha] add optimized trtllm-gen decode kernels for high throughput + speculative decoding #2265

Merged
yzh119 merged 6 commits into flashinfer-ai:main from PerkzZheng:user/perkzz/trtllm-gen-groups-tokens-heads
Jan 7, 2026

Commits

Commits on Dec 24, 2025

Commits on Jan 7, 2026