[TRTLLM-6854][feat] Enable guided decoding with CUDA graph padding and draft model chunked prefill #6774

Merged
syuoni merged 1 commit into NVIDIA:main from syuoni:guided-with-cuda-graph
Aug 12, 2025

Commits

Commits on Aug 11, 2025