Skip to content

[Perf] Enable full CUDA graphs for spec decoding with FlashInfer#26937

Open
benchislett wants to merge 6 commits intovllm-project:mainfrom
CentML:flashinfer-spec-fullgraph
Open

[Perf] Enable full CUDA graphs for spec decoding with FlashInfer#26937
benchislett wants to merge 6 commits intovllm-project:mainfrom
CentML:flashinfer-spec-fullgraph

Commits

Commits on Oct 15, 2025

Commits on Oct 16, 2025

Commits on Oct 21, 2025

Commits on Oct 22, 2025