[Perf] Enable full CUDA graphs for spec decoding with FlashInfer#26937
Open
benchislett wants to merge 6 commits intovllm-project:mainfrom
Open
[Perf] Enable full CUDA graphs for spec decoding with FlashInfer#26937benchislett wants to merge 6 commits intovllm-project:mainfrom
benchislett wants to merge 6 commits intovllm-project:mainfrom