Skip to content

Vllm add dflash + Optimize draft models (CUDA graph management)#36733

Closed
HaizhouPeng wants to merge 35 commits into
vllm-project:mainfrom
HaizhouPeng:vllm-add-dflash
Closed

Vllm add dflash + Optimize draft models (CUDA graph management)#36733
HaizhouPeng wants to merge 35 commits into
vllm-project:mainfrom
HaizhouPeng:vllm-add-dflash

Refactor input handling in DFlash model

43f315d
Select commit
Loading
Failed to load commit list.
Meta CodeSync / Meta Internal-Only Changes Check succeeded Mar 12, 2026 in 0s

There is no internal Diff connected, this can be merged now