[Spec Decode] Allow DFlash drafter to coexist with quantized target KV via independent KV groups + dtype override #42102

Open
noonghunna wants to merge 2 commits into vllm-project:main from noonghunna:dflash-noncausal-kv-quant