Introduce speculative decoding with draft models to vLLM #3029
Closed
sighingnow wants to merge 3 commits into vllm-project:main from