Skip to content

perf: Avoid copying inputs_embeds tensors to GPU unless prompt_embeds is enabled#25739

Merged
vllm-bot merged 2 commits intovllm-project:mainfrom
protopia-ai:only-copy-to-gpu-on-prompt-embeds
Sep 26, 2025
Merged

perf: Avoid copying inputs_embeds tensors to GPU unless prompt_embeds is enabled#25739
vllm-bot merged 2 commits intovllm-project:mainfrom
protopia-ai:only-copy-to-gpu-on-prompt-embeds

Commits