offload prompt_embeds decode in render_prompts_async to avoid blocking #43792

Merged
DarkLight1337 merged 3 commits into vllm-project:main from gagandhakrey:perf/render-prompts-async-offload
May 30, 2026

Commits

Commits on May 29, 2026