Skip to content

[Fixbug][Perf] Qwen3-omni: code predictor with re-prefill + SDPA and eliminate decode hot-path CPU round-trips#2012

Merged
hsliuustc0106 merged 4 commits into
vllm-project:mainfrom
LJH-LBJ:refactor/qwen3-omni-code-predictor-v2
Mar 20, 2026
Merged

[Fixbug][Perf] Qwen3-omni: code predictor with re-prefill + SDPA and eliminate decode hot-path CPU round-trips#2012
hsliuustc0106 merged 4 commits into
vllm-project:mainfrom
LJH-LBJ:refactor/qwen3-omni-code-predictor-v2

Commits

Commits on Mar 19, 2026