[BugFix] Continue decode if don't need transfer kv cache between two … by princepride · Pull Request #2502 · vllm-project/vllm-omni

princepride · 2026-04-05T12:21:16Z

…stages

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.

Purpose

While the previous update resolved the synchronization error in KV cache transfers across stages, it introduced a side effect where Stage 0 output is terminated unconditionally, even when no transmission is occurring and img2text and text2text task can not output any text.

Test Plan

python3 examples/offline_inference/bagel/end2end.py   --modality text2text   --prompts "Where is the capital of France?"

Test Result

** before: **

** after: **

The capital of France is Paris.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan. Please provide the test scripts & test commands. Please state the reasons if your codes don't require additional test scripts. For test file guidelines, please check the test style doc
The test results. Please paste the results comparison before and after, or the e2e results.
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model. Please run mkdocs serve to sync the documentation editions to ./docs.
(Optional) Release notes update. If your change is user-facing, please update the release notes draft.

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

…stages Signed-off-by: princepride <wangzhipeng628@gmail.com>

princepride · 2026-04-05T12:25:52Z

@natureofnature @lishunyang12 PTAL

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f6ec96f30a

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Signed-off-by: princepride <wangzhipeng628@gmail.com>

…tion deserialize_additional_information() reconstructs all entries (including tensors from bytes) on every call. Since _request_omits_kv_transfer_to_next_stage is invoked on each scheduler tick, this caused unnecessary CPU copies and memory churn during decode. Cache the boolean per request and clean up in _free_request. Signed-off-by: princepride <wangzhipeng628@gmail.com> Made-with: Cursor

hsliuustc0106

Has regression tests (test_bagel_understanding.py). Logic looks correct.

princepride · 2026-04-06T05:04:02Z

@hsliuustc0106 Can you help approve it?

vllm-project#2502) Signed-off-by: princepride <wangzhipeng628@gmail.com>

vllm-project#2502) Signed-off-by: princepride <wangzhipeng628@gmail.com> Signed-off-by: bob-021206 <binyan_github@163.com>

vllm-project#2502) Signed-off-by: princepride <wangzhipeng628@gmail.com>

[BugFix] Continue decode if don't need transfer kv cache between two …

f6ec96f

…stages Signed-off-by: princepride <wangzhipeng628@gmail.com>

princepride requested a review from hsliuustc0106 as a code owner April 5, 2026 12:21

chatgpt-codex-connector Bot reviewed Apr 5, 2026

View reviewed changes

Comment thread vllm_omni/core/sched/omni_ar_scheduler.py Outdated

princepride added 2 commits April 5, 2026 15:34

add text2text and img2text task

24b6e2d

Signed-off-by: princepride <wangzhipeng628@gmail.com>

hsliuustc0106 reviewed Apr 5, 2026

View reviewed changes

princepride added the ready label to trigger buildkite CI label Apr 6, 2026

princepride requested a review from hsliuustc0106 April 6, 2026 08:07

hsliuustc0106 merged commit 8dd66ce into vllm-project:main Apr 7, 2026
8 checks passed

skf-1999 pushed a commit to Semmer2/vllm-omni that referenced this pull request Apr 7, 2026

[BugFix] Continue decode if don't need transfer kv cache between two … (

34ac909

vllm-project#2502) Signed-off-by: princepride <wangzhipeng628@gmail.com>

vraiti pushed a commit to vraiti/vllm-omni that referenced this pull request Apr 9, 2026

[BugFix] Continue decode if don't need transfer kv cache between two … (

60ffce2

vllm-project#2502) Signed-off-by: princepride <wangzhipeng628@gmail.com>

lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026

[BugFix] Continue decode if don't need transfer kv cache between two … (

96fb878

vllm-project#2502) Signed-off-by: princepride <wangzhipeng628@gmail.com>

clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026

[BugFix] Continue decode if don't need transfer kv cache between two … (

83e7f81

vllm-project#2502) Signed-off-by: princepride <wangzhipeng628@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] Continue decode if don't need transfer kv cache between two …#2502

[BugFix] Continue decode if don't need transfer kv cache between two …#2502
hsliuustc0106 merged 3 commits into
vllm-project:mainfrom
princepride:continue-decode-if-donnot-transfer-kv

princepride commented Apr 5, 2026 •

edited

Loading

Uh oh!

princepride commented Apr 5, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

hsliuustc0106 left a comment

Uh oh!

princepride commented Apr 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

princepride commented Apr 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

princepride commented Apr 5, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

hsliuustc0106 left a comment

Choose a reason for hiding this comment

Uh oh!

princepride commented Apr 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

princepride commented Apr 5, 2026 •

edited

Loading