[CI failure] Revert "[BugFix] Fix the issue of thinker requests being preempted, causing shape mismatch." by Gaohan123 · Pull Request #3648 · vllm-project/vllm-omni

Gaohan123 · 2026-05-15T13:35:57Z

Reverts #3147 due to CI failure: https://buildkite.com/vllm/vllm-omni/builds/9719/canvas?sid=019e2aed-c3cc-45e8-8424-71d4506adbd4&tab=output

…ausing s…" This reverts commit e7ee5de.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 145dc177b4

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-05-15T13:40:33Z

        task = {
            "pooling_output": pooling_output,
            "request": request,


Restore preemption guard before enqueuing async chunks

save_async now unconditionally enqueues every chunk task, so when async-chunk requests are preempted and request.num_computed_tokens rolls back, stale chunks can be sent again while put_req_chunk continues forward. Those duplicate/stale payloads are then merged on the receiver side via tensor/list concatenation, which can desynchronize thinker/talker sequence shapes and reintroduce the preemption shape-mismatch failure under normal preempt-and-resume scheduling.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-15T13:40:33Z

+        if start_index >= len(thinker_output_token_ids) - 1:
+            # When the tokens output by the thinker are exhausted, an EOS token needs to be appended.


Avoid emitting EOS before the final decode embedding

The new exhaustion check uses start_index >= len(ids.output) - 1, which marks decode as exhausted one step early. Because decode handoff starts with start_index=1, chunks with only 1–2 thinker output tokens (a common early/short-response case) immediately emit EOS/pad and skip projecting actual decode embeddings, leading to truncated or empty spoken output.

Useful? React with 👍 / 👎.

amy-why-3459 · 2026-05-15T13:48:17Z

Please wait.

hsliuustc0106 · 2026-05-15T14:00:07Z

check #3650

hsliuustc0106 · 2026-05-15T14:50:49Z

#3650 merged

Revert "[BugFix] Fix the issue of thinker requests being preempted, c…

145dc17

…ausing s…" This reverts commit e7ee5de.

Gaohan123 requested review from ZeldaHuang, gcanlin, linyueqian, princepride, tzhouam, yenuo26 and yuanheng-zhao as code owners May 15, 2026 13:35

Gaohan123 added ready label to trigger buildkite CI merge-test label to trigger buildkite merge test CI labels May 15, 2026

chatgpt-codex-connector Bot reviewed May 15, 2026

View reviewed changes

hsliuustc0106 added the duplicate This issue or pull request already exists label May 15, 2026

hsliuustc0106 closed this May 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI failure] Revert "[BugFix] Fix the issue of thinker requests being preempted, causing shape mismatch."#3648

[CI failure] Revert "[BugFix] Fix the issue of thinker requests being preempted, causing shape mismatch."#3648
Gaohan123 wants to merge 1 commit into
mainfrom
revert-3147-bugfix

Gaohan123 commented May 15, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

amy-why-3459 commented May 15, 2026

Uh oh!

hsliuustc0106 commented May 15, 2026

Uh oh!

hsliuustc0106 commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		if start_index >= len(thinker_output_token_ids) - 1:
		# When the tokens output by the thinker are exhausted, an EOS token needs to be appended.

Conversation

Gaohan123 commented May 15, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

amy-why-3459 commented May 15, 2026

Uh oh!

hsliuustc0106 commented May 15, 2026

Uh oh!

hsliuustc0106 commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants