
Conversation

@njhill njhill commented Nov 7, 2025

This is a re-apply of #28012, which was reverted in #28289 due to a bug in aggregating KV connector outputs that broke, for example, Nixl P/D for TP > 1.

The second commit fixes that issue. The original PR was itself a fix for a significant performance regression.

As a follow-on, I will likely refactor this a little further and improve the test coverage.
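
For context on the bug that triggered the revert, the snippet below is a rough illustration of why per-rank KV connector outputs need aggregating when TP > 1. The function name and behavior are hypothetical stand-ins, not vLLM's actual KVOutputAggregator, assuming a request only counts as finished once every tensor-parallel rank has reported it.

```python
# Hypothetical illustration of per-rank KV-transfer aggregation for TP > 1;
# not vLLM's KVOutputAggregator. A request is treated as finished only after
# all tensor-parallel ranks have reported it.
from collections import Counter


def aggregate_finished(per_rank_finished: list[set[str]],
                       world_size: int) -> set[str]:
    """Return request IDs that every tensor-parallel rank reported finished."""
    counts: Counter = Counter()
    for finished in per_rank_finished:
        counts.update(finished)
    return {req_id for req_id, n in counts.items() if n == world_size}


# With TP=2, "req-a" finished on both ranks but "req-b" on only one so far:
assert aggregate_finished([{"req-a", "req-b"}, {"req-a"}], world_size=2) == {"req-a"}
```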


@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request refactors the multiprocessing executor to remove the I/O thread pool, aiming to fix a performance regression. The core change involves a new threadless FutureWrapper and a manual future queue to manage asynchronous RPC calls.

While this is a clever way to avoid thread overhead, the new FutureWrapper implementations in both multiproc_executor.py and ray_utils.py have dropped support for timeouts, which is a functional regression from the standard Future API. Furthermore, the interaction between KVOutputAggregator and the new FutureWrapper in multiproc_executor is complex, tightly coupled, and introduces a critical bug that will cause a crash if a timeout is used. I've provided detailed comments and suggestions to address these issues.
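
To make the review comment above concrete, here is a minimal sketch of the "threadless future" pattern it describes, assuming RPC responses arrive in the order the calls were issued. The names (`FutureWrapper`, `pending`, `recv`) and the `queue.Queue` transport in the demo are illustrative stand-ins, not vLLM's actual multiproc_executor.py code; the sketch also shows one way a `timeout` could be threaded through, since the review flags that the real implementation dropped it.

```python
import queue
from collections import deque
from concurrent.futures import Future
from typing import Any, Callable, Optional


class FutureWrapper(Future):
    """A Future completed lazily by whichever caller asks for its result."""

    def __init__(self, pending: deque, recv: Callable[[Optional[float]], Any]):
        super().__init__()
        self._pending = pending   # futures still awaiting responses, oldest first
        self._recv = recv         # blocking read of the next RPC response
        pending.append(self)

    def result(self, timeout: Optional[float] = None) -> Any:
        # Responses come back in the order the RPCs were issued, so complete
        # earlier pending futures first, then our own. Passing the timeout
        # into recv() is one way to keep the standard Future semantics.
        # (Error/timeout propagation is left out of this sketch.)
        while not self.done():
            fut = self._pending.popleft()
            fut.set_result(self._recv(timeout))
        return super().result(timeout=0)


# Tiny self-contained demo; a queue.Queue stands in for the shm message queue:
responses: queue.Queue = queue.Queue()
pending: deque = deque()
f1 = FutureWrapper(pending, lambda t: responses.get(timeout=t))
f2 = FutureWrapper(pending, lambda t: responses.get(timeout=t))
responses.put("out-1")
responses.put("out-2")
assert f2.result() == "out-2"   # draining completes f1 along the way
assert f1.result() == "out-1"
```

On result(), the caller itself drains earlier pending futures from the shared deque before completing its own, so no dedicated thread has to spin on the shared-memory queue.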


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.


Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
@njhill njhill changed the title from "[PerfFix] Avoid separate thread for MP executor shm spin" to "[PerfFix] Avoid separate thread for MP executor shm spin (take 2)" on Nov 7, 2025
@njhill njhill added the ready label (ONLY add when PR is ready to merge/full CI is needed) Nov 7, 2025
@njhill njhill enabled auto-merge (squash) November 7, 2025 21:04
@njhill njhill merged commit 67a2da8 into vllm-project:main Nov 7, 2025
54 checks passed
@njhill njhill deleted the reapply-mp-perf-fix branch November 7, 2025 22:11
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Nov 13, 2025
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

Labels

kv-connector, ready (ONLY add when PR is ready to merge/full CI is needed), v1

2 participants