
[Bugfix] Fix LOGITPROC_SOURCE_ENTRYPOINT test to use spawn-compatible dist-info registration for XPU/ROCm #42040

Merged
tjtanaa merged 5 commits into vllm-project:main from dzhengAP:bugfix/fix-entrypoint-spawn-compatible
May 9, 2026

Conversation

@dzhengAP (Contributor) commented May 8, 2026

Follow-up to #41423, also discussed in #41895.

Problem

test_custom_logitsprocs[LOGITPROC_SOURCE_ENTRYPOINT] and
test_rejects_custom_logitsprocs[LOGITPROC_SOURCE_ENTRYPOINT] relied on
fork-based monkey-patching of importlib.metadata.entry_points to inject a
fake logitproc entrypoint.

That works with VLLM_WORKER_MULTIPROC_METHOD=fork, but it is not compatible
with XPU/ROCm platforms where the tests need to run with spawn-based
multiprocessing. With spawn, the monkey-patched entrypoint state is not
inherited by worker subprocesses, so the fake custom logits processor entrypoint
cannot be discovered.

Fix

Replace the spawn path’s in-memory monkey-patch with a real temporary
.dist-info package written to disk and exposed through PYTHONPATH.

Since importlib.metadata discovers entrypoints from installed package metadata
on disk, spawned subprocesses can discover the fake logitproc entrypoint without
requiring fork.
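For reference, the `entry_points.txt` written into the dist-info uses INI-style group sections. The group and names below are illustrative, not necessarily the ones the vLLM tests use:

```ini
[vllm.logits_processors]
dummy_logitproc = my_pkg.module:DummyLogitsProcessor
```

Stock `importlib.metadata` parses this file from any `*.dist-info` directory found on the interpreter's search path, which is why a freshly spawned worker can discover the fake entry point without any patching.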

This PR adds/updates the shared fake-entrypoint setup in
tests/v1/logits_processors/utils.py to:

  1. Create a temporary .dist-info directory with METADATA and
    entry_points.txt.
  2. Add the temporary package directory to PYTHONPATH so spawned subprocesses
    can discover the entrypoint.
  3. Prepend the same directory to sys.path so the current driver process can
    discover the entrypoint as well.
  4. Use spawn-compatible registration when spawn multiprocessing is required.
  5. Keep the existing monkey-patched importlib.metadata.entry_points behavior
    for fork-based test execution.

The follow-up commits also apply this setup consistently across the custom
offline and online logits processor tests.

This makes the custom logits processor entrypoint tests compatible with
spawn-based multiprocessing and fixes the XPU/ROCm CI failures.

… dist-info registration

Signed-off-by: dqzhengAP <dqzheng1996@gmail.com>

@claude (Bot) left a comment

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@mergify (Bot) added the rocm (Related to AMD ROCm), intel-gpu (Related to Intel GPU), v1, and bug (Something isn't working) labels May 8, 2026
@github-project-automation github-project-automation Bot moved this to Todo in AMD May 8, 2026
@gemini-code-assist (Bot) left a comment

Code Review

This pull request refactors the custom logits processor tests to support spawned subprocesses by replacing manual monkey-patching of importlib.metadata.entry_points with a disk-based dist-info registration. A new utility function, register_fake_entrypoint, creates a temporary package and updates PYTHONPATH. Feedback indicates that sys.path should also be updated for the current process to ensure the driver process can successfully discover the entry point.

Review thread on tests/v1/logits_processors/utils.py:
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Signed-off-by: David Zheng <153074367+dzhengAP@users.noreply.github.com>
@zhenwei-intel (Contributor) commented May 8, 2026

tests/v1/logits_processors/test_custom_online.py
Could you please also handle this test?

@AndreasKaratzas (Collaborator) commented May 8, 2026

CI is blocked, so I could not wait for the author. Opened a second PR here with their commits as well to honor their contributions:

UPDATE: Author is back and people can officially call me impatient.

Signed-off-by: Andreas Karatzas <akaratza@amd.com>
(cherry picked from commit a093d02)

Signed-off-by: dqzhengAP <dqzheng1996@gmail.com>
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
(cherry picked from commit 82f2f93)
@dzhengAP (Contributor, Author) commented May 8, 2026

A good sign: the fix this PR focuses on passes Intel CI. The only failure is LoRA, which has already been discussed; it can be waived given the current XPU limitations in LoRA support. #41895 (comment)

Deeper insight: this Qwen3.5 dense model path uses a GDN/Mamba-style layer where LoRA projections are not supported on XPU. The correct fix is to skip this test on XPU, not to try to make it pass. @zhenwei-intel @jikunshang

@jikunshang (Collaborator) replied:

We disabled some LoRA cases on main; please rebase and check whether they pass.

@AndreasKaratzas (Collaborator) commented:

@dzhengAP could you rebase? I think the AMD docker build is having some very temporary issues.

@jikunshang (Collaborator) commented:

Rebased. Let's see what CI says.

@dzhengAP (Contributor, Author) commented May 9, 2026

Intel CI has all passed, but AMD CI is still running after 3 hours. Do we have any experience or an estimate of the typical AMD CI running time? @AndreasKaratzas @jikunshang

@AndreasKaratzas (Collaborator) commented:

@dzhengAP Yep, but AMD CI is like that (and it is not blocking). I was only interested in the blocking test group, and it is passing now. I am going to ping people in Slack.

@tjtanaa (Collaborator) left a comment:

LGTM

@tjtanaa tjtanaa merged commit df2636a into vllm-project:main May 9, 2026
17 checks passed
@github-project-automation github-project-automation Bot moved this from Todo to Done in AMD May 9, 2026

Labels

bug: Something isn't working
intel-gpu: Related to Intel GPU
ready: ONLY add when PR is ready to merge/full CI is needed
rocm: Related to AMD ROCm
v1

Projects

Status: Done

Development


5 participants