[Bugfix] Limit Qwen-Image-Edit-2511 input image count #2840

Merged
hsliuustc0106 merged 18 commits into vllm-project:main from david6666666:codex/issue-2793-qwen-image-edit-oom
Apr 17, 2026

Conversation

@david6666666
Collaborator

@david6666666 david6666666 commented Apr 16, 2026

Summary

  • Limit QwenImageEditPlusPipeline to at most 4 input images during pre-processing.
  • Fail early with a clear validation error instead of hitting deeper OOM / sequence-length failures.
  • Add a unit test covering the over-limit case.

Root Cause

Qwen-Image-Edit-2511 accepts multi-image inputs, but very large image counts can blow past the practical prompt/conditioning limits for this pipeline and eventually surface as OOM or deeper runtime failures. The fastest and smallest safe fix is to reject oversized requests at the input validation boundary.
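A minimal sketch of the kind of validation gate this describes (the helper name and constant here are hypothetical, not the PR's actual code; the error message wording is taken from the E2E test result in this PR):

```python
# Hypothetical sketch of the early validation gate; the actual helper
# name and location live in the PR diff.
MAX_EDIT_INPUT_IMAGES = 4


def validate_edit_input_image_count(images: list) -> None:
    """Fail early, before any image decoding, if the request is oversized."""
    if len(images) > MAX_EDIT_INPUT_IMAGES:
        raise ValueError(
            f"Received {len(images)} input images. At most "
            f"{MAX_EDIT_INPUT_IMAGES} images are supported by this model."
        )
```

Running this check at the input validation boundary means oversized requests never reach image loading or the inference core.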

Why This Fix

This keeps the change minimal and low-risk:

  • one early validation gate in the existing pre-processing path
  • no changes to the inference core
  • clearer user-facing failure mode

Validation

  • pytest -q tests/diffusion/models/qwen_image/test_qwen_image_edit_plus.py

Test Plan

  • Start a local API server with the local Qwen-Image-Edit-2511 checkpoint via:
    CUDA_VISIBLE_DEVICES=7 PYTHONPATH=/mnt/data4/cwq/worktree/issue2793-qwen-image-edit-oom python -m vllm_omni.entrypoints.cli.main serve /mnt/data1/huggingface/hub/models--Qwen--Qwen-Image-Edit-2511/snapshots/6f3ccc0b56e431dc6a0c2b2039706d7d26f22cb9 --omni --port 8023 --uvicorn-log-level warning
  • Send a /v1/images/edits request with 5 input images via curl and verify the request is rejected with a 400 validation error.
  • Send a /v1/images/edits request with 4 input images via curl and verify the request succeeds and returns a generated image payload.

Test Result

  • pytest -q tests/diffusion/models/qwen_image/test_qwen_image_edit_plus.py
    • Passed.
  • E2E with vllm serve + curl against local Qwen-Image-Edit-2511 on GPU 7:
    • 5-image request returned 400 with:
      Received 5 input images. At most 4 images are supported by this model.
    • 4-image request returned 200 and produced a valid 512x512 PNG response.

Fixes #2793.

@david6666666 david6666666 changed the title [codex] Limit Qwen-Image-Edit-2511 input image count [Bugfix] Limit Qwen-Image-Edit-2511 input image count Apr 16, 2026
@david6666666 david6666666 force-pushed the codex/issue-2793-qwen-image-edit-oom branch from c6987f0 to 543349c Compare April 16, 2026 07:41
@david6666666 david6666666 marked this pull request as ready for review April 16, 2026 08:15
@david6666666 david6666666 added the ready label to trigger buildkite CI label Apr 16, 2026
@hsliuustc0106
Collaborator

Blocking Issues

  1. [Reliability/Safety] vllm_omni/entrypoints/openai/api_server.py:1669 - _get_max_edit_input_images hardcodes return 4 without any model-specific lookup. This function should query the OD config or diffusion pipeline for the actual limit per model, otherwise future models with different limits will break or need manual updates to this helper.

VERDICT: REQUEST_CHANGES

The validation logic is correct, but _get_max_edit_input_images hardcodes return 4. This should be model-configurable - either query the OD config or the diffusion pipeline's limit instead of hardcoding per PR.

Why is 4 the right limit? The PR mentions "practical prompt/conditioning limits" but doesn't show calculations. Consider adding a comment or linking to the issue discussion explaining the sequence-length/math behind this threshold.

@david6666666
Collaborator Author

> Blocking Issues
>
> 1. [Reliability/Safety] vllm_omni/entrypoints/openai/api_server.py:1669 - _get_max_edit_input_images hardcodes return 4 without any model-specific lookup. This function should query the OD config or diffusion pipeline for the actual limit per model, otherwise future models with different limits will break or need manual updates to this helper.
>
> VERDICT: REQUEST_CHANGES
>
> The validation logic is correct, but _get_max_edit_input_images hardcodes return 4. This should be model-configurable - either query the OD config or the diffusion pipeline's limit instead of hardcoding per PR.
>
> Why is 4 the right limit? The PR mentions "practical prompt/conditioning limits" but doesn't show calculations. Consider adding a comment or linking to the issue discussion explaining the sequence-length/math behind this threshold.

fixed

Collaborator

@lishunyang12 lishunyang12 left a comment

Looks good overall -- clean, minimal, and well-tested. The dual-layer validation (API server + pipeline) is the right approach. A few observations:

Positive

  • Early rejection before _load_input_images is a smart optimization -- avoids decoding/fetching images that will be rejected anyway.
  • Extracting _get_diffusion_od_config as a shared helper is a nice refactor that removes duplication.
  • Good test coverage: unit test on the pipeline pre-process path, plus two API-level tests (one confirming _load_input_images is never called).

Minor suggestions (non-blocking)

  1. _get_max_edit_input_images string matching is fragile. The "Qwen-Image-Edit-2511" in identifier substring check will match unrelated model names that happen to contain that substring (unlikely today, but brittle). Consider comparing against the canonical HF repo ID or using identifier.endswith(...) / an exact match against a known set.

  2. _get_diffusion_od_config is called twice when images exceed the limit: once inside _supports_multimodal_image_inputs and once directly in _get_max_edit_input_images. This is fine for correctness (cheap call), but you could cache the result in a local variable if you want to be tidy.

  3. The pipeline-level ValueError vs. the API-level HTTPException: if someone bypasses the API server and calls the pipeline directly with 5 images, they get a ValueError. That's reasonable, and the two error messages currently match because the message constant is shared -- just worth confirming they stay in sync going forward.

  4. Test file test_qwen_image_edit_plus.py -- mock VAE config is minimal. The test writes {"z_dim": 16} which is enough today, but if get_qwen_image_edit_plus_pre_process_func ever reads additional VAE config keys at init time, this test will break with a confusing error. A small comment in the test noting this is intentionally minimal would help future maintainers.

LGTM -- approving.
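The identifier-matching fix suggested in point 1 might look like the following sketch (the canonical repo ID and function name here are illustrative assumptions, not the PR's actual code):

```python
# Illustrative sketch of less-fragile identifier matching than a bare
# "substring in identifier" check; the repo ID below is an assumption.
MULTI_IMAGE_EDIT_MODELS = {"Qwen/Qwen-Image-Edit-2511"}


def is_known_multi_image_model(identifier: str) -> bool:
    # Exact match against a known set, falling back to a suffix check
    # so a local path ending in the model directory name still matches.
    return (
        identifier in MULTI_IMAGE_EDIT_MODELS
        or identifier.rstrip("/").endswith("Qwen-Image-Edit-2511")
    )
```

An exact-match set avoids false positives on unrelated model names that merely contain the substring.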

Collaborator

@SamitHuang SamitHuang left a comment

The dual-layer validation prevents OOM effectively. However, the model name is still hardcoded in api_server.py, which makes it brittle for future models and violates separation of concerns. Please fix this architectural issue before merging.

```python
    # then defer to the owning pipeline constant.
    od_config = _get_diffusion_od_config(raw_request, engine_client)
    model_identifiers = [model_name]
    if od_config is not None:
```

This string matching is fragile and hardcodes model-specific logic in the API server. Consider adding a generic attribute like max_multimodal_image_inputs to OmniDiffusionConfig or the model's configuration. This will keep the API server model-agnostic and prevent manual updates for future pipelines.
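A hedged sketch of the suggested approach (the attribute name max_multimodal_image_inputs comes from this comment; the config class and helper shown here are stand-ins, not the project's actual definitions):

```python
# Sketch of a model-agnostic limit lookup; OmniDiffusionConfig is
# simplified to the single suggested attribute for illustration.
from dataclasses import dataclass


@dataclass
class OmniDiffusionConfig:
    # Per-model limit; pipelines that support multi-image editing set this.
    max_multimodal_image_inputs: int = 1


def get_max_edit_input_images(od_config) -> int:
    # The API server stays model-agnostic: read the limit from config,
    # defaulting to 1 when the attribute is absent.
    return getattr(od_config, "max_multimodal_image_inputs", 1)
```

With this shape, adding a future pipeline with a different limit only touches its config, not the API server.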

```python
        return 1

    # Keep the API-side limit model-specific: this helper should not hardcode a
    # generic "multi-image means 4" rule because future edit pipelines may have
```

_get_diffusion_od_config is called twice, once here and once inside _supports_multimodal_image_inputs. Fetch od_config once at the beginning of the function to avoid redundant calls. You can check getattr(od_config, 'supports_multimodal_inputs', False) directly.

```python
def test_qwen_image_edit_plus_rejects_too_many_input_images(tmp_path: Path):
    vae_dir = tmp_path / "vae"
    vae_dir.mkdir()
    (vae_dir / "config.json").write_text(json.dumps({"z_dim": 16}))
```

This mock VAE config is extremely minimal. Add a brief comment indicating it is intentionally minimal. This helps future maintainers understand why the test might break if get_qwen_image_edit_plus_pre_process_func starts reading more keys at initialization.
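The suggested comment could look like this self-contained variant of the fixture setup (tempfile replaces pytest's tmp_path so the sketch runs standalone; the z_dim key follows the test snippet above):

```python
import json
import tempfile
from pathlib import Path

with tempfile.TemporaryDirectory() as tmp:
    vae_dir = Path(tmp) / "vae"
    vae_dir.mkdir()
    # Intentionally minimal mock VAE config: only z_dim is read at init
    # time today. If the pre-process function starts reading more keys,
    # extend this dict rather than debugging a confusing KeyError.
    (vae_dir / "config.json").write_text(json.dumps({"z_dim": 16}))
    loaded = json.loads((vae_dir / "config.json").read_text())
```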

Signed-off-by: david6666666 <530634352@qq.com>
@david6666666 david6666666 force-pushed the codex/issue-2793-qwen-image-edit-oom branch from 697ba08 to f414061 Compare April 17, 2026 02:55
@david6666666
Copy link
Copy Markdown
Collaborator Author

> The dual-layer validation prevents OOM effectively. However, the model name is still hardcoded in api_server.py, which makes it brittle for future models and violates separation of concerns. Please fix this architectural issue before merging.

fixed

Collaborator

@SamitHuang SamitHuang left a comment


LGTM, please fix the CI error.

@david6666666 david6666666 added ready label to trigger buildkite CI and removed ready label to trigger buildkite CI labels Apr 17, 2026
@Gaohan123
Copy link
Copy Markdown
Collaborator

Please fix CI failure. Thanks

@david6666666 david6666666 added ready label to trigger buildkite CI and removed ready label to trigger buildkite CI labels Apr 17, 2026
@hsliuustc0106 hsliuustc0106 disabled auto-merge April 17, 2026 12:07
@hsliuustc0106 hsliuustc0106 merged commit f658bcb into vllm-project:main Apr 17, 2026
5 of 8 checks passed
lvliang-intel pushed a commit to lvliang-intel/vllm-omni that referenced this pull request Apr 20, 2026
david6666666 added a commit that referenced this pull request Apr 20, 2026
 #2877 (#2878)
lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026
clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026

Labels

ready label to trigger buildkite CI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Qwen-Image-Edit OOM when inputting 20 images

5 participants