
[WIP][Model] Add Ming-flash-omni-2.0 Image Generation (Diffusion) Stage#2875

Open
ZhengWG wants to merge 14 commits into vllm-project:main from ZhengWG:py/ming-omni-dev

Conversation


@ZhengWG ZhengWG commented Apr 17, 2026

Purpose

This PR extends #1822 (Ming-flash-omni-2.0 Thinker stage) by adding the image generation (diffusion) stage for inclusionAI/Ming-flash-omni-2.0 (https://huggingface.co/inclusionAI/Ming-flash-omni-2.0), enabling end-to-end text-to-image generation.

Modified HF model repo to use: https://huggingface.co/Jonathan1909/Ming-flash-omni-2.0

cc @yuanheng-zhao

Usage

Start the server:

vllm serve /home/admin/model/ --omni \
--stage-configs-path vllm_omni/model_executor/stage_configs/ming_flash_omni_dual.yaml  \
--trust-remote-code --log-stats --port 8188 --host 0.0.0.0

Test request:

curl http://127.0.0.1:8188/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "/home/admin/model/",
    "messages": [
      {"role":"user","content":"Please draw a cute cat."}
    ],
    "modalities": ["image"]
  }' -o /tmp/ming_response.json


python -c "
import base64, json
r = json.load(open('/tmp/ming_response.json'))
url = r['choices'][0]['message']['content'][0]['image_url']['url']
png = base64.b64decode(url.split(',')[1])
open('/tmp/ming_cat.png', 'wb').write(png)
print('PNG bytes:', len(png))
"
(Generated sample image: ming_cat)

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan. Please provide the test scripts and test commands, or state the reasons if your code doesn't require additional test scripts. For test file guidelines, please check the test style doc.
  • The test results. Please paste the results comparison before and after, or the e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model. Please run mkdocs serve to sync the documentation editions to ./docs.
  • (Optional) Release notes update. If your change is user-facing, please update the release notes draft.


@ZhengWG ZhengWG requested a review from hsliuustc0106 as a code owner April 17, 2026 08:17
@chatgpt-codex-connector

Codex usage limits have been reached for code reviews. Please ask the admins of this repo to increase the limits by adding credits; credits are required to enable repository-wide code reviews.

@hsliuustc0106
Collaborator

This PR is marked [WIP]. Ready for full review when work-in-progress status is removed.

Missing PR body evidence (required):

Before review can proceed, please add the following to the PR description:

  • vLLM-Omni generation script (offline Omni or online vllm serve)
  • Generated sample output (image)
  • vLLM-Omni e2e latency (hardware: GPU model, count; resolution; steps)
  • vLLM-Omni peak VRAM usage (GB)

See Diffusion Model Requirements for details.

Preliminary scan available on request once the above evidence is provided.

@hsliuustc0106
Collaborator

Ready for full review once the WIP status is removed. Preliminary scan available on request.

When ready for review, please ensure PR description includes:

  • Generation script (offline or online)
  • Sample outputs (you already have an image - good)
  • End-to-end latency (hardware specs, resolution, steps)
  • Peak VRAM usage

Also consider adding a diffusers baseline comparison for performance numbers.

@hsliuustc0106
Collaborator

FYI, #1822 is merged now; please resolve conflicts.

@yuanheng-zhao
Contributor

The thinker stage has been merged to main; let's rebase onto main, cutting off the thinker-stage changes.
For example, git rebase --onto main the-thinker-branch your-current-branch
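The suggested `git rebase --onto` can be tried safely in a throwaway repo first. A minimal sketch with hypothetical branch names (`thinker` and `feature` stand in for the real thinker branch and this PR's branch):

```shell
# Demonstrate `git rebase --onto`: replay only the commits unique to the
# feature branch (those after the thinker branch tip) onto main.
# Branch names are hypothetical stand-ins; requires git >= 2.28 for `init -b`.
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q -b main
git config user.email you@example.com
git config user.name you
echo base > base.txt; git add .; git commit -qm "base"
git checkout -qb thinker
echo thinker > thinker.txt; git add .; git commit -qm "thinker work"
git checkout -qb feature
echo diffusion > diffusion.txt; git add .; git commit -qm "diffusion work"
git checkout -q main
echo merged > merged.txt; git add .; git commit -qm "thinker merged into main"
# Move only thinker..feature (here, just "diffusion work") onto main:
git rebase -q --onto main thinker feature
git log --format=%s   # diffusion work / thinker merged into main / base
```

After the rebase, `feature` no longer carries its own copy of the thinker commits, which avoids the conflicts that a plain `git rebase main` would replay.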

@ZhengWG
Author

ZhengWG commented Apr 17, 2026

The thinker stage has been merged to main; let's rebase onto main, cutting off the thinker-stage changes.

For example, git rebase --onto main the-thinker-branch your-current-branch

OK, I will do it ASAP.

ZhengWG added 11 commits April 19, 2026 16:08
Signed-off-by: ZhengWG <zwg0606@gmail.com>
@yuanheng-zhao
Contributor

Ming-flash-omni-2.0 talker (TTS & Omni-Speech) #2890 has been merged to main. Shall we rebase on/merge from main please? Thanks! @ZhengWG

@ZhengWG
Author

ZhengWG commented Apr 23, 2026

Ming-flash-omni-2.0 talker (TTS & Omni-Speech) #2890 has been merged to main. Shall we rebase on/merge from main please? Thanks! @ZhengWG

Ok, I will do it ASAP.

