[Core]Add GPU Diffusion Runner by princepride · Pull Request #822 · vllm-project/vllm-omni

princepride · 2026-01-16T15:43:39Z

Purpose

Related: #800

This PR refactors the GPU diffusion worker architecture to improve code organization and maintainability:

Separated model runner logic: Extracted GPUDiffusionModelRunner from GPUDiffusionWorker to follow the separation of concerns principle
Improved naming consistency: Renamed gpu_worker.py → gpu_diffusion_worker.py and test_gpu_worker.py → test_gpu_diffusion_worker.py for better clarity
Adjusted NPU worker: Updated npu_worker.py to align with the new architecture and add missing functionality
Added comprehensive unit tests: Implemented detailed tests for load_weights, sleep, and wake_up methods with proper mocking
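
The worker/runner split described in the list above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code in this PR: only the class names GPUDiffusionWorker and GPUDiffusionModelRunner come from the description, while the `pipeline` attribute, the `execute_model` and `generate` methods, and the `_EchoPipeline` stand-in are hypothetical.

```python
from collections.abc import Iterable
from typing import Any


class _EchoPipeline:
    """Hypothetical stand-in for a diffusion pipeline (not a vllm-omni class)."""

    def __init__(self) -> None:
        self.loaded: set[str] = set()

    def load_weights(self, weights: Iterable[tuple[str, Any]]) -> set[str]:
        # Record the parameter names that were loaded and return them.
        names = {name for name, _ in weights}
        self.loaded |= names
        return names

    def __call__(self, prompt: str) -> str:
        return f"image for: {prompt}"


class GPUDiffusionModelRunner:
    """Sketch only: owns the model (weight loading and the forward pass)."""

    def __init__(self, pipeline: _EchoPipeline) -> None:
        self.pipeline = pipeline

    def load_weights(self, weights: Iterable[tuple[str, Any]]) -> set[str]:
        return self.pipeline.load_weights(weights)

    def execute_model(self, prompt: str) -> str:
        # Assumed method name; the real runner's interface may differ.
        return self.pipeline(prompt)


class GPUDiffusionWorker:
    """Sketch only: keeps process/device concerns and delegates model work."""

    def __init__(self, model_runner: GPUDiffusionModelRunner) -> None:
        self.model_runner = model_runner

    def load_weights(self, weights: Iterable[tuple[str, Any]]) -> set[str]:
        return self.model_runner.load_weights(weights)

    def generate(self, prompt: str) -> str:
        return self.model_runner.execute_model(prompt)


worker = GPUDiffusionWorker(GPUDiffusionModelRunner(_EchoPipeline()))
loaded = worker.load_weights([("unet.weight", 0), ("vae.bias", 1)])
result = worker.generate("a cat")
```

With this shape, the worker never touches the pipeline directly, so the runner can be tested or swapped without involving worker setup.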

Test Plan

Unit Test

pytest tests/diffusion/test_gpu_diffusion_worker.py -v

Result:

============================================ test session starts ============================================
platform linux -- Python 3.13.11, pytest-9.0.2, pluggy-1.6.0 -- /proj-tango-pvc/users/zhipeng.wang/workspace/vllm-omni/.venv/bin/python3
cachedir: .pytest_cache
rootdir: /proj-tango-pvc/users/zhipeng.wang/workspace/vllm-omni
configfile: pyproject.toml
plugins: cov-7.0.0, anyio-4.12.1
collected 8 items                                                                                           

tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerLoadWeights::test_load_weights_calls_pipeline PASSED [ 12%]
tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerLoadWeights::test_load_weights_empty_iterable PASSED [ 25%]
tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerSleep::test_sleep_level_1 PASSED  [ 37%]
tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerSleep::test_sleep_level_2 PASSED  [ 50%]
tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerSleep::test_sleep_memory_freed_validation PASSED [ 62%]
tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerWakeUp::test_wake_up_without_buffers PASSED [ 75%]
tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerWakeUp::test_wake_up_with_buffers PASSED [ 87%]
tests/diffusion/test_gpu_diffusion_worker.py::TestGPUDiffusionWorkerWakeUp::test_wake_up_partial_buffer_restore PASSED [100%]
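
The test names above suggest that worker methods are checked against a mocked pipeline. A minimal sketch of that style with `unittest.mock` might look like this; `FakeWorker` and its `pipeline` attribute are hypothetical stand-ins, not the classes exercised by the tests in this PR.

```python
from unittest.mock import MagicMock


class FakeWorker:
    """Hypothetical worker that forwards load_weights to its pipeline."""

    def __init__(self, pipeline) -> None:
        self.pipeline = pipeline

    def load_weights(self, weights):
        return self.pipeline.load_weights(weights)


def test_load_weights_calls_pipeline() -> None:
    pipeline = MagicMock()
    pipeline.load_weights.return_value = {"layer.weight"}
    worker = FakeWorker(pipeline)

    weights = [("layer.weight", object())]
    result = worker.load_weights(weights)

    # The worker should delegate exactly once, passing the weights through.
    pipeline.load_weights.assert_called_once_with(weights)
    assert result == {"layer.weight"}


def test_load_weights_empty_iterable() -> None:
    pipeline = MagicMock()
    pipeline.load_weights.return_value = set()
    worker = FakeWorker(pipeline)

    # An empty iterable is still forwarded, and the empty result is returned.
    assert worker.load_weights([]) == set()
    pipeline.load_weights.assert_called_once_with([])


# Run directly so the sketch works without a pytest invocation.
test_load_weights_calls_pipeline()
test_load_weights_empty_iterable()
```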

Test Run Diffusion Model

python examples/offline_inference/text_to_image/text_to_image.py

Result:

Signed-off-by: princepride <wangzhipeng628@gmail.com>

princepride · 2026-01-16T15:45:07Z

@ZJY0516 @hsliuustc0106

gcanlin · 2026-01-16T16:28:38Z

Hi! Could we please wait for #774, which micro-refactors diffusion_worker to be hardware-agnostic? Then we wouldn't need to modify platform_utils and npu_worker, and the "gpu" prefix would be removed. I'd prefer to get #774 merged first, but it's also okay to merge this one first. I'm concerned that #774 keeps growing larger.

hsliuustc0106 · 2026-01-16T21:26:54Z

I think #774 may need more discussion.

ZJY0516

LGTM

Gaohan123 · 2026-01-17T06:57:51Z

        destroy_distributed_env()


 class WorkerProc:


Is its function similar to an executor? Not now, but do we have a plan to refactor it into an executor in the future?

@ZJY0516 What do you think?

hsliuustc0106 · 2026-01-17T07:18:12Z

any speed difference before and after this PR?

princepride · 2026-01-17T07:54:45Z

I am testing it.

princepride · 2026-01-17T08:12:35Z

I used this script, python examples/offline_inference/text_to_image/text_to_image.py, to compare the speed on an H200. The average e2e time is 15236 ms for the original version and 15233 ms for the current version.
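
To put those two numbers in perspective, the difference can be expressed as a relative change. The helper name below is made up for this illustration; the two timings are the ones reported in the comment. The comment does not report run-to-run variance, so whether a gap this small is meaningful is not something this sketch can settle.

```python
def relative_change_percent(before_ms: float, after_ms: float) -> float:
    """Signed change from before to after, as a percentage of before."""
    return (after_ms - before_ms) / before_ms * 100.0


before_ms = 15236  # original version, as reported in the comment
after_ms = 15233   # this PR, as reported in the comment

delta_ms = after_ms - before_ms
change = relative_change_percent(before_ms, after_ms)
print(f"delta = {delta_ms} ms ({change:.3f}%)")
# prints: delta = -3 ms (-0.020%)
```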

Gaohan123

LGTM. Thanks

Add gpu diffusion runner

214499a

Signed-off-by: princepride <wangzhipeng628@gmail.com>

princepride requested a review from hsliuustc0106 as a code owner January 16, 2026 15:43

ZJY0516 approved these changes Jan 17, 2026

Comment thread vllm_omni/diffusion/worker/gpu_diffusion_worker.py

ZJY0516 requested a review from SamitHuang January 17, 2026 02:38

ZJY0516 added the "ready" label (label to trigger buildkite CI) Jan 17, 2026

princepride added 2 commits January 17, 2026 04:01

recover wake_up comments

51b69ad

Signed-off-by: princepride <wangzhipeng628@gmail.com>

adjust ci pipeline

ed33549

Signed-off-by: princepride <wangzhipeng628@gmail.com>

Gaohan123 reviewed Jan 17, 2026

Gaohan123 approved these changes Jan 17, 2026

hsliuustc0106 merged commit 36c2876 into vllm-project:main Jan 17, 2026
7 checks passed

hsliuustc0106 mentioned this pull request Jan 17, 2026

[Hardware] Support platforms and plugin system #774

Merged

11 tasks

erfgss pushed a commit to erfgss/vllm-omni that referenced this pull request Jan 19, 2026

[Core]Add GPU Diffusion Runner (vllm-project#822)

2c43fe8

Signed-off-by: princepride <wangzhipeng628@gmail.com> Signed-off-by: Chen Yang <2082464740@qq.com>

with1015 pushed a commit to with1015/vllm-omni that referenced this pull request Jan 20, 2026

[Core]Add GPU Diffusion Runner (vllm-project#822)

3382ee3

Signed-off-by: princepride <wangzhipeng628@gmail.com>

hsliuustc0106 merged 3 commits into vllm-project:main from princepride:add-gpu-diffusion-runner
