[Bugfix]: modify diffusion pipeline profiler result in videos by bjf-frz · Pull Request #2647 · vllm-project/vllm-omni

bjf-frz · 2026-04-09T12:23:21Z

Purpose

This PR fixes an issue where, with --enable-diffusion-pipeline-profiler enabled, the /v1/videos endpoint for Wan2.2 does not correctly handle peak_memory and stage_durations.
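To illustrate the kind of handling described above, here is a minimal sketch of how a benchmark client might read the two profiler fields from a video response. The payload shape (a flat JSON object with `peak_memory` and a `stage_durations` mapping) and the helper name `extract_profiler_metrics` are assumptions for illustration only; the actual change in benchmarks/diffusion/backends.py may look different.

```python
from __future__ import annotations

from typing import Any


def extract_profiler_metrics(
    payload: dict[str, Any],
) -> tuple[float | None, dict[str, float]]:
    """Read profiler fields from a (hypothetical) /v1/videos response body.

    Missing or malformed fields are tolerated so that a server started
    without the profiler flag does not break the benchmark client.
    """
    # Assumed field name: "peak_memory" holds a number in MB.
    raw_peak = payload.get("peak_memory")
    peak_mb = float(raw_peak) if isinstance(raw_peak, (int, float)) else None

    # Assumed field name: "stage_durations" maps stage name -> seconds.
    raw_stages = payload.get("stage_durations")
    stages: dict[str, float] = {}
    if isinstance(raw_stages, dict):
        for name, seconds in raw_stages.items():
            if isinstance(seconds, (int, float)):
                stages[str(name)] = float(seconds)

    return peak_mb, stages
```

Returning `None` and an empty mapping when the fields are absent keeps the "profiler disabled" case distinct from a real measurement of zero.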

Test Plan

Server side:
vllm serve Wan2.2-I2V-A14B-Diffusers/ --omni --port 8091

Client side:
python3 benchmarks/diffusion/diffusion_benchmark_serving.py \
  --base-url http://localhost:8091 \
  --model Wan2.2-I2V-A14B-Diffusers/ \
  --backend v1/videos \
  --dataset random \
  --task i2v \
  --num-prompts 1 \
  --max-concurrency 1 \
  --request-rate inf \
  --width 640 \
  --height 480 \
  --num-frames 81 \
  --fps 16 \
  --num-inference-steps 2

Test Result

================= Serving Benchmark Result =================
Backend:                                 v1/videos      
Model:                                   /home/admin/Wan2.2-I2V-14B-Distill-Diffusers/
Dataset:                                 random         
Task:                                    i2v            
--------------------------------------------------
Benchmark duration (s):                  12.14          
Request rate:                            inf            
Max request concurrency:                 1              
Successful requests:                     1/1              
--------------------------------------------------
Request throughput (req/s):              0.08           
Latency Mean (s):                        12.1411        
Latency Median (s):                      12.1411        
Latency P99 (s):                         12.1411        
Latency P95 (s):                         12.1411        
--------------------------------------------------
Peak Memory Max (MB):                    74204.00       
Peak Memory Mean (MB):                   74204.00       
Peak Memory Median (MB):                 74204.00       
--------------------------------------------------
Stage Durations Mean (s):
  Wan22I2VPipeline.text_encoder.forward: 0.0515         
  Wan22I2VPipeline.vae.encode:           1.0445         
  Wan22I2VPipeline.vae.decode:           1.5887
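
For readers who want to see how summary rows like the ones above can be derived, here is a small sketch that aggregates per-request measurements. The function names and the input layout (a list of peak values and a list of per-request stage mappings) are hypothetical and are not taken from diffusion_benchmark_serving.py.

```python
from __future__ import annotations

from collections import defaultdict
from statistics import mean, median


def summarize_peak_memory(peaks_mb: list[float]) -> dict[str, float]:
    """Max, mean, and median of per-request peak memory (MB)."""
    return {
        "max": max(peaks_mb),
        "mean": mean(peaks_mb),
        "median": median(peaks_mb),
    }


def mean_stage_durations(runs: list[dict[str, float]]) -> dict[str, float]:
    """Average each stage over the requests that reported it."""
    samples: dict[str, list[float]] = defaultdict(list)
    for run in runs:
        for stage, seconds in run.items():
            samples[stage].append(seconds)
    return {stage: mean(values) for stage, values in samples.items()}


def request_throughput(successful: int, duration_s: float) -> float:
    """Requests per second, rounded to two decimals as in the report."""
    return round(successful / duration_s, 2)
```

With a single request, the maximum, mean, and median coincide, which is why the three Peak Memory rows above show the same value; likewise 1 request over 12.14 s rounds to 0.08 req/s.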

Signed-off-by: bjf-frz <frz123db@gmail.com>

chatgpt-codex-connector · 2026-04-09T12:23:28Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

david6666666 · 2026-04-09T13:11:34Z

Please add the purpose.

bjf-frz · 2026-04-09T13:14:55Z

@Bounty-hunter @david6666666 @wtomin PTAL, thx !

bjf-frz · 2026-04-09T13:16:27Z

@yangjianjuan PTAL, thx

david6666666

One non-blocking test-coverage note below.

david6666666 · 2026-04-10T02:55:12Z

Stage Durations Mean (s):
Wan22I2VPipeline.text_encoder.forward: 0.0515
Wan22I2VPipeline.vae.encode: 1.0445
Wan22I2VPipeline.vae.decode: 1.5887

Do we have a stage duration for the DiT?

bjf-frz · 2026-04-10T03:08:22Z

@hsliuustc0106

> Stage Durations Mean (s):
> Wan22I2VPipeline.text_encoder.forward: 0.0515
> Wan22I2VPipeline.vae.encode: 1.0445
> Wan22I2VPipeline.vae.decode: 1.5887
>
> Do we have dit Stage Durations time?

The DiT denoising logic in Wan2.2 is currently spread throughout the pipeline's forward method, so it cannot be timed as a single stage. It needs to be refactored into a dedicated diffuse function to enable proper profiling. That refactoring will be addressed in a separate PR.
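
As a rough illustration of why a dedicated function helps, the sketch below times stages with a context manager. Once the denoising loop lives in its own `diffuse` method, it can be wrapped in a single timed region. `ToyPipeline`, `record_stage`, and the arithmetic inside each stage are placeholders invented for this example; this is not the vllm-omni profiler or the Wan2.2 pipeline.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

# Accumulated wall-clock seconds per stage name.
stage_durations = defaultdict(float)


@contextmanager
def record_stage(name):
    """Add the elapsed time of the enclosed block to stage_durations[name]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        stage_durations[name] += time.perf_counter() - start


class ToyPipeline:
    """Stand-in pipeline; each method is a placeholder for real model work."""

    def encode(self, x):
        return x + 1.0

    def diffuse(self, latents, num_steps):
        # The whole denoising loop sits in one method, so one timer covers it.
        for _ in range(num_steps):
            latents = latents * 0.5
        return latents

    def decode(self, latents):
        return latents * 2.0

    def forward(self, x, num_steps=2):
        with record_stage("ToyPipeline.encode"):
            latents = self.encode(x)
        with record_stage("ToyPipeline.diffuse"):
            latents = self.diffuse(latents, num_steps)
        with record_stage("ToyPipeline.decode"):
            return self.decode(latents)
```

If the loop body were instead interleaved with other work inside `forward`, there would be no single block to wrap, which matches the limitation described above.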

david6666666 · 2026-04-10T03:11:06Z

LGTM

…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com> (cherry picked from commit fbb5dd5) Signed-off-by: David Chen <530634352@qq.com>

…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>

bugfix: modify diffusion pipeline profiler result in videos

cc7502b

Signed-off-by: bjf-frz <frz123db@gmail.com>

bjf-frz requested a review from hsliuustc0106 as a code owner April 9, 2026 12:23

bjf-frz changed the title ~~[WIP][Bugfix]: modify diffusion pipeline profiler result in videos~~ [Bugfix]: modify diffusion pipeline profiler result in videos Apr 10, 2026

david6666666 reviewed Apr 10, 2026

Comment thread benchmarks/diffusion/backends.py

david6666666 mentioned this pull request Apr 10, 2026

[RFC][0.20.0]: Qwen-Image、Qwen-Image-Layered、Qwen-Image-Edit-Plus、Wan2.2 Production-grade Feature Monitoring JiusiServe/vllm-omni#181

Closed

1 task

david6666666 added the ready label to trigger buildkite CI label Apr 10, 2026

david6666666 approved these changes Apr 10, 2026

david6666666 merged commit fbb5dd5 into vllm-project:main Apr 10, 2026
8 checks passed

daixinning pushed a commit to daixinning/vllm-omni that referenced this pull request Apr 13, 2026

[Bugfix]: modify diffusion pipeline profiler result in videos (vllm-p…

b052381

…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>

lengrongfu pushed a commit to lengrongfu/vllm-omni that referenced this pull request May 1, 2026

[Bugfix]: modify diffusion pipeline profiler result in videos (vllm-p…

133f2ad

…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>

clodaghwalsh17 pushed a commit to clodaghwalsh17/nm-vllm-omni-ent that referenced this pull request May 12, 2026

[Bugfix]: modify diffusion pipeline profiler result in videos (vllm-p…

252c2ca

…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>

david6666666 merged 1 commit into vllm-project:main from bjf-frz:bugfix_videos_diffusion_pipeline_profiler
