[Bugfix]: modify diffusion pipeline profiler result in videos#2647
Conversation
Signed-off-by: bjf-frz <frz123db@gmail.com>
|
Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits. |
|
please add purpose |
|
@Bounty-hunter @david6666666 @wtomin PTAL, thx ! |
|
@yangjianjuan PTAL, thx |
david6666666
left a comment
There was a problem hiding this comment.
One non-blocking test-coverage note below.
|
Stage Durations Mean (s): Do we have dit Stage Durations time? |
The DIT process in wan2.2 is currently scattered throughout the |
|
LGTM |
…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com> (cherry picked from commit fbb5dd5) Signed-off-by: David Chen <530634352@qq.com>
…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>
…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>
…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>
PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.
Purpose
This PR addresses an issue where, after enabling the
--enable-diffusion-pipeline-profiler, the/v1/videosinterface in wan2.2 does not properly handlepeak_memoryandstage_durations.Test Plan
server end:
vllm serve Wan2.2-I2V-A14B-Diffusers/ --omni --port 8091
user end:
python3 benchmarks/diffusion/diffusion_benchmark_serving.py
--base-url http://localhost:8091
--model Wan2.2-I2V-A14B-Diffusers/
--backend v1/videos
--dataset random
--task i2v
--num-prompts 1
--max-concurrency 1
--request-rate inf
--width 640
--height 480
--num-frames 81
--fps 16
--num-inference-steps 2
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model. Please runmkdocs serveto sync the documentation editions to./docs.BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)