[Test] Add L4 diffusion feature test for GLM-Image#3451
Conversation
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: a80dae5abc
ℹ️ About Codex in GitHub
Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".
|
@yenuo26 PTAL 🙏 not sure if I edit the |
| path: /mnt/hf-cache | ||
| type: DirectoryOrCreate | ||
|
|
||
| - label: ":full_moon: Diffusion X2I(&A&T) · GLM-Image Function Test with H100" |
There was a problem hiding this comment.
is this going to be an indepdent test pipeline?
hsliuustc0106
left a comment
There was a problem hiding this comment.
LGTM — clean test PR with good use of shared helpers and parametrization. No substantive issues.
|
COMMENT This PR adds comprehensive L4 diffusion feature tests for GLM-Image, covering baseline, Tensor-Parallel, and HSDP configurations in both T2I and I2I modes. Notes:
No blocking issues found. This is a valuable addition to the test suite that will help catch regressions in GLM-Image's parallel features. |
|
Hi @herotai214, friendly reminder — this PR hasn't had any activity (commits or reviews) in the past 7 days. 🕐 Could you please provide an update?
Thanks for your contribution! 🙏 |
Thanks for reminding; I was working for Hunyuan for a period previously, and I was aware that the multi-stage CLI issue solution seems to be uncertain; Therefore, I put this PR on hold. Now I recognize the #3483 solves to CLI issue and have been merged today; I'll verify asap if it helps this L4 test (support TP, HSDP properly through CLI), or otherwise I may need to push a revised version after removing those unsupported features (through CLI). |
Signed-off-by: herotai214 <herotai214@gmail.com>
|
Updated the script; Verified that #3483 worked, and able to use CLI here to apply TP, SP, Cache-DiT.... But after rebased and upgrading to vllm 0.22.0 this morning with the commit [Rebase] Rebase to vllm releases/v0.22.0 (#3891), I cannot use CLI to apply any feature here anymore... Will double check and raise issue very soon.... |
PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED.
Purpose
This PR adds L4 test that covers the baseline & 2 of the supported features Tensor-Parallel, HSDP in both t2i & i2i cases for GLM-Image
Referred to #2167. Since it only covers i2i case, didn't effectively test the features, and the codes are quite different now, decided to raise this separate PR.
Tests are picked up by nightly:
pytest -sv test_glm_image_expansion.py -m "full_model" --run-level "full_model".The tests are intended to be picked up by the L4 nightly diffusion pipeline described in RFC #1832.
Test Plan
Model
zai-org/GLM-ImageChanges:
.buildkite/test-nightly.ymlto trigger testtests/e2e/online_serving/test_glm_image_expansion.py(the new test file)Tests added:
After referring to [RFC]: Continuous Diffusion Model Acceleration Support #1217:
Above 3 tests loop through both t2i and i2i cases -> 6 tests in total.
All tests use 2 cards.
Remark:
Test Result
pytest -sv tests/e2e/online_serving/test_glm_image_expansion.py -m "full_model" --run-level "full_model"(With #3384 fix!)
pytest -sv tests/e2e/online_serving/test_glm_image_expansion.py -m "full_model" --collect-only
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model. Please runmkdocs serveto sync the documentation editions to./docs.BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)