[CI/Build] Add Buildkite step for diffusion quantization tests#9
Open
Conversation
Force-pushed from b33677f to 662eb54
…arkers

The unified quantization framework (vllm-project#1764) consolidated source code at vllm_omni/quantization/, but tests were still under tests/diffusion/quantization/ and had no Buildkite CI coverage. This PR:

- Moves tests/diffusion/quantization/ to tests/quantization/ to mirror the source layout.
- Aligns pytest markers with the actual test type:
  - test_int8_config.py: core_model + cuda + L4 (GPU smoke test)
  - test_inc_config.py: core_model + cpu (pure config builder)
  - test_fp8_config.py: core_model + cpu (drop redundant diffusion marker)
  - test_gguf_config.py: core_model + cpu (drop redundant diffusion marker)
- Updates the test docstring and contributing doc to reference the new path.

After this change, the existing "CUDA Unit Test with single card" step (pytest -m 'core_model and cuda and L4 and not distributed_cuda') will automatically pick up the GPU quantization tests, and the "Simple Unit Test" step will pick up the CPU ones, so no dedicated Buildkite step is needed.

Fixes vllm-project#2614

Signed-off-by: pjh4993 <pjh4993@naver.com>
Split quantization quality tests by model group in test-nightly-diffusion.yml:

- Other group: Z-Image and FLUX FP8 quality tests
- Qwen-Image group: Qwen-Image FP8 quality test

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: pjh4993 <pjh4993@naver.com>
Force-pushed from 662eb54 to 9d3885f
…smoke tests

Separate test_int8_config.py into two files aligned with codebase conventions:

- test_int8_config.py (core_model, cpu): pure config/factory unit tests using mocks
- test_int8_smoke.py (core_model, cuda, L4): real hardware smoke tests with @cuda_available and @npu_available skipif guards

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: pjh4993 <pjh4993@naver.com>
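The skipif-guard pattern described in this commit can be sketched as follows. The real @cuda_available and @npu_available helpers live in the repo's test utilities and may detect hardware differently; the nvidia-smi probe below is a hypothetical stand-in for illustration only.

```python
import shutil

import pytest

# Hedged sketch: detect a GPU by checking for nvidia-smi on PATH.
# The project's actual cuda_available guard may inspect torch or
# driver state instead; this is an assumption, not the real helper.
cuda_available = pytest.mark.skipif(
    shutil.which("nvidia-smi") is None,
    reason="requires a CUDA-capable GPU",
)


@pytest.mark.core_model
@cuda_available
def test_int8_smoke():
    # A real smoke test would construct a quantized layer on device;
    # elided here since this is only a marker-pattern sketch.
    pass
```

With this split, `pytest -m 'core_model and cpu'` runs only the mock-based config tests, while the guarded smoke tests are skipped automatically on machines without a GPU.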
Set VLLM_TEST_CLEAN_GPU_MEMORY=1 on the qwen-image quantization quality test step so the autouse conftest fixture reclaims the runner's GPU memory before each test. Without it, a failed first attempt can leave a StageDiffusionProc child holding tens of GiB, and the in-session retry then hits a spurious CUDA OOM during weight loading (observed in build #6405 as a 59 GiB leaked sibling process on an A100 runner).

Signed-off-by: pjh4993 <pjh4993@naver.com>
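The env-var gating described above can be sketched as a minimal autouse fixture. The fixture name and the cleanup body here are hypothetical; the project's real conftest fixture does the actual work (reaping leaked child processes, freeing CUDA memory), which is elided.

```python
import os

import pytest


def gpu_cleanup_enabled() -> bool:
    """True when the step opts in via VLLM_TEST_CLEAN_GPU_MEMORY=1."""
    return os.environ.get("VLLM_TEST_CLEAN_GPU_MEMORY") == "1"


@pytest.fixture(autouse=True)
def clean_gpu_memory():
    # Runs before every test; only acts when the CI step opts in,
    # so local runs are unaffected.
    if gpu_cleanup_enabled():
        pass  # here the real fixture would reclaim leaked GPU memory
    yield
```

Because the fixture is autouse, no test needs to request it explicitly; setting the variable on one Buildkite step is enough to enable cleanup for that step only.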
Force-pushed from 9d3885f to 16db77c
Purpose
Add a Buildkite pipeline step for tests/diffusion/quantization/, which was missing from test-ready.yml. These tests (added in vllm-project#1470, refactored in vllm-project#1764) have core_model and diffusion markers but were never wired into CI, so breakages went undetected.

Fixes #8
(upstream: Fixes vllm-project#2614)
Test Plan
The change is a CI config addition — no local test needed. Validation will happen when Buildkite runs the new step on a PR with the ready label. The new step runs:

timeout 15m pytest -s -v tests/diffusion/quantization/ -m 'core_model' --run-level core_model

Test Result
N/A — CI-only change. The step uses gpu_1_queue (L4 GPU), matching the pattern of other diffusion test steps.
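A step of the shape described above might look like the following Buildkite YAML sketch. The label and key names are illustrative assumptions; the actual test-ready.yml entry may differ in naming and surrounding plugin configuration.

```yaml
- label: "Diffusion Quantization Test"
  agents:
    queue: gpu_1_queue  # single L4 GPU, as noted above
  command: |
    timeout 15m pytest -s -v tests/diffusion/quantization/ \
      -m 'core_model' --run-level core_model
```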