Split test_piecewise_cuda_graph.py to optimize CI resource usage by alisonshao · Pull Request #15290 · sgl-project/sglang

alisonshao · 2025-12-16T23:09:38Z

Summary

- Split the 1-GPU tests into test_piecewise_cuda_graph_1_gpu_a.py (6 tests, ~500s) and test_piecewise_cuda_graph_1_gpu_b.py (5 tests, ~500s).
- Move the 2-GPU test (Qwen3OmniMOE) to test_piecewise_cuda_graph_2_gpu.py (~200s) and reduce tp from 4 to 2.
- Remove the original test_piecewise_cuda_graph.py from the per-commit-4-gpu suite (was 1200s).
- Update run_suite.py to reference the new test files in the appropriate suites.

This reduces CI time for 4 GPU tests and better distributes the workload across different GPU resource pools.
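As a rough illustration of the layout described in the summary, the sketch below models each test file with an estimated runtime and groups the files by suite. The `SuiteFile` dataclass, the `suites` dictionary, the helper `suite_total`, and the suite names other than per-commit-4-gpu are assumptions made for this example; the real run_suite.py may be organized differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SuiteFile:
    """Hypothetical record: a test file and its estimated runtime in seconds."""
    name: str
    estimated_time: float


# Only "per-commit-4-gpu" is named in the PR; the other suite names are assumed.
# Estimates are the approximate values quoted in the summary.
suites = {
    "per-commit-1-gpu": [
        SuiteFile("test_piecewise_cuda_graph_1_gpu_a.py", 500),
        SuiteFile("test_piecewise_cuda_graph_1_gpu_b.py", 500),
    ],
    "per-commit-2-gpu": [
        SuiteFile("test_piecewise_cuda_graph_2_gpu.py", 200),
    ],
    # The original 1200s file is no longer listed here.
    "per-commit-4-gpu": [],
}


def suite_total(suite_name: str) -> float:
    """Sum the estimated runtimes of every file registered in a suite."""
    return sum(f.estimated_time for f in suites[suite_name])
```

Keeping the estimate next to the file name makes it easy to see how much time each GPU pool is expected to spend on these tests.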

Test plan

- Verify that the split test files run correctly on CI.
- Confirm that the estimated times are within the 500s limit per file.
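The second check in the test plan could be automated along these lines. This is a minimal sketch, assuming the estimates are available as a plain mapping from file name to seconds; the `estimates` dictionary and the `files_over_limit` helper are hypothetical and not part of the PR.

```python
LIMIT_SECONDS = 500  # per-file limit named in the test plan

# Approximate estimates quoted in the PR summary.
estimates = {
    "test_piecewise_cuda_graph_1_gpu_a.py": 500,
    "test_piecewise_cuda_graph_1_gpu_b.py": 500,
    "test_piecewise_cuda_graph_2_gpu.py": 200,
}


def files_over_limit(estimates, limit=LIMIT_SECONDS):
    """Return the names of files whose estimate exceeds the limit, sorted."""
    return sorted(name for name, secs in estimates.items() if secs > limit)
```

An empty result means every file fits within the limit; any returned name is a candidate for a further split.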

gemini-code-assist · 2025-12-16T23:09:42Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

alisonshao · 2025-12-16T23:10:59Z

/tag-and-rerun-ci

yuan-luo · 2025-12-17T03:42:40Z

@alisonshao This still times out, PTAL.
https://github.com/sgl-project/sglang/actions/runs/20287352911/job/58265809329?pr=15290

- Split 1 GPU tests into test_piecewise_cuda_graph_1_gpu_a.py (6 tests, ~500s) and test_piecewise_cuda_graph_1_gpu_b.py (5 tests, ~500s)
- Move 2 GPU test (Qwen3OmniMOE) to test_piecewise_cuda_graph_2_gpu.py (~200s) and reduce tp from 4 to 2
- Remove original test_piecewise_cuda_graph.py from per-commit-4-gpu suite
- Update run_suite.py to reference new test files in appropriate suites

This reduces CI time for 4 GPU tests and better distributes the workload.


…n3_pp

* 'main' of https://github.com/sgl-project/sglang: (74 commits)
  [bug fix][pp] fix inconsistent latency between tp (sgl-project#15379)
  Fix warp illegal instruction in kimi k2 thinking PCG (sgl-project#15306)
  Fix gpt-oss yarn with `truncate` argument (sgl-project#14270)
  Monkey patch deepseek-ocr's `v_head_dim` (sgl-project#15384)
  [model-gateway] Replace PolicyRegistry RwLock with DashMap for lock-free policy lookups (sgl-project#15361)
  [PP] Fix dynamic chunking strategy for PP (sgl-project#15372)
  Fix issue: ENABLE_BELOW_SM90 cannot be enabled on aarch64 CPU (sgl-project#12967)
  Split test_piecewise_cuda_graph.py to optimize CI resource usage (sgl-project#15290)
  unified management of environment variables for vlm cuda ipc transport (sgl-project#14501)
  Mistral Large 3 NVFP4 TRTLLM MoE support (sgl-project#15049)
  fix: adjust time for test_epd_disaggregation.py (sgl-project#15354)
  Add doc for qwen3 next (sgl-project#15337)
  feat: DeepSeek-V3.2 Streaming tool call output (sgl-project#15278)
  Feature/trtllm mha workspace size configurable sgl-project#15089 (sgl-project#15131)
  [VLM] Support cos sin cache for Qwen3-VL & GLM-4.1V (sgl-project#15205)
  [Deepseek V3.2] Support Overlap Spec + NSA (sgl-project#15307)
  Add request-level timestamp for when prefill finishes (sgl-project#14860)
  [CI] Migrate LoRA tests to test/registered/lora/ (sgl-project#15176)
  Reserve more memory for DeepSeekOCR model and adjust server start timeout for DeepGEMM to reduce flakiness (sgl-project#15277)
  Fix condition check for require_gathered_buffer (sgl-project#15328)
  ...


github-actions bot added the run-ci label Dec 16, 2025


alisonshao mentioned this pull request Dec 17, 2025

[VLM] Support Piecewise CUDA Graph for Qwen3-Omni-MOE #14222

Merged

6 tasks

yuan-luo approved these changes Dec 17, 2025


alisonshao added 2 commits December 17, 2025 14:35

Fix lint: remove unused imports

c9793bd

alisonshao force-pushed the split-piecewise-cuda-graph-tests branch from 8a2357e to c9793bd on December 17, 2025 22:35

alisonshao and others added 5 commits December 17, 2025 16:10

Fix test_vision_chunked_prefill.py estimated time from 117s to 500s

ffaff29

Balance piecewise cuda graph tests and fix estimated times

575dfa7

- Move TestPiecewiseCudaGraphAWQ from 1_gpu_a to 1_gpu_b to balance runtimes
- Update estimated times: 1_gpu_a (460s), 1_gpu_b (480s)
- Fix test_vision_chunked_prefill.py estimate from 117s to 150s
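The PR does not say how the move of TestPiecewiseCudaGraphAWQ was chosen. One common way to balance test files by runtime is a greedy longest-first assignment, sketched below. The `split_balanced` helper, the test names, and the per-test durations are all hypothetical; the PR reports only per-file totals.

```python
def split_balanced(durations, num_files=2):
    """Assign tests to files greedily: longest test first, lightest file first."""
    bins = [{"tests": [], "total": 0} for _ in range(num_files)]
    # Sort by descending duration; break ties by name for a stable result.
    for name, secs in sorted(durations.items(), key=lambda kv: (-kv[1], kv[0])):
        target = min(bins, key=lambda b: b["total"])  # currently lightest file
        target["tests"].append(name)
        target["total"] += secs
    return bins


# Hypothetical per-test durations in seconds; real values are not in the PR.
durations = {"TestA": 200, "TestB": 150, "TestC": 120, "TestD": 100, "TestE": 90}
bins = split_balanced(durations)
```

The greedy rule does not guarantee an optimal split, but it is simple and usually keeps the totals close enough for CI scheduling.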

Merge branch 'main' into split-piecewise-cuda-graph-tests

23bb301

Merge branch 'main' into split-piecewise-cuda-graph-tests

a4b0a5e

Merge branch 'main' into split-piecewise-cuda-graph-tests

742cc31

merrymercy merged commit 58c840d into sgl-project:main Dec 18, 2025
25 of 29 checks passed

Prozac614 pushed a commit to Prozac614/sglang that referenced this pull request Dec 23, 2025

Split test_piecewise_cuda_graph.py to optimize CI resource usage (sgl-project#15290)

8f827a8

jiaming1130 pushed a commit to zhuyijie88/sglang that referenced this pull request Dec 25, 2025

Split test_piecewise_cuda_graph.py to optimize CI resource usage (sgl-project#15290)

ba131d5

YChange01 pushed a commit to YChange01/sglang that referenced this pull request Jan 13, 2026

Split test_piecewise_cuda_graph.py to optimize CI resource usage (sgl-project#15290)

cb8674b

merrymercy merged 7 commits into sgl-project:main from alisonshao:split-piecewise-cuda-graph-tests