Skip to content

Split test_piecewise_cuda_graph.py to optimize CI resource usage#15290

Merged
merrymercy merged 7 commits intosgl-project:mainfrom
alisonshao:split-piecewise-cuda-graph-tests
Dec 18, 2025
Merged

Split test_piecewise_cuda_graph.py to optimize CI resource usage#15290
merrymercy merged 7 commits intosgl-project:mainfrom
alisonshao:split-piecewise-cuda-graph-tests

Conversation

@alisonshao
Copy link
Collaborator

@alisonshao alisonshao commented Dec 16, 2025

Summary

  • Split 1 GPU tests into test_piecewise_cuda_graph_1_gpu_a.py (6 tests, ~500s) and test_piecewise_cuda_graph_1_gpu_b.py (5 tests, ~500s)
  • Move 2 GPU test (Qwen3OmniMOE) to test_piecewise_cuda_graph_2_gpu.py (~200s) and reduce tp from 4 to 2
  • Remove original test_piecewise_cuda_graph.py from per-commit-4-gpu suite (was 1200s)
  • Update run_suite.py to reference new test files in appropriate suites

This reduces CI time for 4 GPU tests and better distributes the workload across different GPU resource pools.

Test plan

  • Verify the split test files run correctly on CI
  • Confirm estimated times are within 500s limit per file

@gemini-code-assist
Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@alisonshao
Copy link
Collaborator Author

/tag-and-rerun-ci

@alisonshao

This comment was marked as outdated.

@github-actions

This comment was marked as outdated.

@yuan-luo
Copy link
Collaborator

- Split 1 GPU tests into test_piecewise_cuda_graph_1_gpu_a.py (6 tests, ~500s)
  and test_piecewise_cuda_graph_1_gpu_b.py (5 tests, ~500s)
- Move 2 GPU test (Qwen3OmniMOE) to test_piecewise_cuda_graph_2_gpu.py (~200s)
  and reduce tp from 4 to 2
- Remove original test_piecewise_cuda_graph.py from per-commit-4-gpu suite
- Update run_suite.py to reference new test files in appropriate suites

This reduces CI time for 4 GPU tests and better distributes the workload.
@alisonshao alisonshao force-pushed the split-piecewise-cuda-graph-tests branch from 8a2357e to c9793bd Compare December 17, 2025 22:35
@merrymercy merrymercy merged commit 58c840d into sgl-project:main Dec 18, 2025
25 of 29 checks passed
Liwansi added a commit to iforgetmyname/sglang that referenced this pull request Dec 19, 2025
…n3_pp

* 'main' of https://github.com/sgl-project/sglang: (74 commits)
  [bug fix][pp] fix inconsistent latency between tp (sgl-project#15379)
  Fix warp illegal instruction in kimi k2 thinking PCG (sgl-project#15306)
  Fix gpt-oss yarn with `truncate` argument (sgl-project#14270)
  Monkey patch deepseek-ocr's `v_head_dim` (sgl-project#15384)
  [model-gateway] Replace PolicyRegistry RwLock with DashMap for lock-free policy lookups (sgl-project#15361)
  [PP] Fix dynamic chunking strategy for PP (sgl-project#15372)
  Fix issue: ENABLE_BELOW_SM90 cannot be enabled on aarch64 CPU (sgl-project#12967)
  Split test_piecewise_cuda_graph.py to optimize CI resource usage (sgl-project#15290)
  unified management of environment variables for vlm cuda ipc transport  (sgl-project#14501)
  Mistral Large 3 NVFP4 TRTLLM MoE support (sgl-project#15049)
  fix: adjust time for test_epd_disaggregation.py (sgl-project#15354)
  Add doc for qwen3 next (sgl-project#15337)
  feat: DeepSeek-V3.2 Streaming tool call output (sgl-project#15278)
  Feature/trtllm mha workspace size configurable sgl-project#15089 (sgl-project#15131)
  [VLM] Support cos sin cache for Qwen3-VL & GLM-4.1V (sgl-project#15205)
  [Deepseek V3.2] Support Overlap Spec + NSA (sgl-project#15307)
  Add request-level timestamp for when prefill finishes (sgl-project#14860)
  [CI] Migrate LoRA tests to test/registered/lora/ (sgl-project#15176)
  Reserve more memory for DeepSeekOCR model and adjust server start timeout for DeepGEMM to reduce flakiness (sgl-project#15277)
  Fix condition check for require_gathered_buffer (sgl-project#15328)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants