ci: bump test_mimo_models.py est_time 330 → 610 by alisonshao · Pull Request #24551 · sgl-project/sglang

alisonshao · 2026-05-06T21:21:35Z

Summary

PR [Feature] Xiaomi MiMo-V2.5 day0 support #23811 added a second test class (TestMiMoV2 — XiaomiMiMo/MiMo-V2.5, TP=8 DP=2, MMMU + GSM8K + EAGLE spec) to test_mimo_models.py without bumping est_time.
The file now runs ~500-640 s on h200 vs the unchanged est_time=330, which causes the auto-partitioner to consistently overload shard 0 of stage-c-test-8-gpu-h200 and hit the 30-min Run test wall (e.g. runs 25428444359, 25411981650).

Test plan

Trigger a scheduled-style run on main and confirm stage-c-test-8-gpu-h200 (0) no longer hits the 30-min wall.

PR #23811 added a second test class (TestMiMoV2 — XiaomiMiMo/MiMo-V2.5, TP=8 DP=2, MMMU + GSM8K + EAGLE spec) to test_mimo_models.py without bumping est_time. The file now spins up two full servers and runs ~600 s on h200; staying at 330 s makes the partitioner consistently overload shard 0 of stage-c-test-8-gpu-h200, which keeps hitting the 30-min "Run test" wall (e.g. runs 25428444359, 25411981650). Same value already proposed by the auto-bump bot in #24331; this PR is a focused subset to unblock the H200 timeouts now.

gemini-code-assist · 2026-05-06T21:21:39Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

* main: (894 commits) [Bug Fix] Fix RunAI streamer: corrupted weights, missing quant init, and broken URIs for multimodal models (sgl-project#22715) [Kernel] Deprecate DeepGemm in sgl kernel and apply custom wheel sgl-deep-gemm (sgl-project#24268) propagate pytest exit code from test __main__ entries (sgl-project#24487) [R3] Avoid implicit CUDA sync in routed experts DP slicing (sgl-project#24550) Add ChatCompletionRequest-style support to /v1/tokenize (sgl-project#23981) Support Triton MLA FP8 KV cache (sgl-project#20479) [diffusion] chore: align LTX-2 with official (sgl-project#24313) Expand support matrix for pypi wheel release (sgl-project#24565) [codex] Optimize Z-Image packed QKV (sgl-project#24117) [Misc] Fix breaking weight checker test (sgl-project#24553) [LoRA] Fix qkv_proj LoRA buffer sizing when tp_size > num_key_value_heads (sgl-project#24420) ci: bump test_mimo_models.py est_time 330 → 610 (sgl-project#24551) [CI] Temporarily disable marco/mcdse-2b-v1 in test_embedding_models (sgl-project#24279) Improve metrics, observability, and PD deploy tooling (sgl-project#24521) Fix diffusion fallback guards and validation (sgl-project#23335) [PD] Prevent update_status to Failed from cleared entries (sgl-project#24539) [CP] Register KV cache allgather buffer with symmetric memory (sgl-project#24040) Support getting checksums in weight checker (sgl-project#24537) Refactor buffer patterns in weight checker (sgl-project#24538) Add unit and end-to-end tests for weight checker (sgl-project#24536) ... # Conflicts: # python/sglang/srt/managers/scheduler.py # python/sglang/srt/model_executor/model_runner.py

Kangyan-Zhou approved these changes May 6, 2026

View reviewed changes

Kangyan-Zhou merged commit e72246c into main May 6, 2026
61 of 67 checks passed

Kangyan-Zhou deleted the alison/bump-mimo-est-time branch May 6, 2026 21:35

Fridge003 pushed a commit that referenced this pull request May 6, 2026

ci: bump test_mimo_models.py est_time 330 → 610 (#24551)

cfe02ab

LLThomas pushed a commit to LLThomas/sglang that referenced this pull request May 8, 2026

ci: bump test_mimo_models.py est_time 330 → 610 (sgl-project#24551)

a43d0c1

LucQueen pushed a commit to LucQueen/sglang that referenced this pull request May 12, 2026

ci: bump test_mimo_models.py est_time 330 → 610 (sgl-project#24551)

1511b0d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: bump test_mimo_models.py est_time 330 → 610#24551

ci: bump test_mimo_models.py est_time 330 → 610#24551
Kangyan-Zhou merged 1 commit into
mainfrom
alison/bump-mimo-est-time

alisonshao commented May 6, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented May 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

alisonshao commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

gemini-code-assist Bot commented May 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alisonshao commented May 6, 2026 •

edited

Loading