Support Intern-S2-Preview by RunningLeon · Pull Request #24875 · sgl-project/sglang

RunningLeon · 2026-05-10T06:12:28Z

Motivation

python3 -m sglang.launch_server \
    --model-path internlm/Intern-S2-Preview \
    --trust-remote-code \
    --tp-size 8 \
    --mem-fraction-static 0.8 \
    --attention-backend fa3 \
    --mm-attention-backend fa3 \
    --reasoning-parser qwen3 \
    --tool-call-parser qwen3_coder

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review and Merge Process

Ping Merge Oncalls to start the process. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

gemini-code-assist · 2026-05-10T06:12:32Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

ispobock · 2026-05-10T06:20:35Z

/tag-and-rerun-ci

* main: (87 commits) [Fix] Disable FlashInfer allreduce fusion under deterministic inference (sgl-project#24629) fix: STANDALONE spec-decode hidden-size mismatch crash (sgl-project#24217) Followup fix for Custom AR V2 in non NVL scenarios (sgl-project#24742) Fix reduce_scatterv producer contract for SUM_LEN (sgl-project#24785) [NPU]Documentation update for communications quantization feature (sgl-project#24668) [Session R3] Add routed_experts_start_len for absolute routing slice control (sgl-project#24851) [Model] Add MiniCPM-V 4.6 support (sgl-project#24855) Support Intern-S2-Preview (sgl-project#24875) [PD] Unify dsv4 dispatch with swa (sgl-project#24888) Optimize MHC pipeline: DeepGemm, fused norm, fused hc_head (sgl-project#24775) Fix PD bootstrap failure handling (sgl-project#24772) [Spec] Cleanup idle stub and shape-check patterns (sgl-project#24881) [Bug] Add dsv4 state_type branch to mooncake disaggregation (sgl-project#24878) [Spec V1] Split draft-extend phase from `EagleDraftInput` into new `EagleDraftExtendInput` (sgl-project#24859) [Gemma4] Optimize Gemm4 with fused Q/K/V RMSNorm + per-expert FP8 ckpt loader (sgl-project#24696) [spec decoding] support kimi-k2.5-eagle3-mla (sgl-project#24826) [SPEC V2] fix: skip stale state updates in spec-v2 overlap (sgl-project#23456) [RL] Call torch.cuda.empty_cache() for `in-place` pause mode to avoid OOM (sgl-project#24854) [diffusion] CI: add cache-dit CI tests (sgl-project#19213) [Utils] Make request dump robust to unpicklable server_args and large meta_info (sgl-project#24767) ... # Conflicts: # python/sglang/srt/utils/common.py

RunningLeon added 2 commits April 20, 2026 11:58

support interns2preview

5dc46b2

Merge remote-tracking branch 'upstream/main' into s2preview

219f70e

RunningLeon requested review from Fridge003, HaiShaw, JustinTong0323, Ying1123, ch-wan, hnyls2002, ispobock, merrymercy, mickqian, yhyang201 and yuan-luo as code owners May 10, 2026 06:12

RunningLeon requested review from BBuf, ByronHsu, Edwardf0t1 and ShangmingCai as code owners May 10, 2026 06:12

github-actions Bot added the run-ci label May 10, 2026

ispobock approved these changes May 10, 2026

View reviewed changes

ispobock merged commit 335dbd6 into sgl-project:main May 10, 2026
203 of 237 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Intern-S2-Preview#24875

Support Intern-S2-Preview#24875
ispobock merged 2 commits into
sgl-project:mainfrom
RunningLeon:s2preview

RunningLeon commented May 10, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented May 10, 2026

Uh oh!

ispobock commented May 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

RunningLeon commented May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

Uh oh!

gemini-code-assist Bot commented May 10, 2026

Uh oh!

ispobock commented May 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

RunningLeon commented May 10, 2026 •

edited

Loading