Fix gpt-oss yarn with `truncate` argument by hnyls2002 · Pull Request #14270 · sgl-project/sglang

hnyls2002 · 2025-12-02T03:44:27Z

To match the `truncate` setting in the gpt-oss config: https://huggingface.co/openai/gpt-oss-20b/blob/main/config.json#L66


harrisonlimh · 2025-12-03T12:10:41Z

Tested with gpt-oss-120b; the changed calculation does not appear to make a meaningful difference. I will try gpt-oss-20b as well.

Impacted variables and tensors (with truncation vs. without truncation):

- correction_range: low: 8, high: 18 vs. low: 8.092779115512402, high: 17.39802450158856
- ramp_func.mean(): 0.5781250596046448 vs. 0.585820198059082
- inv_freq.mean(): 0.0993923768401146 vs. 0.09938869625329971
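For reference, the correction range and ramp mean above can be reproduced with a minimal sketch of the usual YaRN correction-range calculation. This is not the SGLang code itself: the helper names follow common YaRN naming, and the parameter values (head dimension 64, RoPE base 150000, original context 4096, beta_fast 32, beta_slow 1) are assumptions taken to be the gpt-oss settings. The only thing `truncate` changes is whether `low` and `high` are rounded outward to integers.

```python
import math


def yarn_find_correction_dim(num_rotations, dim, base, max_position_embeddings):
    # Dimension index at which the rotary component completes
    # `num_rotations` full turns over the original context length.
    return (dim * math.log(max_position_embeddings / (num_rotations * 2 * math.pi))) / (
        2 * math.log(base)
    )


def yarn_find_correction_range(
    beta_fast, beta_slow, dim, base, max_position_embeddings, truncate=True
):
    low = yarn_find_correction_dim(beta_fast, dim, base, max_position_embeddings)
    high = yarn_find_correction_dim(beta_slow, dim, base, max_position_embeddings)
    if truncate:
        # Round outward to integer dimension indices.
        low, high = math.floor(low), math.ceil(high)
    return max(low, 0), min(high, dim - 1)


def ramp_mean(low, high, half_dim):
    # Linear ramp from 0 (at `low`) to 1 (at `high`), clamped to [0, 1].
    if low == high:
        high += 0.001  # avoid division by zero
    ramp = [min(max((i - low) / (high - low), 0.0), 1.0) for i in range(half_dim)]
    return sum(ramp) / half_dim


# Assumed gpt-oss settings (not stated in this thread).
HEAD_DIM, ROPE_THETA, ORIG_CTX, BETA_FAST, BETA_SLOW = 64, 150000, 4096, 32, 1

truncated = yarn_find_correction_range(
    BETA_FAST, BETA_SLOW, HEAD_DIM, ROPE_THETA, ORIG_CTX, truncate=True
)
untruncated = yarn_find_correction_range(
    BETA_FAST, BETA_SLOW, HEAD_DIM, ROPE_THETA, ORIG_CTX, truncate=False
)

print("with truncate:   ", truncated, ramp_mean(*truncated, HEAD_DIM // 2))
print("without truncate:", untruncated, ramp_mean(*untruncated, HEAD_DIM // 2))
```

Under these assumed settings the sketch gives the same correction ranges and ramp means as listed above, which suggests the difference between the two code paths is confined to the rounding of `low` and `high`.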

GPQA eval scores

with truncate

high:
- {'chars': 128.6344696969697, 'chars:std': 336.4544423341523, 'score': 0.7973484848484849, 'score:std': 0.40197497255216075}
medium:
- {'chars': 267.67108585858585, 'chars:std': 424.53770135412367, 'score': 0.726010101010101, 'score:std': 0.44600385002979953}
- {'chars': 233.18371212121212, 'chars:std': 402.93011677260085, 'score': 0.7253787878787878, 'score:std': 0.44632320349079807}
low:
- {'chars': 150.23800505050505, 'chars:std': 324.1160777618058, 'score': 0.6496212121212122, 'score:std': 0.4770885587429018}
- {'chars': 145.9671717171717, 'chars:std': 311.72771065301606, 'score': 0.6357323232323232, 'score:std': 0.48122420598922094}

without truncate

high:
- {'chars': 138.97095959595958, 'chars:std': 367.965799993336, 'score': 0.7847222222222222, 'score:std': 0.4110149099154914}
medium:
- {'chars': 251.08838383838383, 'chars:std': 416.924230715839, 'score': 0.7417929292929293, 'score:std': 0.4376484654879353}
- {'chars': 230.09911616161617, 'chars:std': 405.0049643033156, 'score': 0.7228535353535354, 'score:std': 0.4475894343932065}
low:
- {'chars': 146.62815656565655, 'chars:std': 314.83139425447695, 'score': 0.6590909090909091, 'score:std': 0.4740148548775957}
- {'chars': 136.94444444444446, 'chars:std': 302.62053603939734, 'score': 0.6470959595959596, 'score:std': 0.477873182623323}
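
To put the scores side by side, here is a small sketch that averages the reported runs per reasoning level and compares the with/without gap against the spread between repeated runs of the same configuration. The scores are copied from the results above; the dictionary layout and variable names are only illustrative.

```python
# GPQA scores copied from the runs above, keyed by reasoning level.
with_truncate = {
    "high": [0.7973484848484849],
    "medium": [0.726010101010101, 0.7253787878787878],
    "low": [0.6496212121212122, 0.6357323232323232],
}
without_truncate = {
    "high": [0.7847222222222222],
    "medium": [0.7417929292929293, 0.7228535353535354],
    "low": [0.6590909090909091, 0.6470959595959596],
}


def mean(values):
    return sum(values) / len(values)


# Mean score difference (with truncate minus without) per level.
deltas = {
    level: mean(with_truncate[level]) - mean(without_truncate[level])
    for level in with_truncate
}

# Largest gap between two runs of the *same* configuration.
max_spread = max(
    max(scores) - min(scores)
    for runs in (with_truncate, without_truncate)
    for scores in runs.values()
    if len(scores) > 1
)

for level, delta in deltas.items():
    print(f"{level}: {delta:+.4f}")
print(f"max run-to-run spread: {max_spread:.4f}")
```

The per-level gaps come out smaller than the largest run-to-run spread within a single configuration, and they do not all point in the same direction.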

hnyls2002 · 2025-12-08T05:18:21Z

/tag-and-rerun-ci

hlu1 · 2025-12-18T06:37:37Z

This fix is consistent with the reference implementation from gpt-oss: https://github.com/openai/gpt-oss/blob/main/gpt_oss/torch/model.py#L98-L107

hnyls2002 · 2025-12-18T06:40:28Z

/tag-and-rerun-ci


fix

52f50c0

hnyls2002 requested review from BBuf, Edwardf0t1, Fridge003, HaiShaw, Ying1123, ch-wan, ispobock and merrymercy as code owners December 2, 2025 03:44

fix

9ec319c

github-actions bot added the run-ci label Dec 8, 2025

hlu1 approved these changes Dec 18, 2025


Merge branch 'main' into lsyin/fix-gpt-oss-truncate-rope

69f6101

hnyls2002 merged commit 374ad4c into main Dec 18, 2025
53 of 69 checks passed

hnyls2002 deleted the lsyin/fix-gpt-oss-truncate-rope branch December 18, 2025 08:31

Prozac614 pushed a commit to Prozac614/sglang that referenced this pull request Dec 23, 2025

Fix gpt-oss yarn with truncate argument (sgl-project#14270)

ad061f0

jiaming1130 pushed a commit to zhuyijie88/sglang that referenced this pull request Dec 25, 2025

Fix gpt-oss yarn with truncate argument (sgl-project#14270)

51fd84a

YChange01 pushed a commit to YChange01/sglang that referenced this pull request Jan 13, 2026

Fix gpt-oss yarn with truncate argument (sgl-project#14270)

6fe4730

hnyls2002 merged 3 commits into main from lsyin/fix-gpt-oss-truncate-rope

hnyls2002 commented Dec 2, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Dec 2, 2025

Uh oh!

harrisonlimh commented Dec 3, 2025 •

edited

Loading

Uh oh!

hnyls2002 commented Dec 8, 2025

Uh oh!

hlu1 commented Dec 18, 2025

Uh oh!

hnyls2002 commented Dec 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hnyls2002 commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot commented Dec 2, 2025

Uh oh!

harrisonlimh commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Impacted variables and tensors with and without truncation:

GPQA eval scores

Uh oh!

hnyls2002 commented Dec 8, 2025

Uh oh!

hlu1 commented Dec 18, 2025

Uh oh!

hnyls2002 commented Dec 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hnyls2002 commented Dec 2, 2025 •

edited

Loading

harrisonlimh commented Dec 3, 2025 •

edited

Loading