Skip to content

fix(moe): relocate orphan tuned configs after #23019#24004

Merged
Qiaolin-Yu merged 2 commits into
sgl-project:mainfrom
Wen-xuan-Xu:fix/moe-orphan-configs
Apr 29, 2026
Merged

fix(moe): relocate orphan tuned configs after #23019#24004
Qiaolin-Yu merged 2 commits into
sgl-project:mainfrom
Wen-xuan-Xu:fix/moe-orphan-configs

Conversation

@Wen-xuan-Xu
Copy link
Copy Markdown
Contributor

@Wen-xuan-Xu Wen-xuan-Xu commented Apr 29, 2026

Motivation

After #23019 moved the MoE config loader and the configs/ tree from fused_moe_triton/ to moe_runner/triton_utils/, two later PRs unknowingly added 33 tuned-config JSONs to the OLD path:

The runtime loader anchors its search via
os.path.dirname(os.path.realpath(file)) of the loader file (now in moe_runner/triton_utils/), so configs in the old directory were never read — runtime fell back to
get_default_config().

The configs themselves were properly tuned and benchmarked at submission time via the in-process override_config() path used by the tuning script — that is why the PR authors observed real speedup. The bug is purely a wrong filesystem location.

Root cause: the tuning README still pointed contributors to the old path. This PR moves the misplaced configs into the runtime-loaded location and fixes the README.

Modifications

Changes:

  • R100 git-mv 33 JSONs into moe_runner/triton_utils/configs/{triton_3_5_1,triton_3_6_0}/
  • Update benchmark/kernels/fused_moe_triton/README.md path

No content changes. No code changes.

References: #23019 #22791 #23533

Checklist

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@github-actions github-actions Bot added the documentation Improvements or additions to documentation label Apr 29, 2026
After sgl-project#23019 moved the MoE config loader and the configs/ tree from
`fused_moe_triton/` to `moe_runner/triton_utils/`, two later PRs
unknowingly added 33 tuned-config JSONs to the OLD path:

- sgl-project#22791 (LFM2)        — 24 files (E=32/64, H100/B200/MI325X)
- sgl-project#23533 (Hy3 preview) —  9 files (E=192,N=192 incl. _down,
                                    H20/H20-3e/B200)

The runtime loader anchors its search via
os.path.dirname(os.path.realpath(__file__)) of the loader file
(now in moe_runner/triton_utils/), so configs in the old
directory were never read — runtime fell back to
get_default_config().

The configs themselves were properly tuned and benchmarked at
submission time via the in-process override_config() path used
by the tuning script — that is why the PR authors observed real
speedup. The bug is purely a wrong filesystem location.

Root cause: the tuning README still pointed contributors to the
old path. This PR moves the misplaced configs into the
runtime-loaded location and fixes the README.

Changes:
  * R100 git-mv 33 JSONs into moe_runner/triton_utils/configs/{triton_3_5_1,triton_3_6_0}/
  * Update benchmark/kernels/fused_moe_triton/README.md path

No content changes. No code changes.

References: sgl-project#23019 sgl-project#22791 sgl-project#23533
@Wen-xuan-Xu Wen-xuan-Xu force-pushed the fix/moe-orphan-configs branch from 62f5997 to 7ec2604 Compare April 29, 2026 05:03
@Wen-xuan-Xu
Copy link
Copy Markdown
Contributor Author

/tag-and-rerun-ci

Copy link
Copy Markdown
Collaborator

@Qiaolin-Yu Qiaolin-Yu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice catch

@Qiaolin-Yu Qiaolin-Yu merged commit d9270b8 into sgl-project:main Apr 29, 2026
112 of 128 checks passed
vguduruTT pushed a commit to vguduruTT/sglang that referenced this pull request May 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants