[NPU][Doc] Update GLM-5 docs, enabling deepep by default by cen121212 · Pull Request #23708 · sgl-project/sglang

cen121212 · 2026-04-25T07:43:35Z

Motivation

When DeepEP is not enabled, there can be accuracy issues, so DeepEP is enabled by default.

Modifications

docs/platforms/ascend/ascend_npu_glm5_examples.md

Accuracy Tests

Speed Tests and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review and Merge Process

Ping Merge Oncalls to start the process. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

gemini-code-assist

Code Review

This pull request updates the Ascend NPU GLM5 documentation by removing specific environment variables and increasing the maximum batch size for CUDA graphs. A review comment correctly identifies that the 'deepep' backend is incompatible with Ascend NPU hardware and suggests using 'ascend_fuseep' instead, while also recommending the removal of the 'deepep-mode' flag.

gemini-code-assist · 2026-04-25T07:49:08Z

+        --moe-a2a-backend deepep \
+        --deepep-mode auto \


For Ascend NPU, the optimized MoE All-to-All backend is ascend_fuseep. The deepep backend is specifically designed for NVIDIA GPUs using the deep_ep library and will not work on NPU. Additionally, --deepep-mode is not used by the ascend_fuseep backend and should be removed.

Suggested change

--moe-a2a-backend deepep \

--deepep-mode auto \

--moe-a2a-backend ascend_fuseep \

iforgetmyname · 2026-05-08T02:49:38Z

/tag-and-rerun-ci

…#23708)

zijiexia · 2026-06-04T21:52:21Z

Hi @cen121212 , we've moved our documentations under docs_new so your changes here might not be correctly reflected on our documentation page. Can you kindly migrate this change to the corresponding page under docs_new? Thank you so much! sorry for any confusions.

npu: fix glm5 docs

7ce7624

cen121212 requested a review from wisclmy0611 as a code owner April 25, 2026 07:43

github-actions Bot added documentation Improvements or additions to documentation npu labels Apr 25, 2026

gemini-code-assist Bot reviewed Apr 25, 2026

View reviewed changes

iforgetmyname approved these changes May 8, 2026

View reviewed changes

iforgetmyname changed the title ~~【NPU】【docs】 fix glm5 docs~~ [NPU][Doc] Update GLM-5 docs, enabling deepep by default May 8, 2026

github-actions Bot added the run-ci label May 8, 2026

iforgetmyname merged commit 461bc8a into sgl-project:main May 8, 2026
42 checks passed

Dogacel pushed a commit to Dogacel/sglang-fork that referenced this pull request May 8, 2026

[NPU][Doc] Update GLM-5 docs, enabling deepep by default (sgl-project…

31be508

…#23708)

LLThomas pushed a commit to LLThomas/sglang that referenced this pull request May 8, 2026

[NPU][Doc] Update GLM-5 docs, enabling deepep by default (sgl-project…

7270b2b

…#23708)

LucQueen pushed a commit to LucQueen/sglang that referenced this pull request May 12, 2026

[NPU][Doc] Update GLM-5 docs, enabling deepep by default (sgl-project…

8a290a1

…#23708)

zijiexia mentioned this pull request May 25, 2026

fix(ci): enforce legacy docs/ gate in Lint workflow #26322

Merged

5 tasks

zijiexia mentioned this pull request Jun 4, 2026

docs: sync legacy docs/-only updates into docs_new (Mintlify) #27308

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NPU][Doc] Update GLM-5 docs, enabling deepep by default#23708

[NPU][Doc] Update GLM-5 docs, enabling deepep by default#23708
iforgetmyname merged 1 commit into
sgl-project:mainfrom
cen121212:4-25-sgl-project-main

cen121212 commented Apr 25, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Uh oh!

iforgetmyname commented May 8, 2026

Uh oh!

Uh oh!

zijiexia commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	--moe-a2a-backend deepep \
	--deepep-mode auto \
	--moe-a2a-backend ascend_fuseep \

Conversation

cen121212 commented Apr 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Apr 25, 2026

Choose a reason for hiding this comment

Uh oh!

iforgetmyname commented May 8, 2026

Uh oh!

Uh oh!

zijiexia commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cen121212 commented Apr 25, 2026 •

edited

Loading