fix pd-mtp metadata buffer hidden size by hnyls2002 · Pull Request #23918 · sgl-project/sglang

hnyls2002 · 2026-04-28T07:08:31Z

Use spec_hidden_size for the PD MetadataBuffers so the spec module's hidden state ferries correctly across prefill/decode. Adds matching EAGLE config to the PD-disagg NIXL test prefill side.

gemini-code-assist

Code Review

This pull request updates the disaggregation scheduler to use spec_hidden_size for EAGLE-based speculative algorithms and refactors the DSv4 Flash PD-disaggregation test. The test configuration now explicitly defines speculative decoding parameters for both prefill and decode stages and increases the GSM8K accuracy threshold to 0.95. Review feedback identifies an inconsistency in the scheduler where is_standalone() was omitted from a buffer sizing condition and suggests consolidating duplicated test arguments to improve maintainability.

gemini-code-assist · 2026-04-28T07:10:07Z

+                    model_config.spec_hidden_size
                    if self.spec_algorithm.is_eagle()


The condition for using spec_hidden_size on the decode side is inconsistent with the prefill side (lines 952-954). It should also include is_standalone() to ensure that metadata buffers are sized correctly when using standalone speculative algorithms (such as MTP). Without this, a buffer size mismatch will occur during state transfer if is_standalone() is true. Note that hidden_states_dtype (lines 902-906) likely requires a similar update for consistency, although it is not part of this diff hunk.

Suggested change

-                    model_config.spec_hidden_size
-                    if self.spec_algorithm.is_eagle()
+                    model_config.spec_hidden_size
+                    if self.spec_algorithm.is_eagle() or self.spec_algorithm.is_standalone()
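
One way to keep the prefill and decode conditions from drifting apart is to route both through a single helper. The sketch below is illustrative only: `ModelConfig`, `SpecAlgorithm`, and `metadata_hidden_size` are hypothetical stand-ins rather than the scheduler's actual classes, and the fallback to `hidden_size` is an assumption, since the else branch is not visible in this hunk.

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    # Hypothetical stand-in holding only the two fields used below.
    hidden_size: int
    spec_hidden_size: int


@dataclass
class SpecAlgorithm:
    # Hypothetical stand-in for the scheduler's spec_algorithm object.
    name: str = "NONE"

    def is_eagle(self) -> bool:
        return self.name == "EAGLE"

    def is_standalone(self) -> bool:
        return self.name == "STANDALONE"


def metadata_hidden_size(model_config: ModelConfig, spec_algorithm: SpecAlgorithm) -> int:
    """Pick the hidden size used to allocate the PD metadata buffers.

    Calling this one helper from both the prefill and the decode path
    means the two sides cannot disagree on the condition.
    """
    if spec_algorithm.is_eagle() or spec_algorithm.is_standalone():
        return model_config.spec_hidden_size
    # Assumed fallback: the base model's hidden size.
    return model_config.hidden_size
```

With a helper like this, both buffer-allocation sites would call `metadata_hidden_size(model_config, self.spec_algorithm)` instead of repeating the conditional inline.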

gemini-code-assist · 2026-04-28T07:10:08Z

        prefill_args = [
-            *COMMON_ARGS,
+            "--trust-remote-code",
            "--disaggregation-mode",
            "prefill",
            "--base-gpu-id",
            "0",
            "--tp",
            "4",
+            "--cuda-graph-max-bs",
+            "128",
+            "--max-running-requests",
+            "256",
+            "--mem-fraction-static",
+            "0.7",
            "--disaggregation-decode-tp",
            "4",
            "--disaggregation-decode-dp",
            "4",
+            "--speculative-algorithm",
+            "EAGLE",
+            "--speculative-num-steps",
+            "3",
+            "--speculative-eagle-topk",
+            "1",
+            "--speculative-num-draft-tokens",
+            "4",


The speculative and runtime arguments are now duplicated between start_prefill and start_decode. To improve maintainability and prevent configuration drift, consider defining a shared list for common arguments (e.g., --cuda-graph-max-bs, --max-running-requests, and the --speculative-* flags) and unpacking it into both prefill_args and decode_args.
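
A minimal sketch of that consolidation follows. The flag names and values are copied from the `prefill_args` hunk above; the list names `SPEC_ARGS` and `RUNTIME_ARGS` and the two builder functions are hypothetical, and the decode list is deliberately incomplete because the decode-side flags are not shown in this hunk.

```python
# Speculative-decoding flags shared by prefill and decode (values from the hunk above).
SPEC_ARGS = [
    "--speculative-algorithm", "EAGLE",
    "--speculative-num-steps", "3",
    "--speculative-eagle-topk", "1",
    "--speculative-num-draft-tokens", "4",
]

# Runtime flags the review suggests sharing between the two sides.
RUNTIME_ARGS = [
    "--cuda-graph-max-bs", "128",
    "--max-running-requests", "256",
]


def build_prefill_args() -> list[str]:
    """Prefill-side arguments, with the shared lists unpacked in place."""
    return [
        "--trust-remote-code",
        "--disaggregation-mode", "prefill",
        "--base-gpu-id", "0",
        "--tp", "4",
        *RUNTIME_ARGS,
        "--mem-fraction-static", "0.7",
        "--disaggregation-decode-tp", "4",
        "--disaggregation-decode-dp", "4",
        *SPEC_ARGS,
    ]


def build_decode_args() -> list[str]:
    """Decode-side arguments (placeholder: only the mode flag plus the shared lists)."""
    return [
        "--trust-remote-code",
        "--disaggregation-mode", "decode",
        *RUNTIME_ARGS,
        *SPEC_ARGS,
    ]
```

Changing a speculative setting then requires editing a single list, so the prefill and decode configurations stay in sync.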

fix pd-mtp metadata buffer hidden size

11e9546

hnyls2002 requested review from Ying1123, merrymercy and xiezhq-hermann as code owners April 28, 2026 07:08

gemini-code-assist Bot reviewed Apr 28, 2026


hnyls2002 merged commit 7e2faae into dsv4-rebase Apr 28, 2026
6 checks passed

hnyls2002 deleted the lsyin/fix-pd-mtp-spec-hidden-size branch April 28, 2026 18:22

This was referenced Apr 28, 2026

support asymmetric pd-mtp via mock spec hidden #23958

Merged

Deepseek V4 #23882

Merged

hnyls2002 merged 1 commit into dsv4-rebase from lsyin/fix-pd-mtp-spec-hidden-size
