[PD-Disagg] Fully support external DP dispatch w/ PD-disaggregation mode. by hnyls2002 · Pull Request #19268 · sgl-project/sglang

hnyls2002 · 2026-02-24T21:10:47Z

Summary

API: rename data_parallel_rank → routed_dp_rank, add disagg_prefill_dp_rank

Rename data_parallel_rank → routed_dp_rank across the request pipeline to clarify it is a routing directive from external routers, not an infrastructure property
Add disagg_prefill_dp_rank field for decode servers — external router can specify which prefill DP worker holds the KV cache, skipping bootstrap server queries
Keep data_parallel_rank as a deprecated alias in all public API surfaces with DeprecationWarning

Decode-side fix

Rename _resolve_dp_rank → _resolve_prefill_dp_rank and remove incorrect data_parallel_rank check — the old code conflated decode-side DP routing rank with the prefill DP rank needed for KV transfer (never triggered because the field was always None)
_resolve_prefill_dp_rank now checks disagg_prefill_dp_rank first, then falls back to existing bootstrap server resolution ([PD-Disagg] Support query dp rank from bootstrap server. #19168)

Motivation: split an overloaded field into two

On main, data_parallel_rank is consumed by two places with different semantics:

DataParallelController.maybe_external_dp_rank_routing — treats it as "which DP worker should handle this request" (routing)
DecodePreallocQueue._resolve_dp_rank — treats it as "which prefill DP worker has the KV cache" (KV transfer)

Meanwhile, prefill_dp_rank only existed as an internal variable name inside the KV transfer layer (_create_receiver_and_enqueue), never as a request-level field.

This PR splits the single overloaded field into two with clear semantics:

routed_dp_rank — consumed only by DataParallelController for DP worker routing
disagg_prefill_dp_rank — consumed only by _resolve_prefill_dp_rank for KV transfer, now exposed as a public API field so external routers can specify it directly

Propagation

Thread routed_dp_rank + disagg_prefill_dp_rank through TokenizedGenerateReqInput, Req, tokenizer_manager, scheduler, encode_receiver
DataParallelController.maybe_external_dp_rank_routing uses req.routed_dp_rank

Backward compatibility

data_parallel_rank is preserved as a deprecated alias at every public API layer. Callers using the old field name (including sgl-model-gateway Rust/gRPC) continue to work without changes.

API surface	File	Compat mechanism
`CompletionRequest`	`protocol.py`	`model_validator(mode="before")` merges into `routed_dp_rank` + warns
`ChatCompletionRequest`	`protocol.py`	same
`GenerateReqInput` (`/generate`)	`io_struct.py`	`normalize_batch_and_arguments()` merges + warns
`Engine.generate()`	`engine.py`	function param kept, merged before use + warns
`Engine.async_generate()`	`engine.py`	same
`EngineBase.generate()`	`EngineBase.py`	abstract signature includes both old and new params

Internal structs (TokenizedGenerateReqInput, TokenizedEmbeddingReqInput, Req) are renamed directly — no alias needed since they are not public API.

Testing

Propagate scheduler's dp_rank into response meta_info so external routers can verify routing correctness
Add --test-external-dp-routing to mini-lb: randomly assigns routed_dp_rank / disagg_prefill_dp_rank, asserts decode response dp_rank matches (prefill correctness verified implicitly via KV transfer)
Add TestDisaggregationDPAttentionExternalRouting test class (currently skipped pending docker image update)

gemini-code-assist · 2026-02-24T21:10:50Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

hnyls2002 · 2026-02-24T22:06:48Z

/rerun-stage stage-c-test-8-gpu-h20

github-actions · 2026-02-24T22:07:08Z

✅ Triggered stage-c-test-8-gpu-h20 to run independently (skipping dependencies).

github-actions · 2026-02-24T22:07:14Z

🔗 View workflow run

hnyls2002 · 2026-02-24T23:21:35Z

/tag-and-rerun-ci

Co-authored-by: Ratish P <114130421+ratish1@users.noreply.github.com>

hnyls2002 · 2026-02-25T03:52:41Z

All related CIs (except for B200s) passed: https://github.com/sgl-project/sglang/actions/runs/22377079442/job/64781128983

doujiang24 · 2026-03-03T07:48:23Z

@hnyls2002 How about adding an optional HTTP header, i.e. X-data-parallel-rank, which has higher priority to specify the dp-rank than in the request-body payload?
It could be more friendly for external router? Thanks.

hnyls2002 · 2026-03-03T09:22:48Z

@doujiang24 Please submit a PR, thanks.

…ode. (sgl-project#19268) Co-authored-by: Ratish P <114130421+ratish1@users.noreply.github.com>

hnyls2002 added 2 commits February 24, 2026 13:03

rename dp_rank -> prefill_dp_rank

43f3141

update

e574e41

hnyls2002 requested review from ByronHsu, CatherineSue, JustinTong0323, ShangmingCai, Ying1123, ispobock, merrymercy, slin1237 and xiezhq-hermann as code owners February 24, 2026 21:10

fix EPD case

9660502

add test

b873e82

hnyls2002 requested a review from key4ng as a code owner February 24, 2026 23:21

github-actions bot added the model-gateway label Feb 24, 2026

github-actions bot added the run-ci label Feb 24, 2026

hnyls2002 added 5 commits February 24, 2026 15:37

update

2c109d9

simplify the code

077f9b7

do not verify with streaming mode

c926963

warning explictly

8067647

reduce duplication

a5ae816

hnyls2002 added the high priority label Feb 25, 2026

hnyls2002 and others added 2 commits February 24, 2026 16:55

warnings.warn

a4ddce2

Co-authored-by: Ratish P <114130421+ratish1@users.noreply.github.com>

Merge branch 'main' into lsyin/external-dp-rank

9c4d6ad

hnyls2002 force-pushed the lsyin/external-dp-rank branch from 6f2567e to 9c4d6ad Compare February 25, 2026 03:51

hnyls2002 merged commit 539f772 into main Feb 25, 2026
35 of 76 checks passed

hnyls2002 deleted the lsyin/external-dp-rank branch February 25, 2026 03:58

Duyi-Wang mentioned this pull request Feb 27, 2026

[PD-Disagg][Fix] Remove 'test_external_dp_routing' from Rust Router constructor parameters. #19492

Merged

5 tasks

doujiang24 mentioned this pull request Mar 4, 2026

feature: support X-Data-Parallel-Rank header to specific dp-rank. #19832

Merged

5 tasks

This was referenced Mar 12, 2026

Gateway supports dp rank scheduling and scheduling with the minimun number of tokens #16742

Closed

Gateway supports dp rank scheduling and scheduling with the minimun number of tokens #20435

Open

Wangzheee pushed a commit to Wangzheee/sglang that referenced this pull request Mar 21, 2026

[PD-Disagg] Fully support external DP dispatch w/ PD-disaggregation m…

7f694c7

…ode. (sgl-project#19268) Co-authored-by: Ratish P <114130421+ratish1@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PD-Disagg] Fully support external DP dispatch w/ PD-disaggregation mode.#19268

[PD-Disagg] Fully support external DP dispatch w/ PD-disaggregation mode.#19268
hnyls2002 merged 11 commits intomainfrom
lsyin/external-dp-rank

hnyls2002 commented Feb 24, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Feb 24, 2026

Uh oh!

hnyls2002 commented Feb 24, 2026

Uh oh!

github-actions bot commented Feb 24, 2026

Uh oh!

github-actions bot commented Feb 24, 2026

Uh oh!

hnyls2002 commented Feb 24, 2026

Uh oh!

hnyls2002 commented Feb 25, 2026

Uh oh!

Uh oh!

doujiang24 commented Mar 3, 2026 •

edited

Loading

Uh oh!

hnyls2002 commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hnyls2002 commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Backward compatibility

Testing

Uh oh!

gemini-code-assist bot commented Feb 24, 2026

Uh oh!

hnyls2002 commented Feb 24, 2026

Uh oh!

github-actions bot commented Feb 24, 2026

Uh oh!

github-actions bot commented Feb 24, 2026

Uh oh!

hnyls2002 commented Feb 24, 2026

Uh oh!

hnyls2002 commented Feb 25, 2026

Uh oh!

Uh oh!

doujiang24 commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hnyls2002 commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hnyls2002 commented Feb 24, 2026 •

edited

Loading

doujiang24 commented Mar 3, 2026 •

edited

Loading