Support PP for zmq_to_scheduler by gty111 · Pull Request #15312 · sgl-project/sglang

gty111 · 2025-12-17T07:36:32Z

Motivation

Follow up PRs for #12263 to support PP for zmq_to_scheduler

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.
Work with maintainers to merge your PR. See the PR Merge Process

gemini-code-assist · 2025-12-17T07:36:36Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

Copilot

Pull request overview

This PR enables Pipeline Parallelism (PP) support for the zmq_to_scheduler encoder transfer backend in encoder-decoder disaggregated inference scenarios. The key insight is that MM (multimodal) processing only needs to occur at the first PP stage (pp_rank 0), with subsequent stages receiving pre-processed requests from upstream stages.

Key changes:

Restricted MM receiver initialization and processing to pp_rank 0 only
Changed synchronization scope from world_size (all ranks) to tp_size (TP ranks within a PP stage)
Removed the embedding_ports mechanism in favor of direct encoder-to-scheduler communication per TP group

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
python/sglang/srt/server_args.py	Removed validation that prevented PP when using zmq_to_scheduler
python/sglang/srt/models/qwen2_5_vl.py	Simplified weight loading to skip missing weights consistently across modes
python/sglang/srt/managers/io_struct.py	Removed embedding_ports field from request input structures
python/sglang/srt/managers/tokenizer_manager.py	Removed embedding_ports parameter from tokenized object creation
python/sglang/srt/managers/scheduler.py	Added pp_rank checks for MM receiver initialization and processing; added tp_group parameter
python/sglang/srt/disaggregation/encode_receiver.py	Changed synchronization to TP-only scope, improved device placement, removed embedding_port logic

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

python/sglang/srt/models/qwen2_5_vl.py

python/sglang/srt/disaggregation/encode_receiver.py

ShangmingCai · 2025-12-18T03:31:25Z

/tag-and-rerun-ci

ShangmingCai

LGTM, let's wait for the CI.

ShangmingCai · 2025-12-22T03:53:39Z

/rerun-failed-ci

ShangmingCai

LGTM. The modification of the scheduler part is safe since it is all EPD-related code.

Full green CI: https://github.com/sgl-project/sglang/actions/runs/20449993152/job/58772672138?pr=15312

Support PP for zmq_to_scheduler

d71037b

Copilot AI review requested due to automatic review settings December 17, 2025 07:36

gty111 requested review from ByronHsu, ShangmingCai, Ying1123, hnyls2002, merrymercy and xiezhq-hermann as code owners December 17, 2025 07:36

Copilot started reviewing on behalf of gty111 December 17, 2025 07:37 View session

Copilot AI reviewed Dec 17, 2025

View reviewed changes

python/sglang/srt/models/qwen2_5_vl.py Show resolved Hide resolved

ShangmingCai reviewed Dec 17, 2025

View reviewed changes

python/sglang/srt/disaggregation/encode_receiver.py Outdated Show resolved Hide resolved

gty111 added 2 commits December 17, 2025 08:26

Add type hints for mm_receiver

9b87ca6

Merge branch 'main' into fix_pp_scheduler

f2c8085

github-actions bot added the run-ci label Dec 18, 2025

ShangmingCai approved these changes Dec 18, 2025

View reviewed changes

gty111 added 2 commits December 18, 2025 10:45

Merge branch 'main' into fix_pp_scheduler

6a00ac3

Merge branch 'main' into fix_pp_scheduler

6600893

Fix merge

7f3d500

gty111 mentioned this pull request Dec 23, 2025

[Roadmap] Encoder Disaggregation for Multi-modal models #15118

Open

10 tasks

ShangmingCai approved these changes Dec 23, 2025

View reviewed changes

ShangmingCai merged commit fa29669 into sgl-project:main Dec 23, 2025
150 of 155 checks passed

jiaming1130 pushed a commit to zhuyijie88/sglang that referenced this pull request Dec 25, 2025

Support PP for zmq_to_scheduler (sgl-project#15312)

1060786

gty111 mentioned this pull request Dec 28, 2025

Encoder-Prefill-Decode (EPD) Disaggregation lm-sys/lm-sys.github.io#274

Merged

YChange01 pushed a commit to YChange01/sglang that referenced this pull request Jan 13, 2026

Support PP for zmq_to_scheduler (sgl-project#15312)

b801f51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support PP for zmq_to_scheduler#15312

Support PP for zmq_to_scheduler#15312
ShangmingCai merged 6 commits intosgl-project:mainfrom
gty111:fix_pp_scheduler

gty111 commented Dec 17, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Dec 17, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

ShangmingCai commented Dec 18, 2025

Uh oh!

ShangmingCai left a comment

Uh oh!

ShangmingCai commented Dec 22, 2025

Uh oh!

ShangmingCai left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

gty111 commented Dec 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Uh oh!

gemini-code-assist bot commented Dec 17, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

ShangmingCai commented Dec 18, 2025

Uh oh!

ShangmingCai left a comment

Choose a reason for hiding this comment

Uh oh!

ShangmingCai commented Dec 22, 2025

Uh oh!

ShangmingCai left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gty111 commented Dec 17, 2025 •

edited

Loading