
[R3] Add routed experts to openai entrypoint #38939

Open
hao-aaron wants to merge 3 commits into vllm-project:main from hao-aaron:r3-entrypoint

Conversation

@hao-aaron (Contributor) commented on Apr 3, 2026

Purpose

Adds the routed experts output introduced in #28284 to the OpenAI entrypoint.
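For illustration only, a rough client-side sketch of how the new field might be consumed; the model name, port, and exact JSON layout here are assumptions, and the --enable-return-routed-experts flag referenced below is the one described later in the review:

```python
# Hypothetical usage sketch: read routed experts from a chat completion served
# by a vLLM server assumed to be started with --enable-return-routed-experts.
import requests

resp = requests.post(
    "http://localhost:8000/v1/chat/completions",
    json={
        "model": "my-moe-model",  # placeholder model name
        "messages": [{"role": "user", "content": "Hello"}],
        "max_tokens": 8,
    },
)
choice = resp.json()["choices"][0]
# Expected to be absent/None unless the server flag is enabled.
print(choice.get("routed_experts"))
```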

Test Plan

new unit tests

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: ahao-anyscale <ahao@anyscale.com>
@mergify (bot) added the frontend label on Apr 3, 2026
@hao-aaron marked this pull request as ready for review on April 3, 2026, 19:26
@gemini-code-assist (bot) left a comment


Code Review

This pull request introduces the functionality to return routed expert indices in OpenAI-compatible chat and completion responses. It adds a routed_experts field to the response protocols and updates the serving logic to populate this field from the model output when the --enable-return-routed-experts flag is enabled. Additionally, a new test suite is included to verify the correct shape and values of the returned expert data. I have no feedback to provide.
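For readers unfamiliar with the change, here is a minimal sketch of what the described protocol and serving changes could look like; the class and function names and field types are assumptions for illustration, not the actual vLLM code:

```python
# Rough sketch (assumed names/types, not the actual vLLM implementation).
from typing import Optional

from pydantic import BaseModel


class ChatCompletionResponseChoiceSketch(BaseModel):
    index: int
    # One entry per generated token; each entry holds the expert indices the
    # token was routed to across the model's MoE layers.
    routed_experts: Optional[list[list[int]]] = None


def maybe_attach_routed_experts(
    choice: ChatCompletionResponseChoiceSketch,
    output_routed_experts: Optional[list[list[int]]],
    enable_return_routed_experts: bool,
) -> None:
    # Populate the field only when the server was started with
    # --enable-return-routed-experts and the engine produced routing data.
    if enable_return_routed_experts and output_routed_experts is not None:
        choice.routed_experts = output_routed_experts
```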

@SumanthRH (Contributor) left a comment


What is really needed here is support in the tokens-in-tokens-out /inference/v1/generate endpoint.

Can you replicate the modifications here:

class ServingTokens(OpenAIServing):

class GenerateResponseChoice(BaseModel):

and add a test?
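A hedged sketch of what the requested replication and test might look like; the class fields and test below are assumptions for illustration, not the actual vLLM code:

```python
# Hypothetical sketch of the reviewer's request: mirror the routed_experts
# field on the tokens-in-tokens-out /inference/v1/generate response choice,
# plus a unit-test-style shape check.
from typing import Optional

from pydantic import BaseModel


class GenerateResponseChoiceSketch(BaseModel):
    index: int
    token_ids: list[int]
    # Per-token routed expert indices; only set when the server runs with
    # --enable-return-routed-experts.
    routed_experts: Optional[list[list[int]]] = None


def test_routed_experts_shape_matches_tokens():
    # Two generated tokens, each routed to two experts in a single MoE layer.
    choice = GenerateResponseChoiceSketch(
        index=0,
        token_ids=[11, 42],
        routed_experts=[[3, 7], [1, 5]],
    )
    assert choice.routed_experts is not None
    assert len(choice.routed_experts) == len(choice.token_ids)
```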

Signed-off-by: ahao-anyscale <ahao@anyscale.com>
@hao-aaron requested a review from njhill as a code owner on April 7, 2026, 17:43
@SumanthRH (Contributor) left a comment


LGTM
