[Frontend] Complete OpenAI render delegation #37287
DarkLight1337 merged 8 commits into vllm-project:main
Conversation
…ngRender Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
…ingResponses Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
Code Review
This pull request completes the delegation of rendering logic to OpenAIServingRender by moving several preprocessing methods out of the base OpenAIServing class. The changes are well-structured and consistently applied across multiple components, including ServingTokens, OpenAIServingPooling, and OpenAIServingResponses. The refactoring centralizes the rendering logic, improving code organization and separation of concerns. The moved methods and their call sites have been updated correctly. Overall, this is a solid refactoring effort with no apparent issues.
@@ -47,6 +48,7 @@ def __init__(
    self,
    engine_client: EngineClient,
Can the "OpenAI" and "openai_" prefixes of OpenAIServingRender be removed? I think this has nothing to do with OpenAI.
Yeah let's do that in the next PR
We can discuss scope, but it does have something to do with OpenAI — the main entrypoints accept CompletionRequest and ChatCompletionRequest, which are OpenAI API types.
In my understanding, the renderer is the preprocessing layer of vLLM and is vendor-independent, as it will be used by the entrypoints of all vendors.
From my understanding that might make more sense for the renderer itself? class LLM seems to use it directly via its own API, without going through much of the logic in OpenAIServingRender.
vLLM top commit 1b6cb920e6ebcac57154e6154578c39d4892a16c has some diffs from vllm-omni (vllm-project/vllm#32104, vllm-project/vllm#32951, vllm-project/vllm#37287, vllm-project/vllm#36483); this just modifies vllm-omni to work. Signed-off-by: Michael Qiu <qiudayu.qdy@antgroup.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: Monishver Chandrasekaran <monishverchandrasekaran@gmail.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: Vinay Damodaran <vrdn@hey.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: EricccYang <yangyang4991@gmail.com>
Completes the disaggregated frontend work from #36166 by delegating all remaining OpenAIServing preprocessing methods to OpenAIServingRender — the canonical GPU-less render layer — and removing the now-duplicate copies from the base class.

Changes
- ServingTokens — routes preprocess_completion through openai_serving_render
- OpenAIServingPooling — routes preprocess_completion, preprocess_cmpl, preprocess_chat, and validate_chat_template through openai_serving_render
- OpenAIServingResponses — pulls down _render_next_turn and _generate_with_builtin_tools from the base class (exclusively used here)
- OpenAIServing — 5 methods and their associated imports removed

Test Plan
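The change list above follows one pattern: serving classes hold a reference to a shared render object and forward preprocessing calls to it, rather than inheriting those methods from a base class. A minimal sketch of that pattern, with hypothetical stand-in classes (`RenderLayer`, `ServingTokens` here are simplified, not the real vLLM signatures):

```python
# Hypothetical sketch of the delegation pattern described above; the real
# vLLM classes (OpenAIServing, OpenAIServingRender, ServingTokens) have
# different constructors and signatures.
class RenderLayer:
    """Owns all preprocessing; serving classes hold a reference to it."""

    def preprocess_completion(self, prompt: str) -> list[int]:
        # Stand-in tokenization: one token ID per whitespace-separated word.
        return list(range(len(prompt.split())))


class ServingTokens:
    """A frontend endpoint that delegates instead of inheriting preprocessing."""

    def __init__(self, render: RenderLayer):
        self._render = render

    def handle(self, prompt: str) -> list[int]:
        # Route preprocessing through the shared render layer, so the
        # serving class itself stays free of tokenization logic.
        return self._render.preprocess_completion(prompt)


serving = ServingTokens(RenderLayer())
token_ids = serving.handle("count these four tokens")
```

Composition over inheritance here means the base serving class can shrink (the "5 methods removed" item above) while every endpoint shares one preprocessing implementation.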
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.