[Frontend] Delegate preprocessing to OpenAIServingRender#36483
[Frontend] Delegate preprocessing to OpenAIServingRender#36483vllm-bot merged 20 commits intovllm-project:mainfrom
OpenAIServingRender#36483Conversation
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
There was a problem hiding this comment.
Code Review
This pull request refactors the preprocessing logic for chat and completion requests by centralizing it into OpenAIServingRender. This is a good architectural improvement that reduces code duplication.
However, I've identified a critical issue. The warmup() method in vllm/entrypoints/openai/chat_completion/serving.py was not updated to reflect this refactoring. It still calls self._preprocess_chat(), which has been moved to OpenAIServingRender. This will cause an AttributeError and prevent the server from starting.
Since this code is not in the diff, I cannot add a specific comment. The fix is to update the call in warmup() to delegate to self.openai_serving_render._preprocess_chat(...). This is a critical fix.
|
Hi @sagearc, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
|
Hi @sagearc, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
|
This pull request has merge conflicts that must be resolved before it can be |
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
|
@DarkLight1337 Updated |
Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
|
Hi @sagearc, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
DarkLight1337
left a comment
There was a problem hiding this comment.
Thanks, let's see if tests pass now
|
Hi @sagearc, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
|
Hi @sagearc, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: whycoming <120623296@qq.com>
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
with vllm top commit 1b6cb920e6ebcac57154e6154578c39d4892a16c has some diffs with vllm-omni, vllm-project/vllm#32104 vllm-project/vllm#32951 vllm-project/vllm#37287 vllm-project/vllm#36483 just modify vllm-omni to work Signed-off-by: Michael Qiu <qiudayu.qdy@antgroup.com>
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com>
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: Monishver Chandrasekaran <monishverchandrasekaran@gmail.com>
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: Vinay Damodaran <vrdn@hey.com>
…ect#36483) Signed-off-by: Sage Ahrac <sagiahrak@gmail.com> Signed-off-by: EricccYang <yangyang4991@gmail.com>
Purpose
OpenAIServingRender(#36166) owns the authoritative preprocessing logic for chat and completion requests, butOpenAIServingChatandOpenAIServingCompletionwere duplicating it rather than delegating.Changes:
render_chat_request/render_completion_requestinto a public entry point (with model check) and a private helper (render_chat/render_completion) with no model/engine checks.OpenAIServingChatandOpenAIServingCompletionnow acceptopenai_serving_renderand delegate their preprocessing to those helpers after doing their own engine-aware checks.OpenAIServingRenderis created first ininit_generate_stateso it can be passed to all serving classes.Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.