[Frontend][3/n] Make pooling entrypoints request schema consensus | EmbedRequest & ClassifyRequest#32905
Conversation
|
Documentation preview: https://vllm--32905.org.readthedocs.build/en/32905/ |
There was a problem hiding this comment.
Code Review
This pull request effectively refactors the pooling entrypoint request schemas by introducing EncodingRequestMixin, EmbedRequestMixin, and ClassifyRequestMixin. This change centralizes common parameters, reduces code duplication, and improves maintainability across EmbeddingRequest, ClassificationRequest, and PoolingRequest protocols. The refactoring is well-executed and consistently applied. My review includes a couple of minor fixes for documentation snippet tags to ensure the documentation builds correctly.
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: wang.yuqi <noooop@126.com>
|
Ready for review |
| ```python | ||
| --8<-- "vllm/entrypoints/openai/protocol.py:completion-sampling-params" | ||
| --8<-- "vllm/entrypoints/openai/completion/protocol.py:completion-sampling-params" | ||
| ``` |
There was a problem hiding this comment.
Fixed some API documentation; there might still be some that need to be fixed.
…mbedRequest & ClassifyRequest (vllm-project#32905) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: 陈建华 <1647430658@qq.com>
…mbedRequest & ClassifyRequest (vllm-project#32905) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
…mbedRequest & ClassifyRequest (vllm-project#32905) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Purpose
Split out the following RequestMixin
address #31784 (comment)
Test Plan
tests/entrypoints/pooling/
Test Result
pass
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.