[Frontend] Fix default_chat_template_kwargs handling in Responses API #37739

sidsaha-ai wants to merge 2 commits into vllm-project:main
Conversation
Signed-off-by: Sid Saha <siddharthsaha@Siddharths-MacBook-Pro.local>
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai. 🚀
Documentation preview: https://vllm--37739.org.readthedocs.build/en/37739/
Code Review
This pull request correctly addresses the issue of default_chat_template_kwargs not being handled in the /v1/responses API. The changes effectively propagate both server-level defaults and per-request chat_template_kwargs to the prompt rendering and reasoning parser logic. The added unit and end-to-end tests provide good coverage for the new functionality. I found one minor issue in the documentation that needs to be addressed.
docs/features/reasoning_outputs.md (Outdated)

```markdown
## Limitations

- The reasoning content is only available for online serving's chat completion endpoint (`/v1/chat/completions`).
```
This documentation appears to be outdated with the changes in this PR. While this PR adds support for reasoning outputs in the /v1/responses endpoint, this line still states that reasoning content is only available for /v1/chat/completions. This should be updated to include /v1/responses to reflect the new capability.
Fixed in fac98b1 by updating the limitations section to include /v1/responses alongside /v1/chat/completions.
Signed-off-by: Sid Saha <siddharthsaha@Siddharths-MacBook-Pro.local>
```python
        "and vLLM will ignore it."
    ),
)
chat_template_kwargs: dict[str, Any] | None = Field(
```
Thanks~ @sidsaha-ai
This is a known issue. The reason we haven’t implemented it so far is that we wanted to wait and see whether OpenAI would introduce a similar field.
Otherwise, introducing these fields would cause the Responses API to overlap with chat completions.
Cool. Should we then wait and close this PR, or should I go ahead with rebasing and getting approval?
This pull request has merge conflicts that must be resolved before it can be merged.
Summary
`--default-chat-template-kwargs` was already available in the shared render stack, but the `/v1/responses` serving path still dropped those defaults when building prompts and when instantiating the reasoning parser used to post-process non-streaming responses. This meant Responses API requests could still behave as if Qwen3 thinking was enabled even when the server was started with `--default-chat-template-kwargs '{"enable_thinking": false}'`, which in turn could leave `output_text` empty and move all generated text into reasoning output.

Changes
- Plumb `default_chat_template_kwargs` into `OpenAIServingResponses`
- Add `chat_template_kwargs` to `ResponsesRequest`
- Merge server defaults with per-request `chat_template_kwargs` for responses prompt rendering
- Add test coverage for `chat_template_kwargs` support for `/v1/responses`

Testing
```shell
PATH="/Users/siddharthsaha/python_envs/vllm-pr/bin:$PATH" /Users/siddharthsaha/python_envs/vllm-pr/bin/python -m pytest tests/entrypoints/openai/responses/test_protocol.py -q
PATH="/Users/siddharthsaha/python_envs/vllm-pr/bin:$PATH" /Users/siddharthsaha/python_envs/vllm-pr/bin/python -m pytest tests/entrypoints/openai/responses/test_serving_responses.py -q
PATH="/Users/siddharthsaha/python_envs/vllm-pr/bin:$PATH" /Users/siddharthsaha/python_envs/vllm-pr/bin/python -m pytest tests/entrypoints/openai/responses/test_chat_template_kwargs.py -q
PATH="/Users/siddharthsaha/python_envs/vllm-pr/bin:$PATH" /Users/siddharthsaha/python_envs/vllm-pr/bin/pre-commit run --files docs/features/reasoning_outputs.md tests/entrypoints/openai/responses/test_protocol.py tests/entrypoints/openai/responses/test_serving_responses.py vllm/entrypoints/openai/generate/api_router.py vllm/entrypoints/openai/parser/responses_parser.py vllm/entrypoints/openai/responses/context.py vllm/entrypoints/openai/responses/protocol.py vllm/entrypoints/openai/responses/serving.py vllm/parser/abstract_parser.py
```

Related
This addresses the `/v1/responses` side of default/per-request chat template kwargs handling for reasoning models.
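For illustration, a `/v1/responses` request that opts out of thinking per call might carry a body like the one below; the model name and input text are made-up placeholders, and only the `chat_template_kwargs` field comes from this PR:

```python
import json

# Illustrative request body for POST /v1/responses; "my-reasoning-model"
# and the input text are hypothetical placeholders.
request_body = {
    "model": "my-reasoning-model",
    "input": "What is 2 + 2?",
    "chat_template_kwargs": {"enable_thinking": False},
}
payload = json.dumps(request_body)
print(payload)
```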