[Frontend] Enable generic structured_outputs for responses API by alecsolder · Pull Request #33709 · vllm-project/vllm

alecsolder · 2026-02-03T16:35:54Z

Purpose

The current ResponsesAPI implementation only supports setting an output text format using json_schema, however for more complicated use cases like grammars, regexes, choices, etc, you need to be able to pass in the full structured_outputs object

Test Plan

vllm serve openai/gpt-oss-20b --enforce-eager --max-model-len=65536 \
--tool-call-parser=openai --enable-auto-tool-choice --reasoning-parser=openai_gptoss

curl -X POST http://localhost:8000/v1/responses \
    -H "Content-Type: application/json" \
    -d '{
      "model": "openai/gpt-oss-20b",
      "input": "Pick a color",
      "structured_outputs": {
        "choice": ["red", "green", "blue"]
      }
    }'

Test Result

Final output message:

{"id":"msg_946aeff87d4ed2e2","content":[{"annotations":[],"text":"green","type":"output_text","logprobs":null}],"

Full response, showing it still respects only enabling it after reasoning

{"id":"resp_be3abebb226eafdf","created_at":1770136083,"incomplete_details":null,"instructions":null,"metadata":null,"model":"openai/gpt-oss-20b","object":"response","output":[{"id":"rs_9664daf6b147c565","summary":[],"type":"reasoning","content":[{"text":"User says: \"Pick a color\". They want a color. Probably answer with a color name. We can also give maybe a suggestion like \"sky blue\" or just pick a random color; maybe include an RGB hex code. Probably pick one: like \"emerald green\". Let's pick \"emerald green (#50C878)\".","type":"reasoning_text"}],"encrypted_content":null,"status":null},{"id":"msg_946aeff87d4ed2e2","content":[{"annotations":[],"text":"green","type":"output_text","logprobs":null}],"role":"assistant","status":"completed","type":"message"}],"parallel_tool_calls":true,"temperature":1.0,"tool_choice":"auto","tools":[],"top_p":1.0,"background":false,"max_output_tokens":65468,"max_tool_calls":null,"previous_response_id":null,"prompt":null,"reasoning":null,"service_tier":"auto","status":"completed","text":null,"top_logprobs":null,"truncation":"disabled","usage":{"input_tokens":68,"input_tokens_details":{"cached_tokens":64,"input_tokens_per_turn":[68],"cached_tokens_per_turn":[64]},"output_tokens":79,"output_tokens_details":{"reasoning_tokens":69,"tool_output_tokens":0,"output_tokens_per_turn":[79],"tool_output_tokens_per_turn":[0]},"total_tokens":147},"user":null,"input_messages":null,"output_messages":null}%

gemini-code-assist

Code Review

This pull request enables generic structured_outputs for the responses API, which is a great enhancement. The implementation is straightforward and includes relevant tests. I've found one area for improvement regarding the conflict detection logic to make it more consistent and robust. My feedback is detailed in the review comment.

vllm/entrypoints/openai/responses/protocol.py

yeqcharlotte · 2026-02-03T23:38:54Z

vllm/entrypoints/openai/responses/protocol.py

    # this cannot be used in conjunction with previous_response_id
    # TODO: consider supporting non harmony messages as well
    previous_input_messages: list[OpenAIHarmonyMessage | dict] | None = None
+    structured_outputs: StructuredOutputsParams | None = Field(


how does users access these from http endpoints? openai responses support passing structured output using text.format

You can just set it directly for http, there is an example in the PR description

curl -X POST http://localhost:8000/v1/responses \ -H "Content-Type: application/json" \ -d '{ "model": "openai/gpt-oss-20b", "input": "Pick a color", "structured_outputs": { "choice": ["red", "green", "blue"] } }'

If we wanted to put it on text.format, we would have to implement our own new class which can differentiate the OpenAI ResponseFormatTextConfig type from the structured output type, which has already had annoying changes in the past.

IMO I think I prefer keeping the two fields separate because it would allow us to more clearly differentiate "the code needed to provide a complete Responses API implementation" from "extra features on top of responses API for vLLM specifically". Keeping it as the StructuredOutputsParams type would also mean that it is reusable across the different provider APIs longer term, it would be nice to be able to set the same thing for Anthropic apis and Openai apis to guide model behavior in a way that isn't explicitly tied to API functionality.

daniel-salib · 2026-02-06T19:30:00Z

LGTM!

Signed-off-by: Alec Solder <alecs@fb.com>

…project#33709) Signed-off-by: Alec Solder <alecs@fb.com> Co-authored-by: Alec Solder <alecs@fb.com> Signed-off-by: Eldar Kurtic <research@neuralmagic.com>

…project#33709) Signed-off-by: Alec Solder <alecs@fb.com> Co-authored-by: Alec Solder <alecs@fb.com>

alecsolder requested review from DarkLight1337, NickLucche, aarnphm, chaunceyjiang and robertgshaw2-redhat as code owners February 3, 2026 16:35

mergify bot added the frontend label Feb 3, 2026

gemini-code-assist bot reviewed Feb 3, 2026

View reviewed changes

vllm/entrypoints/openai/responses/protocol.py Show resolved Hide resolved

yeqcharlotte reviewed Feb 3, 2026

View reviewed changes

zhuohan123 enabled auto-merge (squash) February 9, 2026 18:07

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 9, 2026

Alec Solder added 3 commits February 10, 2026 21:55

Enable generic structured_outputs for responses API

3742a23

Signed-off-by: Alec Solder <alecs@fb.com>

no inline imports

acaaa43

Signed-off-by: Alec Solder <alecs@fb.com>

Move up error check

d5d7cd6

Signed-off-by: Alec Solder <alecs@fb.com>

auto-merge was automatically disabled February 11, 2026 05:59
Head branch was pushed to by a user without write access

alecsolder force-pushed the alecs/responses_grammar branch from f590726 to d5d7cd6 Compare February 11, 2026 05:59

alecsolder added 5 commits February 11, 2026 20:00

Merge branch 'main' into alecs/responses_grammar

f7e1a14

Merge branch 'main' into alecs/responses_grammar

a62ca7c

Merge branch 'main' into alecs/responses_grammar

be5a992

Merge branch 'main' into alecs/responses_grammar

53f9a24

Merge branch 'main' into alecs/responses_grammar

941b860

zhuohan123 merged commit be7370d into vllm-project:main Feb 13, 2026
5 of 6 checks passed

llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026

[Frontend] Enable generic structured_outputs for responses API (vllm-…

f9287c0

…project#33709) Signed-off-by: Alec Solder <alecs@fb.com> Co-authored-by: Alec Solder <alecs@fb.com>

This was referenced Mar 3, 2026

[Responses API] Structured output + reasoning via structural tag embedding #35873

Closed

[Responses API] Structured output + reasoning via structural tag embedding #35904

Open

tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026

[Frontend] Enable generic structured_outputs for responses API (vllm-…

6620241

…project#33709) Signed-off-by: Alec Solder <alecs@fb.com> Co-authored-by: Alec Solder <alecs@fb.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Frontend] Enable generic structured_outputs for responses API#33709

[Frontend] Enable generic structured_outputs for responses API#33709
zhuohan123 merged 8 commits intovllm-project:mainfrom
alecsolder:alecs/responses_grammar

alecsolder commented Feb 3, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

yeqcharlotte Feb 3, 2026

Uh oh!

chaunceyjiang Feb 4, 2026

Uh oh!

alecsolder Feb 4, 2026

Uh oh!

daniel-salib commented Feb 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

alecsolder commented Feb 3, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

yeqcharlotte Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

chaunceyjiang Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

alecsolder Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

daniel-salib commented Feb 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

alecsolder commented Feb 3, 2026 •

edited by github-actions bot

Loading