
[Bugfix] Actually enable serialize_messages for harmony Responses (related to #26185)#27377

Open
jacobthebanana wants to merge 3 commits into vllm-project:main from VectorInstitute:response-harmony-stopgap

Conversation

@jacobthebanana
Contributor

@jacobthebanana jacobthebanana commented Oct 23, 2025

Purpose

For the OpenAI-compatible v1/responses route, enable raw messages to be sent when enable_response_messages is set to True in extra_body.

Previously, the returned messages were empty because of an issue in openai/harmony (openai/harmony#78).

#26185 implements most of the fix, but the new serializers are never actually invoked, at least not when serving the model through vllm serve. The reason is that that PR registers them with when_used="json", so they are skipped by the plain model_dump() call in vllm/entrypoints/openai/api_server.py#L527-L529.

The fix is to trigger the serializers by setting mode="json" when invoking model_dump.
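The underlying Pydantic behavior can be reproduced with a minimal sketch (a hypothetical Message class, not vLLM's actual harmony types): a field serializer registered with when_used="json" only fires in JSON mode, so a plain model_dump() ignores it while model_dump(mode="json") triggers it.

```python
from pydantic import BaseModel, field_serializer


class Message(BaseModel):
    content: str

    # Only applied when serializing in JSON mode, analogous to the
    # serializers added in #26185 with when_used="json".
    @field_serializer("content", when_used="json")
    def serialize_content(self, content: str) -> dict:
        return {"type": "text", "text": content}


msg = Message(content="hello")

# Python mode: the when_used="json" serializer does not run.
print(msg.model_dump())             # {'content': 'hello'}

# JSON mode: the serializer fires, matching what the API server now emits.
print(msg.model_dump(mode="json"))  # {'content': {'type': 'text', 'text': 'hello'}}
```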

Test Plan

Start the vLLM server: vllm serve openai/gpt-oss-20b

Send a Response request with enable_response_messages set to True in extra_body

from openai import OpenAI

# Assumes a local vLLM server on the default port; adjust base_url as needed.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
model = "openai/gpt-oss-20b"
prompt = "Write a haiku about autumn leaves."

resp = client.responses.create(
    model=model,
    input=prompt,
    extra_body={"enable_response_messages": True},
)
print(resp.model_dump_json(indent=2))

Repeat the above for the streaming case.

Test Result

Original:

Details
```
"input_messages": [
  {
    "author": { "role": "system", "name": null },
    "content": [ {} ],
    "channel": null,
    "recipient": null,
    "content_type": null
  },
  ...
],
"output_messages": [
  ...
  {
    "author": { "role": "assistant", "name": null },
    "content": [ {} ],
    "channel": "final",
    "recipient": null,
    "content_type": null
  }
]
}
```

After adding mode="json":

Details
```
"input_messages": [
  {
    "role": "system",
    "name": null,
    "content": [
      {
        "model_identity": "You are ChatGPT, a large language model trained by OpenAI.",
        "reasoning_effort": "Medium",
        "conversation_start_date": "2025-10-22",
        "knowledge_cutoff": "2024-06",
        "channel_config": {
          "valid_channels": [ "analysis", "final" ],
          "channel_required": true
        },
        "type": "system_content"
      }
    ]
  },
  {
    "role": "user",
    "name": null,
    "content": [
      { "type": "text", "text": "Write a haiku about autumn leaves." }
    ]
  }
],
"output_messages": [
  {
    "role": "assistant",
    "name": null,
    "content": [
      { "type": "text", "text": "User wants a haiku about autumn leaves. Simple. Use 5-7-5 syllable structure. Let's produce one. Ensure it's about autumn leaves. Provide in one paragraph." }
    ],
    "channel": "analysis"
  },
  {
    "role": "assistant",
    "name": null,
    "content": [
      { "type": "text", "text": "Leaves whisper, fall— \ncrimson and amber drift down, \nautumn sighs in wind." }
    ],
    "channel": "final"
  }
]
}
```

Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@mergify mergify bot added frontend gpt-oss Related to GPT-OSS models labels Oct 23, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request correctly enables message serialization for Harmony Responses by calling model_dump(mode="json"). The change is a necessary fix for an upstream issue in openai/harmony, and the problem is well-described in the pull request. The code modification is simple, targeted, and correctly applied in both the create_responses and retrieve_responses functions. The inclusion of a TODO comment with a link to the upstream issue is good practice for maintainability. The change appears correct and complete, and I have no further suggestions.

jacobthebanana referenced this pull request Oct 23, 2025
@jacobthebanana jacobthebanana force-pushed the response-harmony-stopgap branch from 89af97b to 21d9a79 Compare October 23, 2025 14:10
…_response_messages is set.

Signed-off-by: Jacob-Junqi Tian <jacob@banana.abay.cf>
@jacobthebanana jacobthebanana force-pushed the response-harmony-stopgap branch from 21d9a79 to 26d9bdc Compare October 23, 2025 14:11
@jacobthebanana
Contributor Author

(force-pushing to add sign-off)

@jacobthebanana jacobthebanana changed the title Actually enables serialize_messages for harmony Responses (related to #26185) [Bugfix] Actually enable serialize_messages for harmony Responses (related to #26185) Oct 26, 2025
@mergify

mergify bot commented Jan 14, 2026

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @jacobthebanana.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Jan 14, 2026

Labels

bug Something isn't working frontend gpt-oss Related to GPT-OSS models needs-rebase

Projects

Status: To Triage
