[Bugfix] Fix Responses API instructions leaking through previous_response_id #37727
he-yufeng wants to merge 1 commit into vllm-project:main from
Conversation
…onse_id Signed-off-by: Yufeng He <40085740+he-yufeng@users.noreply.github.com>
Code Review
This pull request addresses a bug where instructions from a previous response would leak into a new response when using previous_response_id. The change in vllm/entrypoints/openai/responses/utils.py correctly filters out system messages from the previous message history, aligning with the specification that instructions should not be carried over. New unit tests have been added in tests/entrypoints/openai/responses/test_responses_utils.py to validate the fix across various scenarios. The changes appear correct and are appropriately tested.
    # Add the previous messages.
    messages.extend(prev_msg)
    # Filter out system messages from previous conversation -- per the
    # OpenAI spec, instructions should NOT carry over across responses.
Looks good. Is there any related OpenAI spec documentation for this? Could you share the link?
Sure! From the OpenAI API Reference (Create Response), the `instructions` parameter description states:

When used along with `previous_response_id`, the instructions from a previous response will not be carried over to the next response. This makes it simple to swap out system (or developer) messages in new responses.
The Text Generation guide also reinforces this — the instructions parameter only applies to the current response, and instructions from previous turns will not be present in the context when using previous_response_id.
Fixes #37697
What's the problem
When using `/v1/responses` with `previous_response_id`, the `instructions` from the prior response carry over into the new response. Per the OpenAI spec, instructions should NOT carry over.

Root cause
`construct_input_messages()` in `responses/utils.py` prepends `request_instructions` as a system message, then the full messages list (including that system message) gets stored in `msg_store`. When the next request references `previous_response_id`, those stored messages, old system message included, are retrieved and extended into the new conversation. The new request also adds its own instructions, so you end up with both the old and the new system messages.

Fix
Filter out system messages when pulling `prev_msg` from the store in `construct_input_messages()`. One-line change: `messages.extend(prev_msg)` becomes `messages.extend(m for m in prev_msg if m.get("role") != "system")`.

This ensures each request only uses its own `instructions`, regardless of what the previous response had. Works correctly for all cases: new instructions provided, no instructions provided, or no previous response at all.

Test plan
New unit tests in `tests/entrypoints/openai/responses/test_responses_utils.py` covering:
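One of those tests might look roughly like this (hypothetical names and a simplified stand-in for the helper; the real suite exercises the actual `construct_input_messages()` from `vllm/entrypoints/openai/responses/utils.py`):

```python
# Hypothetical sketch of a test for the no-carry-over behavior.
# filter_previous_messages is an illustrative stand-in, not the real helper.

def filter_previous_messages(prev_msg: list[dict]) -> list[dict]:
    """Drop system messages from a stored previous conversation."""
    return [m for m in prev_msg if m.get("role") != "system"]

def test_system_messages_do_not_carry_over():
    prev = [
        {"role": "system", "content": "old instructions"},
        {"role": "user", "content": "hello"},
        {"role": "assistant", "content": "hi there"},
    ]
    filtered = filter_previous_messages(prev)
    # The old system message must be gone; user/assistant turns survive.
    assert {"role": "system", "content": "old instructions"} not in filtered
    assert len(filtered) == 2
```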