[Refactor] Extract Harmony streaming SSE event builders into streaming_events.py by sfeng33 · Pull Request #34909 · vllm-project/vllm

sfeng33 · 2026-02-19T19:20:19Z

Purpose

serving.py is ~2800 lines. The _emit_* methods are pure functions (state + data → events) that don't use instance state, yet they live on the class. Extracting them:

Reduces serving.py by ~800 lines
Makes event builders independently testable
Prepares for StreamingParsableContext support — the upcoming streaming parsable implementation will reuse these same event builders instead of duplicating them (as _process_simple_streaming_events currently does)

Future plan

This is the first step toward unifying SSE event emission across all three streaming paths (Harmony, Simple, Parsable):

This PR: Extract event builders into shared module
Follow-up: Further refactor SSE events so that StreamingParsableContext and HarmonyContext share the same event processor loop

Test Plan

pytest tests/entrypoints/openai/responses/test_harmony.py

Signed-off-by: sfeng33 <4florafeng@gmail.com>

gemini-code-assist

Code Review

This pull request successfully refactors the Harmony streaming SSE event builders by extracting them from serving.py into a dedicated streaming_events.py module. This significantly improves the maintainability and testability of the code, and sets a solid foundation for unifying event emission across different streaming contexts. The refactor correctly handles the transition of state and dependencies (like tool_server). I have identified a robustness issue in the browser tool event builder that could lead to runtime crashes on unexpected model output.

gemini-code-assist · 2026-02-19T19:21:38Z

vllm/entrypoints/openai/responses/streaming_events.py

+    parsed_args = json.loads(previous_item.content[0].text)
+    action = None
+
+    if function_name == "search":
+        action = response_function_web_search.ActionSearch(
+            type="search",
+            query=parsed_args["query"],
+        )
+    elif function_name == "open":
+        action = response_function_web_search.ActionOpenPage(
+            type="open_page",
+            # TODO: translate to url
+            url=f"cursor:{parsed_args.get('cursor', '')}",
+        )
+    elif function_name == "find":
+        action = response_function_web_search.ActionFind(
+            type="find",
+            pattern=parsed_args["pattern"],
+            # TODO: translate to url
+            url=f"cursor:{parsed_args.get('cursor', '')}",
+        )
+    else:
+        raise ValueError(f"Unknown function name: {function_name}")


The emit_browser_tool_events function lacks robustness against unexpected model output. Specifically:

json.loads will raise a JSONDecodeError if the model produces malformed JSON.

Accessing parsed_args["query"] and parsed_args["pattern"] will raise a KeyError if those keys are missing.

The ValueError at line 658 will propagate and cause a 500 error for the streaming request.

Since this is an async generator context, these unhandled exceptions will crash the stream for the client. Consider implementing robust parsing with error handling similar to _parse_browser_tool_call in harmony_utils.py.

try: parsed_args = json.loads(previous_item.content[0].text) except (json.JSONDecodeError, IndexError): return [] if function_name == "search": query = parsed_args.get("query") if not query: return [] action = response_function_web_search.ActionSearch( type="search", query=query, ) elif function_name == "open": action = response_function_web_search.ActionOpenPage( type="open_page", # TODO: translate to url url=f"cursor:{parsed_args.get('cursor', '')}", ) elif function_name == "find": pattern = parsed_args.get("pattern") if not pattern: return [] action = response_function_web_search.ActionFind( type="find", pattern=pattern, # TODO: translate to url url=f"cursor:{parsed_args.get('cursor', '')}", ) else: return []

This piece of code is copied over, I prefer to keep logic unchanged in this PR since it's for refactoring, but let me know if folks think otherwise.

sfeng33 · 2026-02-19T19:22:41Z

PTAL: @qandrew @daniel-salib

qandrew

lgtm, thanks! cc @houseroad can we add "ready" / automerge?

mgoin

LGTM just two nits. They can wait if the goal is to make this purely a movement PR

vllm/entrypoints/openai/responses/streaming_events.py

sfeng33 · 2026-02-20T00:37:17Z

Verified the entrypoints-integration-responses-api tests passed locally (test_mcp_tools, test_parsable_context, test_parsable_context). The other failed tests are unrelated.

…g_events.py (vllm-project#34909) Signed-off-by: sfeng33 <4florafeng@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>

…g_events.py (vllm-project#34909) Signed-off-by: sfeng33 <4florafeng@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk> Signed-off-by: Andrii Skliar <askliar@nvidia.com>

…g_events.py (vllm-project#34909) Signed-off-by: sfeng33 <4florafeng@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>

sfeng33 added 2 commits February 19, 2026 14:12

refactor

737e450

Signed-off-by: sfeng33 <4florafeng@gmail.com>

format

8945f2e

Signed-off-by: sfeng33 <4florafeng@gmail.com>

mergify bot added frontend gpt-oss Related to GPT-OSS models labels Feb 19, 2026

github-project-automation bot added this to gpt-oss Issues & Enhancements Feb 19, 2026

github-project-automation bot moved this to To Triage in gpt-oss Issues & Enhancements Feb 19, 2026

sfeng33 marked this pull request as ready for review February 19, 2026 19:21

sfeng33 requested review from DarkLight1337, aarnphm, chaunceyjiang and russellb as code owners February 19, 2026 19:21

gemini-code-assist bot reviewed Feb 19, 2026

View reviewed changes

Merge branch 'main' into streaming_parser

2cddcae

qandrew approved these changes Feb 19, 2026

View reviewed changes

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 19, 2026

mgoin approved these changes Feb 19, 2026

View reviewed changes

vllm/entrypoints/openai/responses/streaming_events.py Show resolved Hide resolved

vllm/entrypoints/openai/responses/streaming_events.py Show resolved Hide resolved

github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Feb 19, 2026

Merge branch 'main' into streaming_parser

9e401b5

DarkLight1337 enabled auto-merge (squash) February 20, 2026 04:15

vllm-bot merged commit ed31a02 into vllm-project:main Feb 20, 2026
47 of 50 checks passed

github-project-automation bot moved this from Ready to Done in gpt-oss Issues & Enhancements Feb 20, 2026

sfeng33 deleted the streaming_parser branch February 21, 2026 18:24

will-deines mentioned this pull request Mar 4, 2026

[Bugfix] Fix Harmony streaming cross-channel delta accumulation #36011

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Refactor] Extract Harmony streaming SSE event builders into streaming_events.py#34909

[Refactor] Extract Harmony streaming SSE event builders into streaming_events.py#34909
vllm-bot merged 4 commits intovllm-project:mainfrom
sfeng33:streaming_parser

sfeng33 commented Feb 19, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Feb 19, 2026

Uh oh!

sfeng33 Feb 19, 2026

Uh oh!

sfeng33 commented Feb 19, 2026

Uh oh!

qandrew left a comment

Uh oh!

mgoin left a comment

Uh oh!

Uh oh!

Uh oh!

sfeng33 commented Feb 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

sfeng33 commented Feb 19, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Future plan

Test Plan

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

sfeng33 Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

sfeng33 commented Feb 19, 2026

Uh oh!

qandrew left a comment

Choose a reason for hiding this comment

Uh oh!

mgoin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sfeng33 commented Feb 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

sfeng33 commented Feb 19, 2026 •

edited by github-actions bot

Loading