[HiCache] return cached_tokens_details in sglext for streaming responses#22055

Open
vladnosiv wants to merge 1 commit into sgl-project:main from vladnosiv:fix-cache-hit-breakdown-chat-streaming
Conversation

@vladnosiv
Contributor

Motivation

Previous PRs: #17648 & #21764

sglext.cached_tokens_details is returned correctly in non-streaming chat/completions responses, but is silently dropped in streaming mode. The backend populates cached_tokens_details in meta_info for every request, and the streaming loop already collects it, but it was never extracted and emitted in the response.
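A minimal sketch of the fix described above: extract cached_tokens_details from the request's meta_info and attach it to the usage payload of the final streaming chunk, mirroring what the non-streaming path already does. The helper name and the exact dict shapes here are illustrative assumptions, not the actual SGLang internals.

```python
# Hypothetical helper illustrating the streaming-path fix: copy
# cached_tokens_details from meta_info into the usage payload under the
# sglext extension field, as non-streaming responses already do.
# The field names and nesting are assumptions for illustration.

def attach_cached_tokens_details(usage: dict, meta_info: dict) -> dict:
    """Copy cached_tokens_details from meta_info into the usage payload."""
    details = meta_info.get("cached_tokens_details")
    if details is not None:
        # Emit under the sglext extension field on the final usage chunk.
        usage.setdefault("sglext", {})["cached_tokens_details"] = details
    return usage

# Example: usage of the final streaming chunk before and after the fix.
meta_info = {"cached_tokens": 128, "cached_tokens_details": {"l1": 96, "l2": 32}}
usage = {"prompt_tokens": 200, "completion_tokens": 50}
usage = attach_cached_tokens_details(usage, meta_info)
print(usage["sglext"]["cached_tokens_details"])  # {'l1': 96, 'l2': 32}
```

With a helper like this, the streaming loop can call it once when building the last chunk, so the extension data is consolidated into a single response chunk rather than being dropped.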

Signed-off-by: Vladislav Nosivskoy <vladnosiv@gmail.com>

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces support for cached token details in OpenAI-compatible chat and completion streams, consolidating extension data into a single response chunk. It also adds a utility function for processing cached token metadata. Review feedback suggests simplifying the conditional logic for assigning routed experts to improve readability.

Collaborator

@huangtingwei9988 huangtingwei9988 left a comment

LGTM
