fix: hicache with eagle didn't manage the draft model's kv cache. by cicirori · Pull Request #17338 · sgl-project/sglang

cicirori · 2026-01-19T07:58:41Z

Motivation

When both speculative decoding and HiCache are enabled, the draft model KV cache is not managed by the HiRadix cache.

When the original draft model KV cache is evicted, but the target model hits HiCache, the draft model ends up drafting based on a random KV cache.

Modifications

Add dedicated L2 HiCache support for the draft model KV cache.

Accuracy Tests

Before the fix
When HiCache was triggered, the acceptance length dropped for the same request.
After the fix
The acceptance length stays almost the same.

see #16964

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
After green CI and required approvals, ask Merge Oncalls to merge.

gemini-code-assist · 2026-01-19T07:58:46Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

cicirori · 2026-01-19T08:02:06Z

/tag-run-ci-label

cicirori · 2026-01-19T08:03:38Z

/tag-and-rerun-ci

xiezhq-hermann · 2026-01-27T02:04:13Z

@ispobock should the draft model just re-compute when the cache is already evicted for the draft model?

hnyls2002 · 2026-02-12T03:24:13Z

/rerun-stage stage-b-test-large-1-gpu

cicirori · 2026-02-15T09:38:33Z

/rerun-stage stage-b-test-large-1-gpu

github-actions · 2026-02-15T09:38:56Z

✅ Triggered stage-b-test-large-1-gpu to run independently (skipping dependencies).

github-actions · 2026-02-15T09:39:02Z

🔗 View workflow run

cicirori added 3 commits January 18, 2026 23:42

let hicache manage draft model's kvcache as well

2158e25

support spec v2

bb07c22

refine

c1f6a05

cicirori requested review from Ying1123, hnyls2002, merrymercy and xiezhq-hermann as code owners January 19, 2026 07:58

cicirori added the run-ci label Jan 19, 2026

cicirori changed the title ~~fix: eagle with hicache didn't manage the draft model's kv cache.~~ fix: hicache with eagle didn't manage the draft model's kv cache. Jan 19, 2026

zhyncs assigned xiezhq-hermann Jan 19, 2026

zhyncs added bug Something isn't working hicache Hierarchical Caching for SGLang speculative-decoding labels Jan 19, 2026

xiezhq-hermann added the high priority label Jan 27, 2026

xiezhq-hermann assigned ispobock Jan 27, 2026

alphabetc1 mentioned this pull request Jan 27, 2026

[HiCache][WIP] support spec decode+hicache storage #17776

Closed

5 tasks

cicirori added 2 commits February 12, 2026 22:22

Merge branch 'main' into fix_hicache_with_spec

a5df22e

Merge branch 'main' into fix_hicache_with_spec

779f3b1

cicirori closed this Mar 5, 2026

alphabetc1 mentioned this pull request Mar 23, 2026

[HiCache] feat: add draft KV cache backing for L2/L3 #21125

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: hicache with eagle didn't manage the draft model's kv cache.#17338

fix: hicache with eagle didn't manage the draft model's kv cache.#17338
cicirori wants to merge 5 commits intomainfrom
fix_hicache_with_spec

cicirori commented Jan 19, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented Jan 19, 2026

Uh oh!

cicirori commented Jan 19, 2026

Uh oh!

cicirori commented Jan 19, 2026

Uh oh!

xiezhq-hermann commented Jan 27, 2026

Uh oh!

hnyls2002 commented Feb 12, 2026

Uh oh!

cicirori commented Feb 15, 2026

Uh oh!

github-actions Bot commented Feb 15, 2026

Uh oh!

github-actions Bot commented Feb 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

cicirori commented Jan 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

Uh oh!

gemini-code-assist Bot commented Jan 19, 2026

Uh oh!

cicirori commented Jan 19, 2026

Uh oh!

cicirori commented Jan 19, 2026

Uh oh!

xiezhq-hermann commented Jan 27, 2026

Uh oh!

hnyls2002 commented Feb 12, 2026

Uh oh!

cicirori commented Feb 15, 2026

Uh oh!

github-actions Bot commented Feb 15, 2026

Uh oh!

github-actions Bot commented Feb 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

cicirori commented Jan 19, 2026 •

edited

Loading