[kv_offload+HMA][9/N]: Support lookup with multiple KV groups by orozery · Pull Request #39401 · vllm-project/vllm

orozery · 2026-04-09T09:44:09Z

This PR extends the offloading connector to support lookups (get_num_new_matched_tokens) where KVCacheConfig contains multiple groups.

Currently, only full attention groups are supported.

gemini-code-assist

Code Review

This pull request refactors the offloading scheduler in vllm/distributed/kv_transfer/kv_connector/v1/offloading/scheduler.py to support multiple KV cache groups. It initializes lookup_groups in the constructor and modifies the get_num_new_matched_tokens method to iterate through these groups, performing individual block lookups and handling request deferral or delay based on the state of blocks within each group. There are no review comments to address.
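The per-group lookup described in this summary can be illustrated with a small, self-contained sketch. It is not the code from this PR. `LookupGroup`, `ToyOffloadStore`, and `num_new_matched_tokens` are hypothetical names invented for illustration. Two behaviours are assumptions rather than facts from the diff: the usable prefix is taken as the minimum hit length across groups, and deferral is modelled by returning `None`.

```python
from dataclasses import dataclass
from typing import Dict, Hashable, Iterable, Optional, Sequence, Tuple


@dataclass(frozen=True)
class LookupGroup:
    """Hypothetical per-group lookup descriptor (not vLLM's actual class)."""
    group_idx: int
    block_size: int  # tokens per block in this KV cache group


class ToyOffloadStore:
    """In-memory stand-in for an offloading backend (illustrative only)."""

    def __init__(self,
                 stored: Iterable[Tuple[int, Hashable]],
                 in_flight: Iterable[Tuple[int, Hashable]] = ()):
        self.stored = set(stored)        # blocks fully written
        self.in_flight = set(in_flight)  # blocks still being written

    def lookup(self, group_idx: int,
               block_hashes: Sequence[Hashable]) -> Optional[int]:
        """Count consecutive leading hits; None if a block is still in flight."""
        hits = 0
        for block_hash in block_hashes:
            key = (group_idx, block_hash)
            if key in self.in_flight:
                return None  # caller should retry this request later
            if key not in self.stored:
                break
            hits += 1
        return hits


def num_new_matched_tokens(groups: Sequence[LookupGroup],
                           hashes_per_group: Dict[int, Sequence[Hashable]],
                           store: ToyOffloadStore,
                           num_computed_tokens: int) -> Optional[int]:
    """Return extra tokens loadable from the store, or None to defer."""
    matched: Optional[int] = None
    for group in groups:
        hits = store.lookup(group.group_idx, hashes_per_group[group.group_idx])
        if hits is None:
            return None  # any group with in-flight blocks defers the request
        tokens = hits * group.block_size
        # Assumption: a prefix is usable only if every group has it.
        matched = tokens if matched is None else min(matched, tokens)
    if matched is None:
        return 0  # no groups configured
    return max(0, matched - num_computed_tokens)
```

In this sketch a request gets back only the prefix that every group can serve, and a single in-flight block in any group postpones the whole lookup. The scheduler in the PR may resolve these cases differently.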

This commit extends the offloading connector to support lookups (get_num_new_matched_tokens) where KVCacheConfig contains multiple groups.

Signed-off-by: Or Ozeri <oro@il.ibm.com>

orozery requested review from ApostaC and NickLucche as code owners April 9, 2026 09:44

mergify bot added the kv-connector label Apr 9, 2026

gemini-code-assist bot reviewed Apr 9, 2026

panpan0000 mentioned this pull request Apr 14, 2026: Introduce De-dup/Similarity-Check in CI Workflow for PR/Issue #39695 (open, 5 tasks)

orozery force-pushed the kv-offload-lookup-multiple-groups branch from 09cca7b to 587ba2a April 20, 2026 07:27

orozery requested a review from xuechendi as a code owner April 20, 2026 07:27

[kv_offload+HMA][9/N]: Support lookup with multiple KV groups

56f4df9

orozery force-pushed the kv-offload-lookup-multiple-groups branch from 587ba2a to 56f4df9 April 20, 2026 11:05

orozery added the ready label (ONLY add when PR is ready to merge/full CI is needed) Apr 20, 2026

orozery wants to merge 1 commit into vllm-project:main from orozery:kv-offload-lookup-multiple-groups
