[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector by kfirtoledo · Pull Request #30761 · vllm-project/vllm

kfirtoledo · 2025-12-16T08:59:31Z

Enable MultiConnector to use the new continuous cross-layer KV cache layout
described in RFC #27742 and implemented in #27743.
The MultiConnector aggregates cross-layer support across all underlying connectors
and returns true only if all connectors support it.

chatgpt-codex-connector · 2025-12-16T08:59:37Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

gemini-code-assist

Code Review

This pull request enables the MultiConnector to support the new continuous cross-layer KV cache layout. The changes correctly propagate the prefer_cross_layer_blocks property and the register_cross_layers_kv_cache method to the underlying connectors. My review identifies a potential edge case in the implementation of prefer_cross_layer_blocks where it could return an incorrect value if no connectors are configured, which could lead to unexpected behavior. I've provided a suggestion to make the implementation more robust.

vllm/distributed/kv_transfer/kv_connector/v1/multi_connector.py

github-actions · 2025-12-16T09:33:12Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors.

You ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

mergify · 2025-12-16T09:33:31Z

Hi @kfirtoledo, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?

mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:

# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint

orozery · 2025-12-16T16:46:44Z

@NickLucche per your suggestion here we defined
the prefer_cross_layer_blocks as a ClassVar.
This does not work well for the MultiConnector, as its prefer_cross_layer_blocks can only be determined at runtime.
I think we should change this ClassVar to a @property. WDYT?

NickLucche

Thanks for doing this @kfirtoledo!
And yes the property change is fine here @orozery , given this feature was just introduced.

Would you mind adding a small unit test to keep this feature in check? It looks trivial, but we had similar issues with MC before..
Other than that this is lgtm.

kfirtoledo · 2025-12-18T08:09:10Z

@NickLucche and @orozery, I added the unit tests, PTAL

vllm/distributed/kv_transfer/kv_connector/v1/multi_connector.py

vllm/distributed/kv_transfer/kv_connector/v1/base.py

mergify · 2025-12-30T12:55:11Z

Hi @kfirtoledo, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?

mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:

# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint

Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com>

orozery

lgtm. Thanks!

NickLucche

Thanks @kfirtoledo !

…vllm-project#30761) Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com>

…vllm-project#30761) Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

…vllm-project#30761) Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com>

kfirtoledo requested review from ApostaC and NickLucche as code owners December 16, 2025 08:59

mergify bot added the kv-connector label Dec 16, 2025

gemini-code-assist bot reviewed Dec 16, 2025

View reviewed changes

vllm/distributed/kv_transfer/kv_connector/v1/multi_connector.py Outdated Show resolved Hide resolved

kfirtoledo force-pushed the multi-connector branch from 83fdf01 to cd61201 Compare December 17, 2025 06:20

NickLucche reviewed Dec 17, 2025

View reviewed changes

kfirtoledo force-pushed the multi-connector branch from cd61201 to 517698a Compare December 18, 2025 07:41

mergify bot added the v1 label Dec 18, 2025

kfirtoledo force-pushed the multi-connector branch from 517698a to c842eee Compare December 18, 2025 07:55

orozery requested changes Dec 22, 2025

View reviewed changes

vllm/distributed/kv_transfer/kv_connector/v1/multi_connector.py Outdated Show resolved Hide resolved

vllm/distributed/kv_transfer/kv_connector/v1/base.py Show resolved Hide resolved

kfirtoledo force-pushed the multi-connector branch from c842eee to 2f7a452 Compare December 30, 2025 11:39

[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector

288e3a1

Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com>

kfirtoledo force-pushed the multi-connector branch from 2f7a452 to 288e3a1 Compare December 30, 2025 13:05

orozery approved these changes Dec 30, 2025

View reviewed changes

NickLucche approved these changes Jan 7, 2026

View reviewed changes

NickLucche enabled auto-merge (squash) January 7, 2026 14:45

Merge branch 'main' into multi-connector

80ad94b

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 7, 2026

NickLucche merged commit b89443b into vllm-project:main Jan 7, 2026
50 checks passed

yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026

[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector (…

3351f8c

…vllm-project#30761) Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com>

akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026

[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector (…

502b8bf

…vllm-project#30761) Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com>

ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026

[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector (…

1291bed

…vllm-project#30761) Signed-off-by: Kfir Toledo <kfir.toledo@ibm.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector#30761

[KVConnector]: Enable Cross-layers KV cache layout for MultiConnector#30761
NickLucche merged 2 commits intovllm-project:mainfrom
kfirtoledo:multi-connector

kfirtoledo commented Dec 16, 2025 •

edited by github-actions bot

Loading

Uh oh!

chatgpt-codex-connector bot commented Dec 16, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

github-actions bot commented Dec 16, 2025

Uh oh!

mergify bot commented Dec 16, 2025

Uh oh!

orozery commented Dec 16, 2025

Uh oh!

NickLucche left a comment

Uh oh!

kfirtoledo commented Dec 18, 2025

Uh oh!

Uh oh!

Uh oh!

mergify bot commented Dec 30, 2025

Uh oh!

orozery left a comment

Uh oh!

NickLucche left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

kfirtoledo commented Dec 16, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector bot commented Dec 16, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

github-actions bot commented Dec 16, 2025

Uh oh!

mergify bot commented Dec 16, 2025

Uh oh!

orozery commented Dec 16, 2025

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

kfirtoledo commented Dec 18, 2025

Uh oh!

Uh oh!

Uh oh!

mergify bot commented Dec 30, 2025

Uh oh!

orozery left a comment

Choose a reason for hiding this comment

Uh oh!

NickLucche left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

kfirtoledo commented Dec 16, 2025 •

edited by github-actions bot

Loading