[Feature] Keep HMA enabled for supported KV connectors #41644
arpera wants to merge 1 commit into vllm-project:main
Conversation
Signed-off-by: Artem Perevedentsev <aperevedents@nvidia.com>
@vadiklyutiy, please have a look.

cc @NickLucche

@orozery, could you please explain the issue with the multi connector a bit more?
See discussion in #39571. |
NickLucche left a comment
Thanks for contributing @arpera.
Let's chat before changing current hma opt-in policy.
Sure, I am open to discussion. If there are any concerns about this change, please feel free to discuss them here.
Motivation
vLLM currently disables the hybrid KV cache manager by default whenever `kv_transfer_config` is set, unless the user explicitly passes `--no-disable-hybrid-kv-cache-manager`. That is conservative for connectors that do not support HMA, but it also disables HMA for connectors like `NixlConnector` that already advertise `SupportsHMA`. This change preserves the conservative default for unsupported connectors while allowing HMA to stay enabled by default when the selected connector explicitly supports it.
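The decision described above can be sketched as follows. This is an illustrative sketch only, not vLLM's actual code: the helper `should_disable_hma`, the `KVTransferConfig` stand-in, and the `HMA_SUPPORTED_CONNECTORS` set are hypothetical names; the real implementation checks the connector class for the `SupportsHMA` interface.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class KVTransferConfig:
    # Hypothetical stand-in for vLLM's KV transfer config.
    kv_connector: str


# Connectors assumed here to declare HMA support (e.g. NixlConnector,
# which advertises SupportsHMA). Illustrative set, not the real registry.
HMA_SUPPORTED_CONNECTORS = {"NixlConnector"}


def should_disable_hma(
    kv_transfer_config: Optional[KVTransferConfig],
    user_override: Optional[bool],
) -> bool:
    """Return True if the hybrid KV cache manager should be disabled."""
    if user_override is not None:
        # An explicit flag such as --no-disable-hybrid-kv-cache-manager
        # always wins over the default policy.
        return user_override
    if kv_transfer_config is None:
        # No KV transfer configured: HMA stays enabled.
        return False
    # New behavior: keep the conservative default (disable HMA) only for
    # connectors that do not declare HMA support.
    return kv_transfer_config.kv_connector not in HMA_SUPPORTED_CONNECTORS
```

With this policy, a supported connector keeps HMA enabled by default, an unsupported one still disables it, and the explicit CLI flag overrides both cases.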
Test Results
Hardware: 4xGB200
Functional

`python -m pytest tests/v1/core/test_kv_cache_utils.py::test_hma_not_disabled_for_supported_kv_connector tests/v1/core/test_kv_cache_utils.py::test_hma_disabled_for_unsupported_kv_connector -v` — passed.

Performance
Not measured. This only changes the default HMA decision for KV connectors that already declare HMA support.
Essential Elements of an Effective PR Description Checklist
`supported_models.md` and `examples` for a new model.