[NIXL][1/N] Refactor `kernel_block_size` detection by NickLucche · Pull Request #35752 · vllm-project/vllm

NickLucche · 2026-03-02T14:43:39Z

This PR is based on top #32204, hence the latter must be merged before the former.

This PR is a small refactor/cleanup of the register_kv_cache main loop (which is quite dense), utilizing the KVCacheConfig (now available after the HMA PR) aimed at simplifying code logic.
In fact, there's no need to wait until iteration over kv cache tensors to figure out which kernel block size was selected by the backend or how many blocks the kv cache has.

This is also an attempt at breaking up hybrid SSM support here #34727 into smaller, more easily reviewable PRs.

Signed-off-by: NickLucche <nlucches@redhat.com>

gemini-code-assist

Code Review

This pull request introduces a significant refactoring to the NIXL connector to support Hybrid Memory Allocator (HMA) and improve kernel block size detection. The changes are extensive, modifying core connector logic, utility functions, and tests to handle multiple KV cache groups, which is essential for HMA. Key additions include the BlockIds type alias, the _sync_block_size_with_kernel method for managing logical and physical block sizes, and updates to many components to be HMA-aware. The test suite has been appropriately expanded with HMA-specific tests and existing tests have been adapted. The overall changes are well-structured and appear correct. I've identified one area for improvement in a test script concerning code duplication.

gemini-code-assist · 2026-03-02T14:49:12Z

tests/v1/kv_connector/nixl_integration/run_accuracy_test.sh

+    # Add HMA flag if specified
+    if [[ -n "$ENABLE_HMA_VAR" ]]; then
+      BASE_CMD="${BASE_CMD} $ENABLE_HMA_VAR"
+    fi


This block of code to add the HMA flag is duplicated for the decode instances loop (lines 220-223). To improve maintainability and reduce redundancy, consider refactoring this logic into a helper function or applying it once to avoid having the same logic in two places.

NickLucche added 2 commits February 27, 2026 11:48

HMA+NIXL

421c35a

Signed-off-by: NickLucche <nlucches@redhat.com>

init

4b9f987

Signed-off-by: NickLucche <nlucches@redhat.com>

mergify bot added v1 kv-connector labels Mar 2, 2026

gemini-code-assist bot reviewed Mar 2, 2026

View reviewed changes

NickLucche changed the title ~~[NIXL] Refactor kernel_block_size detection~~ [NIXL][1/N] Refactor kernel_block_size detection Mar 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[NIXL][1/N] Refactor `kernel_block_size` detection#35752

[NIXL][1/N] Refactor `kernel_block_size` detection#35752
NickLucche wants to merge 2 commits intovllm-project:mainfrom
NickLucche:minor-refactor-register-kv-cache

NickLucche commented Mar 2, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

NickLucche commented Mar 2, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

NickLucche commented Mar 2, 2026 •

edited by github-actions bot

Loading