Enable support for prefill side kv_layout and block_size update by yeonsily · Pull Request #867 · vllm-project/vllm-gaudi

yeonsily · 2026-01-22T21:30:57Z

update example to support prefill HND and agreed_block_size
enable prefill side kv_layout and block_size update

Port vllm-project/vllm#30448 to vllm-gaudi

enable support for prefill side kv_layout and block_size update 1. update example to support prefill HND and agreed_block_size 2. enable prefill side kv_layout and block_size update Signed-off-by: Chendi Xue <chendi.xue@intel.com> Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com>

xuechendi · 2026-01-22T22:07:17Z

@yeonsily , I would suggest create a new nixl_connector file only for Gaudi to CUDA scenario instead of override current hpu_nixl_connector.py one.
The reason is upstream change is too fast, override so many function will fail in no time

Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com>

github-actions · 2026-01-24T01:14:49Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

xuechendi · 2026-01-24T01:23:39Z

Did you missed the switch for flag check and which nixl_connector.py to import

yeonsily · 2026-01-26T17:05:29Z

@xuechendi The change in hpu_nixl_connector is common regardless of heterogeneous run. And the ones in hetero_hpu_nixl_connector is extra for hetero.

xuechendi · 2026-01-26T23:34:32Z

@xuechendi The change in hpu_nixl_connector is common regardless of heterogeneous run. And the ones in hetero_hpu_nixl_connector is extra for hetero.

I see how to use VLLM_HPU_HETERO_KV_LAYOUT, was thinking to simply use VLLM_HPU_HETERO_KV_LAYOUT which file to import. But your approach also works

yeonsily · 2026-01-26T23:54:23Z

@xuechendi @michalkuligowski Thank you for your review! I see CI is failed but don't think it's from my side as my change won't trigger without the flag. It seems CI is broken now. Is there any way to re-trigger CI without commit any change?

Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com>

github-actions · 2026-01-27T22:48:09Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
6218034dd7f9a56596e4fd8c8c8fc1d8011ed9c2

…-project#867) 1. update example to support prefill HND and agreed_block_size 2. enable prefill side kv_layout and block_size update Port vllm-project/vllm#30448 to vllm-gaudi --------- Signed-off-by: Chendi Xue <chendi.xue@intel.com> Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com> Signed-off-by: Wang, Zheng W <zheng.w.wang@intel.com>

…-project#867) 1. update example to support prefill HND and agreed_block_size 2. enable prefill side kv_layout and block_size update Port vllm-project/vllm#30448 to vllm-gaudi --------- Signed-off-by: Chendi Xue <chendi.xue@intel.com> Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com>

…-project#867) 1. update example to support prefill HND and agreed_block_size 2. enable prefill side kv_layout and block_size update Port vllm-project/vllm#30448 to vllm-gaudi --------- Signed-off-by: Chendi Xue <chendi.xue@intel.com> Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com> Signed-off-by: slokesha <slokeshappa@habana.ai>

1. update example to support prefill HND and agreed_block_size 2. enable prefill side kv_layout and block_size update Port vllm-project/vllm#30448 to vllm-gaudi --------- Signed-off-by: Chendi Xue <chendi.xue@intel.com> Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com>

yeonsily requested review from adobrzyn, afierka-intel, iboiko-habana, kamil-kaczor, ksmusz, kzawora-intel, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners January 22, 2026 21:30

yeonsily mentioned this pull request Jan 22, 2026

enable support for prefill side kv_layout and block_size update #859

Closed

github-actions Bot mentioned this pull request Jan 22, 2026

🚦 Team Review Dashboard #701

Open

Move hetero specific nixl changes to a new file

e30b2db

Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com>

michalkuligowski approved these changes Jan 26, 2026

View reviewed changes

Merge branch 'main' into dev/prefill_kv_layout

ee0a709

xuechendi approved these changes Jan 26, 2026

View reviewed changes

yeonsily closed this Jan 27, 2026

yeonsily reopened this Jan 27, 2026

Move condition check to __init__ to fix CI failures.

6c9b769

Signed-off-by: Yeonsil Yoon <yeon.sil.yoon@intel.com>

xuechendi merged commit d54f4c2 into vllm-project:main Jan 28, 2026
53 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable support for prefill side kv_layout and block_size update#867

Enable support for prefill side kv_layout and block_size update#867
xuechendi merged 4 commits intovllm-project:mainfrom
yeonsily:dev/prefill_kv_layout

yeonsily commented Jan 22, 2026

Uh oh!

xuechendi commented Jan 22, 2026

Uh oh!

github-actions Bot commented Jan 24, 2026

Uh oh!

xuechendi commented Jan 24, 2026

Uh oh!

yeonsily commented Jan 26, 2026

Uh oh!

xuechendi commented Jan 26, 2026

Uh oh!

yeonsily commented Jan 26, 2026

Uh oh!

github-actions Bot commented Jan 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yeonsily commented Jan 22, 2026

Uh oh!

xuechendi commented Jan 22, 2026

Uh oh!

github-actions Bot commented Jan 24, 2026

🚧 CI Blocked

Uh oh!

xuechendi commented Jan 24, 2026

Uh oh!

yeonsily commented Jan 26, 2026

Uh oh!

xuechendi commented Jan 26, 2026

Uh oh!

yeonsily commented Jan 26, 2026

Uh oh!

github-actions Bot commented Jan 27, 2026

✅ CI Passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants