Fix hpu_model_runner based on (#20291) by xuechendi · Pull Request #4 · vllm-project/vllm-gaudi

xuechendi · 2025-07-03T16:59:38Z

vllm-project/vllm#20291

The upstream change updated scheduler_output with cache; this PR fixes the resulting failure in hpu_model_runner.

Signed-off-by: Chendi Xue <chendi.xue@intel.com>

Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>


Fix hpu_model_runner based on (#20291)

abf07c7

Signed-off-by: Chendi Xue <chendi.xue@intel.com>

xuechendi merged commit 35d46d0 into main Jul 3, 2025
1 check passed

kzawora-intel deleted the fix_20291 branch July 10, 2025 15:48

adobrzyn added a commit that referenced this pull request Sep 8, 2025

After review #4

4836ec3

Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>

skavulya pushed a commit to skavulya/vllm-gaudi that referenced this pull request Nov 11, 2025

remove loop from get_block_descs_ids (vllm-project#4)

05a2e6b

* remove loop from get_block_descs_ids
* remove extra code
* Update hpu_nixl_connector.py to simplify reshape

xuechendi merged 1 commit into main from fix_20291
