Skip to content

Fix hpu_model_runner based on (#20291)#4

Merged
xuechendi merged 1 commit intomainfrom
fix_20291
Jul 3, 2025
Merged

Fix hpu_model_runner based on (#20291)#4
xuechendi merged 1 commit intomainfrom
fix_20291

Conversation

@xuechendi
Copy link
Copy Markdown
Collaborator

vllm-project/vllm#20291

updated scheduler_output with cache, this PR will fix the failing on hpu_model_runner

Signed-off-by: Chendi Xue <chendi.xue@intel.com>
@xuechendi xuechendi merged commit 35d46d0 into main Jul 3, 2025
1 check passed
@kzawora-intel kzawora-intel deleted the fix_20291 branch July 10, 2025 15:48
adobrzyn added a commit that referenced this pull request Sep 8, 2025
Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>
skavulya pushed a commit to skavulya/vllm-gaudi that referenced this pull request Nov 11, 2025
* remove loop from get_block_descs_ids

* remove extra code

* Update hpu_nixl_connector.py to simplify reshape
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant