[ROCm] Fix KV copy methods and auto-select attention backend for ROCm by AndreasKaratzas · Pull Request #36845 · vllm-project/vllm

AndreasKaratzas · 2026-03-12T04:29:58Z

Added insert_blocks_to_device and swap_out_blocks_to_host to RocmPlatform. These were only defined on CudaPlatform, causing a TypeError: 'NoneType' object is not callable crash when NixlConnector tried to copy KV blocks between GPU and CPU buffers during prefill/decode disaggregation on ROCm.
Updated spec_decode_acceptance_test.sh to auto-select the attention backend based on the detected GPU platform: TRITON_ATTN on ROCm, FLASH_ATTN on NVIDIA. Previously the script hardcoded FLASH_ATTN regardless of platform. The backend can still be overridden via ATTENTION_BACKEND=<value>.

cc @kenroche

Signed-off-by: Andreas Karatzas <akaratza@amd.com>

DarkLight1337 · 2026-03-13T07:53:58Z

cc @tjtanaa

tjtanaa

LGTM

…vllm-project#36845) Signed-off-by: Andreas Karatzas <akaratza@amd.com>

[ROCm] Fix KV copy methods and auto-select attention backend for ROCm

b163004

Signed-off-by: Andreas Karatzas <akaratza@amd.com>

AndreasKaratzas requested review from ApostaC, orozery and tjtanaa as code owners March 12, 2026 04:29

mergify bot added the rocm Related to AMD ROCm label Mar 12, 2026

github-project-automation bot moved this to Todo in AMD Mar 12, 2026

github-project-automation bot added this to AMD Mar 12, 2026

mergify bot added v1 kv-connector labels Mar 12, 2026

AndreasKaratzas added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 12, 2026

Merge remote-tracking branch 'origin/main' into akaratza_fix_spec_dec

f5752d2

tjtanaa approved these changes Mar 16, 2026

View reviewed changes

tjtanaa merged commit 911355e into vllm-project:main Mar 16, 2026
49 checks passed

github-project-automation bot moved this from Todo to Done in AMD Mar 16, 2026

AndreasKaratzas deleted the akaratza_fix_spec_dec branch March 16, 2026 16:14

Lucaskabela pushed a commit to Lucaskabela/vllm that referenced this pull request Mar 17, 2026

[ROCm] Fix KV copy methods and auto-select attention backend for ROCm (…

a9be222

…vllm-project#36845) Signed-off-by: Andreas Karatzas <akaratza@amd.com>

wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026

[ROCm] Fix KV copy methods and auto-select attention backend for ROCm (…

fef831a

…vllm-project#36845) Signed-off-by: Andreas Karatzas <akaratza@amd.com>

fxdawnn pushed a commit to fxdawnn/vllm that referenced this pull request Mar 19, 2026

[ROCm] Fix KV copy methods and auto-select attention backend for ROCm (…

e050452

…vllm-project#36845) Signed-off-by: Andreas Karatzas <akaratza@amd.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ROCm] Fix KV copy methods and auto-select attention backend for ROCm#36845

[ROCm] Fix KV copy methods and auto-select attention backend for ROCm#36845
tjtanaa merged 2 commits intovllm-project:mainfrom
ROCm:akaratza_fix_spec_dec

AndreasKaratzas commented Mar 12, 2026 •

edited by github-actions bot

Loading

Uh oh!

DarkLight1337 commented Mar 13, 2026

Uh oh!

tjtanaa left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

AndreasKaratzas commented Mar 12, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DarkLight1337 commented Mar 13, 2026

Uh oh!

tjtanaa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AndreasKaratzas commented Mar 12, 2026 •

edited by github-actions bot

Loading