[ROCm] Fix broken import in platform attention backend dispatching #30432
gshtras merged 2 commits into vllm-project:main from
Conversation
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Code Review
This pull request fixes a broken import in the ROCm platform configuration by replacing a call to get_env_variable_attn_backend with direct environment variable checks. While this resolves the import issue, the new implementation for checking environment variables is inconsistent with how boolean flags are handled elsewhere in the codebase, which could lead to incorrect behavior. I've provided a suggestion to align the implementation with the project's standards.
vllm/platforms/rocm.py
Outdated
```python
os.environ.get("VLLM_ROCM_USE_AITER_UNIFIED_ATTENTION")
and os.environ.get("VLLM_ROCM_USE_AITER")
```
The direct use of os.environ.get() to check for these boolean environment variables is inconsistent with the established pattern in vllm.envs. This can lead to incorrect behavior. For instance, if a user sets VLLM_ROCM_USE_AITER=0 to disable it, os.environ.get() will return the string "0", which is truthy, causing the block size to be incorrectly set to 64.
To ensure consistent and correct behavior, you should use the vllm.envs module, which is already imported in this file. This module correctly parses these environment variables as booleans.
```diff
-os.environ.get("VLLM_ROCM_USE_AITER_UNIFIED_ATTENTION")
-and os.environ.get("VLLM_ROCM_USE_AITER")
+envs.VLLM_ROCM_USE_AITER_UNIFIED_ATTENTION
+and envs.VLLM_ROCM_USE_AITER
```
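The truthiness pitfall the bot describes can be reproduced in isolation. This is a minimal sketch; the parsing expression mirrors the boolean-flag pattern used in `vllm/envs.py`, but treat the exact set of accepted "true" spellings as an assumption:

```python
import os

# A user sets the flag to "0", intending "disabled".
os.environ["VLLM_ROCM_USE_AITER"] = "0"

# Raw os.environ.get() returns the string "0", which is truthy,
# so a plain `if` check would wrongly treat the flag as enabled.
raw = os.environ.get("VLLM_ROCM_USE_AITER")
print(bool(raw))  # True

# Parsing in the style of vllm.envs: compare the lowered string
# against accepted true-spellings (assumed: "true" and "1").
parsed = os.environ.get("VLLM_ROCM_USE_AITER", "False").lower() in ("true", "1")
print(parsed)  # False -- "0" is correctly treated as disabled
```

This is exactly why `VLLM_ROCM_USE_AITER=0` would incorrectly trigger the block-size-64 path under the `os.environ.get()` version.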
This is out of scope for this PR; we can address it in a future PR. For now, the purpose of this PR is to resolve an import error and deprecate get_env_variable_attn_backend.
No, the bot is right, don't use os.environ
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…llm-project#30432) Signed-off-by: Andreas Karatzas <akaratza@amd.com> Signed-off-by: Ubuntu <mjtaheri68@gmail.com>
…llm-project#30432) Signed-off-by: Andreas Karatzas <akaratza@amd.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
Summary
Removes the broken dependency on `get_env_variable_attn_backend` from `vllm.attention.selector` in the ROCm platform configuration.
Problem
The import `from vllm.attention.selector import get_env_variable_attn_backend` was causing failures on ROCm. It was used to check for `ROCM_AITER_UNIFIED_ATTN` backend selection when setting the KV cache block size.
Fix
Deprecates the `get_env_variable_attn_backend` check and relies directly on the environment variables (`VLLM_ROCM_USE_AITER_UNIFIED_ATTENTION`, `VLLM_ROCM_USE_AITER`) for block size configuration. Tracking upstream PR #30396 for the proper way to handle attention backend selection going forward.
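The fix described above amounts to gating the block size on the two env flags instead of calling into the attention selector. The sketch below is illustrative only: the helper names (`_env_flag`, `pick_kv_cache_block_size`) and the default block size are assumptions, and the flag parsing follows the `vllm.envs` style discussed in the review rather than raw `os.environ.get()`:

```python
import os

def _env_flag(name: str) -> bool:
    # Boolean parsing in the style of vllm.envs (assumed accepted
    # true-spellings: "true" and "1", case-insensitive).
    return os.environ.get(name, "False").lower() in ("true", "1")

def pick_kv_cache_block_size(default: int = 16) -> int:
    # Hypothetical helper: per the diff, the AITER unified-attention
    # path on ROCm uses a 64-token KV cache block size.
    if (_env_flag("VLLM_ROCM_USE_AITER_UNIFIED_ATTENTION")
            and _env_flag("VLLM_ROCM_USE_AITER")):
        return 64
    return default

os.environ["VLLM_ROCM_USE_AITER"] = "1"
os.environ["VLLM_ROCM_USE_AITER_UNIFIED_ATTENTION"] = "1"
print(pick_kv_cache_block_size())  # 64

os.environ["VLLM_ROCM_USE_AITER"] = "0"
print(pick_kv_cache_block_size())  # 16 -- "0" parses as disabled
```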
Testing
Verified ROCm platform initializes correctly without import errors.