Skip to content

[DOC][ROCm]: Add attention backend guide#29400

Draft
vllmellm wants to merge 7 commits intovllm-project:mainfrom
EmbeddedLLM:docs/rocm-attention-backends
Draft

[DOC][ROCm]: Add attention backend guide#29400
vllmellm wants to merge 7 commits intovllm-project:mainfrom
EmbeddedLLM:docs/rocm-attention-backends

Conversation

@vllmellm
Copy link
Copy Markdown
Contributor

@vllmellm vllmellm commented Nov 25, 2025

Purpose

This PR adds a new documentation page attention_backends.md under the "Features" section.

It provides a comprehensive guide on:

  • The differences between available ROCm attention backends (TRITON_ATTN, ROCM_ATTN, ROCM_AITER_FA, etc.).
  • How to use the VLLM_ATTENTION_BACKEND environment variable to select specific backends.
  • Backend selection priority logic.
  • Configuration for DeepSeek MLA models on ROCm.

This documentation corresponds to the changes and features solidified in PR #26980.

Test Plan

  • Built the documentation locally using mkdocs serve.
  • Verified the rendering of tables, code blocks, and admonitions.

Test Result

The documentation renders correctly.

Preview:
Screenshot 2025-11-25 at 20 26 15

Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
@mergify
Copy link
Copy Markdown
Contributor

mergify bot commented Nov 25, 2025

Documentation preview: https://vllm--29400.org.readthedocs.build/en/29400/

@mergify mergify bot added documentation Improvements or additions to documentation rocm Related to AMD ROCm labels Nov 25, 2025
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
@vllmellm vllmellm changed the title [DOC][ROCm]: Add ROCm attention backend guide [DOC][ROCm]: Add attention backend guide Nov 25, 2025
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation rocm Related to AMD ROCm

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant