[Bugfix][ROCm][GPT-OSS] Use old triton_kernels implementation on ROCm if the new API is not available #34153

Merged: gshtras merged 3 commits into vllm-project:main from ROCm:rocm_triton_kernels_fallback on Feb 9, 2026

@gshtras gshtras (Collaborator) commented Feb 9, 2026

Follow-up to #30525.

The ROCm version of triton_kernels does not provide SparseMatrix or make_ragged_tensor_metadata.

Until we can move to the updated version, we fall back to the old implementation on ROCm to preserve GPT-OSS support.
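The availability check described above can be sketched roughly as follows. This is an illustration only, not the code from this PR: `has_symbols` and `use_legacy_impl` are hypothetical helper names, and the PR text does not say which triton_kernels module is expected to export SparseMatrix and make_ragged_tensor_metadata, so the module name is left as a caller-supplied argument.

```python
import importlib


def has_symbols(module_name: str, *names: str) -> bool:
    """Return True only if the module imports and exposes every given name.

    Hypothetical helper: a caller would pass the triton_kernels module that
    should export SparseMatrix and make_ragged_tensor_metadata.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Module (or one of its parents) is not installed.
        return False
    return all(hasattr(module, name) for name in names)


def use_legacy_impl(is_rocm: bool, new_api_available: bool) -> bool:
    """Fall back to the old implementation only on ROCm without the new API."""
    return is_rocm and not new_api_available


# Demonstration with a standard-library module standing in for triton_kernels.
print(has_symbols("math", "sqrt", "pi"))
print(use_legacy_impl(is_rocm=True, new_api_available=False))
```

Probing with `hasattr` after a successful import keeps the decision in one place, so the rest of the code can branch on a single boolean instead of repeating try/except blocks.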

Testing plan

vllm serve openai/gpt-oss-120b should work

… support triton 3.6

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
@mergify mergify bot added the gpt-oss, rocm, and bug labels Feb 9, 2026
@github-project-automation github-project-automation bot moved this to Todo in AMD Feb 9, 2026
@gemini-code-assist gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request introduces a fallback mechanism to support older triton_kernels APIs on ROCm platforms, which may lack SparseMatrix and make_ragged_tensor_metadata. The changes correctly detect the platform and API availability to switch between legacy and modern implementations. However, I've identified a critical bug in the import logic that will cause a NameError on non-ROCm platforms. My review includes a suggested fix for this issue.
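The review does not quote the offending lines, so the following is only a generic sketch of how platform-conditional import logic can produce this kind of NameError, and one common way to avoid it. The function names are hypothetical and `import math` stands in for the real new-API import; none of this is taken from the PR diff.

```python
def buggy_select(is_rocm: bool) -> bool:
    """Buggy pattern: the flag is bound only inside the ROCm branch."""
    if is_rocm:
        try:
            import math  # stand-in for importing the new API
            use_legacy = False
        except ImportError:
            use_legacy = True
    # When is_rocm is False, nothing has bound use_legacy. Inside a function
    # this raises UnboundLocalError, a subclass of NameError; at module level
    # the same pattern raises NameError directly.
    return use_legacy


def fixed_select(is_rocm: bool) -> bool:
    """Fixed pattern: bind a default before any platform check."""
    use_legacy = False
    if is_rocm:
        try:
            import math  # stand-in for importing the new API
        except ImportError:
            use_legacy = True
    return use_legacy


try:
    buggy_select(False)
except NameError as exc:
    print(f"buggy path failed on non-ROCm: {type(exc).__name__}")

print(fixed_select(False))
```

Binding the default first means every later reference is safe regardless of which platform branch ran.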

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
@github-project-automation github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Feb 9, 2026
@mgoin mgoin added the ready label Feb 9, 2026
@gshtras gshtras merged commit c60f8e3 into vllm-project:main Feb 9, 2026
59 checks passed
@github-project-automation github-project-automation bot moved this from Todo to Done in AMD Feb 9, 2026
@gshtras gshtras deleted the rocm_triton_kernels_fallback branch February 9, 2026 23:39
gshtras added a commit to ROCm/vllm that referenced this pull request Feb 10, 2026
… if the new API is not available (vllm-project#34153)

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
@gshtras gshtras added this to the v0.16.0 cherry picks milestone Feb 10, 2026
khluu pushed a commit that referenced this pull request Feb 11, 2026
… if the new API is not available (#34153)

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
(cherry picked from commit c60f8e3)
khluu added a commit that referenced this pull request Feb 20, 2026
… on ROCm if the new API is not available (#34153)"

This reverts commit 55a1bae.
khluu added a commit that referenced this pull request Feb 25, 2026
… on ROCm if the new API is not available (#34153)"

This reverts commit 55a1bae.
llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026
… if the new API is not available (vllm-project#34153)

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
khluu added a commit that referenced this pull request Mar 3, 2026
… on ROCm if the new API is not available (#34153)"

This reverts commit c60f8e3.
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026
… if the new API is not available (vllm-project#34153)

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
liuchenbing2026 pushed a commit to liuchenbing2026/vllm that referenced this pull request Apr 4, 2026
… if the new API is not available (vllm-project#34153)

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Labels

bug (Something isn't working), gpt-oss (Related to GPT-OSS models), ready (ONLY add when PR is ready to merge/full CI is needed), rocm (Related to AMD ROCm)
