[Bugfix][ROCm][GPT-OSS] Use old triton_kernels implementation on ROCm if the new API is not available by gshtras · Pull Request #34153 · vllm-project/vllm

gshtras · 2026-02-09T17:39:11Z

Follow-up to #30525.

The ROCm version of triton_kernels does not provide SparseMatrix or make_ragged_tensor_metadata.

To preserve GPT-OSS support on ROCm until we can move to the updated triton_kernels, we fall back to the old implementation when the new API is not available.
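A minimal sketch of how such an availability check could look, not the code from this PR. The helper names `has_symbols` and `select_impl` are hypothetical, and the module path `triton_kernels.tensor` is an assumption about where the two symbols live; only the symbol names SparseMatrix and make_ragged_tensor_metadata come from the description above.

```python
import importlib


def has_symbols(module_name: str, *names: str) -> bool:
    """Return True only if the module imports and exposes every given name."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Covers ModuleNotFoundError too: the package is not installed at all.
        return False
    return all(hasattr(module, name) for name in names)


def select_impl(module_name: str = "triton_kernels.tensor") -> str:
    """Pick "new" when both symbols exist, otherwise "legacy".

    The default module path is an assumption made for illustration.
    """
    if has_symbols(module_name, "SparseMatrix", "make_ragged_tensor_metadata"):
        return "new"
    return "legacy"
```

Probing for the symbols rather than for a version string keeps the check valid if a later ROCm build of triton_kernels gains the new API, which matches the intent of the second commit in this PR.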

Testing plan

Running vllm serve openai/gpt-oss-120b on ROCm should work.


gemini-code-assist

Code Review

This pull request introduces a fallback mechanism to support older triton_kernels APIs on ROCm platforms, which may lack SparseMatrix and make_ragged_tensor_metadata. The changes correctly detect the platform and API availability to switch between legacy and modern implementations. However, I've identified a critical bug in the import logic that will cause a NameError on non-ROCm platforms. My review includes a suggested fix for this issue.
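The code the review flagged is not reproduced on this page, so the following is only an illustration of the general class of bug: a name that is bound inside a platform-specific branch and then read on every platform. All function and variable names here are hypothetical.

```python
def buggy_use_legacy(is_rocm: bool, new_api_available: bool) -> bool:
    """Binds the flag only on ROCm, then reads it unconditionally."""
    if is_rocm:
        use_legacy = not new_api_available
    # On non-ROCm platforms use_legacy was never bound. Inside a function this
    # raises UnboundLocalError, which is a subclass of NameError; at module
    # level the same pattern raises NameError directly.
    return use_legacy


def fixed_use_legacy(is_rocm: bool, new_api_available: bool) -> bool:
    """Binds the flag on every platform; only ROCm may fall back."""
    return is_rocm and not new_api_available
```

In the fixed variant the availability probe runs everywhere and the platform only decides whether a fallback is allowed, so no code path can reach an unbound name.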

vllm/model_executor/layers/fused_moe/gpt_oss_triton_kernels_moe.py


… if the new API is not available (#34153) Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> (cherry picked from commit c60f8e3)


gshtras added 2 commits February 9, 2026 17:25

Fall back to the old triton_kernels API in case of ROCm before we can…

ecfad70

… support triton 3.6 Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>

Allow use of new API on ROCm when available

9b243f9

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>

gshtras requested review from mgoin and pavanimajety as code owners February 9, 2026 17:39

mergify bot added the gpt-oss, rocm, and bug labels Feb 9, 2026

github-project-automation bot added this to AMD and gpt-oss Issues & Enhancements Feb 9, 2026

github-project-automation bot moved this to Todo in AMD Feb 9, 2026

github-project-automation bot moved this to To Triage in gpt-oss Issues & Enhancements Feb 9, 2026

gemini-code-assist bot reviewed Feb 9, 2026

vllm/model_executor/layers/fused_moe/gpt_oss_triton_kernels_moe.py (outdated, resolved)

Fix import to happen on all platforms

db43087

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>

gshtras requested a review from tjtanaa February 9, 2026 17:48

sunway513 mentioned this pull request Feb 9, 2026

[ROCm] Use upstream Triton instead of custom ROCm/triton build sunway513/vllm#1

Draft

4 tasks

mgoin approved these changes Feb 9, 2026


github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Feb 9, 2026

mgoin added the ready label Feb 9, 2026

gshtras merged commit c60f8e3 into vllm-project:main Feb 9, 2026
59 checks passed

github-project-automation bot moved this from Todo to Done in AMD Feb 9, 2026

github-project-automation bot moved this from Ready to Done in gpt-oss Issues & Enhancements Feb 9, 2026

gshtras deleted the rocm_triton_kernels_fallback branch February 9, 2026 23:39

gshtras added a commit to ROCm/vllm that referenced this pull request Feb 10, 2026

[Bugfix][ROCm][GPT-OSS] Use old triton_kernels implementation on ROCm…

32f5c68

… if the new API is not available (vllm-project#34153) Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>

gshtras added this to the v0.16.0 cherry picks milestone Feb 10, 2026

khluu added a commit that referenced this pull request Feb 20, 2026

Revert "[Bugfix][ROCm][GPT-OSS] Use old triton_kernels implementation…

ad2a6bb

… on ROCm if the new API is not available (#34153)" This reverts commit 55a1bae.

khluu added a commit that referenced this pull request Feb 25, 2026

Revert "[Bugfix][ROCm][GPT-OSS] Use old triton_kernels implementation…

3c9496f

… on ROCm if the new API is not available (#34153)" This reverts commit 55a1bae.

llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026

[Bugfix][ROCm][GPT-OSS] Use old triton_kernels implementation on ROCm…

2ca67d2

… if the new API is not available (vllm-project#34153) Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>

khluu added a commit that referenced this pull request Mar 3, 2026

Revert "[Bugfix][ROCm][GPT-OSS] Use old triton_kernels implementation…

775d1de

… on ROCm if the new API is not available (#34153)" This reverts commit c60f8e3.

gshtras merged 3 commits into vllm-project:main from ROCm:rocm_triton_kernels_fallback


2 participants
