fix(rocm): Enable non-gated MoE (is_act_and_mul=False) support on ROCm by rabi · Pull Request #32244 · vllm-project/vllm

rabi · 2026-01-13T08:28:05Z

Purpose

Models like NemotronH use non-gated MoE with activations like relu2_no_mul. Previously, this was blocked on ROCm because the platform check only allowed CUDA.

Updates platform check from is_cuda() to is_cuda_alike() to allow ROCm
Disables AITER kernel for non-gated MoE since AITER only supports gated activations (silu/gelu)
Falls back to Triton implementation which properly handles non-gated activations via apply_moe_activation()

Test Plan

Tested on AMD MI210 GPU with NemotronH model.

Test Result

Model loads and serves successfully.

gemini-code-assist

Code Review

This pull request correctly enables non-gated MoE support on ROCm. The changes are well-targeted and logical. You've correctly updated the platform check from is_cuda() to is_cuda_alike() to include ROCm. Additionally, you've properly disabled the AITER kernel for non-gated MoE, as it only supports gated activations, allowing the system to fall back to the Triton implementation which handles this case. The changes appear correct and align with the stated purpose. I have no further comments.

tjtanaa

LGTM

mergify · 2026-01-16T03:27:07Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @rabi.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

mergify · 2026-01-16T03:39:27Z

Hi @rabi, the pre-commit checks have failed. Please run:

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?

mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:

# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint

Models like NemotronH use non-gated MoE with activations like relu2_no_mul. Previously, this was blocked on ROCm because the platform check only allowed CUDA. - Updates platform check from is_cuda() to is_cuda_alike() to allow ROCm - Disables AITER kernel for non-gated MoE since AITER only supports gated activations (silu/gelu) - Falls back to Triton implementation which properly handles non-gated activations via apply_moe_activation() Signed-off-by: rabi <ramishra@redhat.com>

tjtanaa

Thank you for the fix.

vllm-project#32244) Signed-off-by: rabi <ramishra@redhat.com>

vllm-project#32244) Signed-off-by: rabi <ramishra@redhat.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

vllm-project#32244) Signed-off-by: rabi <ramishra@redhat.com>

rabi requested review from mgoin and pavanimajety as code owners January 13, 2026 08:28

mergify bot added the rocm Related to AMD ROCm label Jan 13, 2026

DarkLight1337 requested a review from tjtanaa January 13, 2026 08:28

gemini-code-assist bot reviewed Jan 13, 2026

View reviewed changes

rabi mentioned this pull request Jan 13, 2026

[FIX] Add NO_MUL activation support for modular kernel path #31528

Merged

tjtanaa approved these changes Jan 14, 2026

View reviewed changes

tjtanaa added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 14, 2026

mergify bot added the needs-rebase label Jan 16, 2026

rabi force-pushed the fix_rocm branch from e87a997 to d6e80e2 Compare January 16, 2026 03:34

mergify bot removed the needs-rebase label Jan 16, 2026

rabi force-pushed the fix_rocm branch from d6e80e2 to 2d6efdb Compare January 16, 2026 03:56

tjtanaa approved these changes Jan 16, 2026

View reviewed changes

tjtanaa merged commit b66b0d6 into vllm-project:main Jan 16, 2026
52 of 53 checks passed

akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026

fix(rocm): Enable non-gated MoE (is_act_and_mul=False) support on ROCm (

a779a03

vllm-project#32244) Signed-off-by: rabi <ramishra@redhat.com>

dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026

fix(rocm): Enable non-gated MoE (is_act_and_mul=False) support on ROCm (

8eda9b5

vllm-project#32244) Signed-off-by: rabi <ramishra@redhat.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026

fix(rocm): Enable non-gated MoE (is_act_and_mul=False) support on ROCm (

01a1447

vllm-project#32244) Signed-off-by: rabi <ramishra@redhat.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(rocm): Enable non-gated MoE (is_act_and_mul=False) support on ROCm#32244

fix(rocm): Enable non-gated MoE (is_act_and_mul=False) support on ROCm#32244
tjtanaa merged 1 commit intovllm-project:mainfrom
rabi:fix_rocm

rabi commented Jan 13, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

tjtanaa left a comment

Uh oh!

mergify bot commented Jan 16, 2026

Uh oh!

mergify bot commented Jan 16, 2026

Uh oh!

tjtanaa left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

rabi commented Jan 13, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

tjtanaa left a comment

Choose a reason for hiding this comment

Uh oh!

mergify bot commented Jan 16, 2026

Uh oh!

mergify bot commented Jan 16, 2026

Uh oh!

tjtanaa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rabi commented Jan 13, 2026 •

edited by github-actions bot

Loading