[Qwen][Bugfix] Fixes sigmoid activation in torch impl of RMSNormGated (#40245)
Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
Code Review
This pull request introduces support for configurable activation functions in the `RMSNormGated` layer, specifically adding `'sigmoid'` alongside the existing `'silu'`/`'swish'` options. It also updates the `GDNLinearAttention` module to read these gate types from the model configuration. Review feedback points out that the assertion in `layernorm.py` is too restrictive because it excludes `'swish'`, and suggests using `torch.sigmoid` instead of the deprecated `F.sigmoid`.
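For context, a hypothetical usage sketch of how a caller such as `GDNLinearAttention` might wire the gate type through (it assumes the `RMSNormGated` sketch shown later in this thread; the `activation` keyword and the `gate_activation` attribute name are illustrative assumptions, not the exact vLLM API):

```python
from types import SimpleNamespace

# Stand-in for a model config; attribute names are assumptions.
config = SimpleNamespace(rms_norm_eps=1e-6, gate_activation="sigmoid")

# Construct the gated norm with the gate activation taken from the config,
# falling back to the default "silu" gate when the attribute is absent.
norm = RMSNormGated(
    hidden_size=1024,
    eps=config.rms_norm_eps,
    activation=getattr(config, "gate_activation", "silu"),
)
```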
```python
assert self.activation in ["silu", "sigmoid"]
act_fn = F.sigmoid if self.activation == "sigmoid" else F.silu
```
The assertion is too restrictive, as it excludes `"swish"`, which is the default activation for this class (defined at line 429). This will cause a runtime error in `forward_native` for any model using the default configuration. Additionally, `torch.sigmoid` is preferred over `F.sigmoid`, as the latter is deprecated in modern PyTorch versions.
```diff
- assert self.activation in ["silu", "sigmoid"]
- act_fn = F.sigmoid if self.activation == "sigmoid" else F.silu
+ assert self.activation in ["silu", "sigmoid", "swish"]
+ act_fn = torch.sigmoid if self.activation == "sigmoid" else F.silu
```
@youkaichao @Tib-Gridello @ZJY0516 can you folks take a look at this PR? The test failure seems unrelated to this PR.
```python
weight = self.weight.float()
z = z.float() if z is not None else None
# ...
assert self.activation in ["silu", "sigmoid", "swish"]
```
I suggest doing this in `__init__` to avoid overhead during `forward`.
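A minimal sketch of that refactor, assuming the constructor shape implied by the snippets above (not the exact vLLM code):

```python
import torch
import torch.nn.functional as F
from torch import nn


class RMSNormGated(nn.Module):
    def __init__(self, hidden_size: int, eps: float = 1e-6,
                 activation: str = "silu") -> None:
        super().__init__()
        self.weight = nn.Parameter(torch.ones(hidden_size))
        self.variance_epsilon = eps
        self.activation = activation
        # Validate and resolve the gate activation once at construction
        # time, so forward() does not repeat the assert/branch per call.
        assert activation in ("silu", "sigmoid", "swish")
        self.act_fn = torch.sigmoid if activation == "sigmoid" else F.silu
```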
The sigmoid activation in `RMSNormGated` was added to `forward_cuda`, but not `forward_native`.
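For illustration, a hedged sketch of what the fixed native path could look like, reusing the gate variable name `z` from the review snippet and the `self.act_fn` resolved in `__init__` above; the exact normalization order in vLLM may differ:

```python
from typing import Optional


def forward_native(self, x: torch.Tensor,
                   z: Optional[torch.Tensor] = None) -> torch.Tensor:
    input_dtype = x.dtype
    x = x.float()
    weight = self.weight.float()
    z = z.float() if z is not None else None
    if z is not None:
        # Apply the configured gate activation. Before this fix, the
        # native path hard-coded SiLU, so sigmoid gates only took effect
        # in forward_cuda.
        x = x * self.act_fn(z)
    variance = x.pow(2).mean(-1, keepdim=True)
    x = x * torch.rsqrt(variance + self.variance_epsilon)
    return (weight * x).to(input_dtype)
```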