[BUGFIX] fix undefined silu_and_mul_nvfp4_quant #23929
simon-mo merged 3 commits into vllm-project:main
Conversation
Signed-off-by: hongchao <hongchao@msh.team>
Code Review
This pull request correctly identifies the need to restrict the compilation and usage of silu_and_mul_nvfp4_quant to specific CUDA architectures that support NVFP4. The changes in the C++ files appear to implement this correctly using preprocessor directives. However, there is a critical issue in the corresponding Python check within vllm/compilation/fix_functionalization.py. The device capability check uses an incorrect value, which will prevent this optimized kernel from being used on the intended hardware (e.g., Blackwell GPUs).
    elif current_platform.has_device_capability(
        100
    ) and at_target == torch.ops._C.silu_and_mul_nvfp4_quant.default:
The check current_platform.has_device_capability(100) is incorrect. The has_device_capability method compares its integer argument with the major version of the CUDA compute capability. For Blackwell GPUs (SM 10.0), the major version is 10. Therefore, the check 10 >= 100 will always evaluate to false, preventing this code path from ever being taken on the intended hardware. This should be has_device_capability(10) to correctly target devices with compute capability 10.x and above.
    - elif current_platform.has_device_capability(
    -     100
    - ) and at_target == torch.ops._C.silu_and_mul_nvfp4_quant.default:
    + elif current_platform.has_device_capability(
    +     10
    + ) and at_target == torch.ops._C.silu_and_mul_nvfp4_quant.default:
An easier way to check here is `hasattr(torch.ops._C, "silu_and_mul_nvfp4_quant")`.
I feel `hasattr(torch.ops._C, "silu_and_mul_nvfp4_quant")` makes more sense.
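The `hasattr`-based guard suggested above can be sketched as follows. Since `torch.ops._C` is only populated with this op when the CUDA extension actually registers it, the example below uses a stand-in namespace object instead of importing torch; `pick_fused_op` is a hypothetical helper name, not vLLM code.

```python
from types import SimpleNamespace


def pick_fused_op(ops_namespace, op_name="silu_and_mul_nvfp4_quant"):
    """Return the fused op if the extension registered it, else None.

    Mirrors the review suggestion: guard on op presence rather than on
    device capability, so builds that compiled the kernel out (e.g. for
    pre-Blackwell architectures) never reference an undefined symbol.
    """
    if hasattr(ops_namespace, op_name):
        return getattr(ops_namespace, op_name)
    return None


# Stand-in for torch.ops._C on a build where the kernel was compiled in:
ops_with_kernel = SimpleNamespace(silu_and_mul_nvfp4_quant=lambda x: x)
# ...and on a build where it was compiled out:
ops_without_kernel = SimpleNamespace()

assert pick_fused_op(ops_with_kernel) is not None
assert pick_fused_op(ops_without_kernel) is None
```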
Related issue: #23916

@mgoin can you help take a look?

I confirmed that this PR fixes the issue for me.

I already have a fix in #23727. Like the other nvfp4 kernels, I added the function definition in csrc/quantization/fp4/nvfp4_quant_entry.cu so the function is always defined on the general code path. This should fix the issue in a cleaner way.
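The always-defined entry-point pattern described above (the symbol always exists; unsupported builds fail at call time instead of at link time) can be sketched in Python terms. This is a hypothetical transliteration, not the actual CUDA code: the real nvfp4_quant_entry.cu uses a compile-time preprocessor guard, for which a module-level flag stands in here, and `_silu_and_mul_nvfp4_quant_impl` is an assumed name for the guarded kernel.

```python
# Stands in for a compile-time guard such as #ifdef around the NVFP4 kernel.
KERNEL_COMPILED_IN = False


def silu_and_mul_nvfp4_quant_entry(x):
    """Entry point that is always defined, so callers never hit an
    undefined symbol; builds without the kernel fail loudly at call time.
    """
    if not KERNEL_COMPILED_IN:
        raise RuntimeError(
            "silu_and_mul_nvfp4_quant requires an NVFP4-capable build")
    # On NVFP4-capable builds, dispatch to the compiled kernel
    # (hypothetical name for illustration only):
    return _silu_and_mul_nvfp4_quant_impl(x)
```

With this pattern, capability or availability checks in Python become optional: calling the op on an unsupported build raises a clear error rather than crashing on a missing symbol.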
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
Signed-off-by: hongchao <hongchao@msh.team> Signed-off-by: Richard Zou <zou3519@gmail.com> Co-authored-by: hongchao <hongchao@msh.team> Co-authored-by: Richard Zou <zou3519@gmail.com> Co-authored-by: Richard Zou <zou3519@users.noreply.github.com>
Signed-off-by: Danielle Robinson <dmmaddix@amazon.com>
Fix undefined silu_and_mul_nvfp4_quant on Hopper or Ampere CUDA devices.
