
[Bugfix] Fix Dtypes for Pynccl Wrapper#33030

Merged
robertgshaw2-redhat merged 5 commits into main from fix-fp8-dtype-sending
Jan 26, 2026

Conversation

@robertgshaw2-redhat (Collaborator) commented Jan 25, 2026

Purpose

Test Plan

  • ci

Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Robert Shaw <robshaw@redhat.com>
@mergify mergify bot added the "bug" (Something isn't working) label Jan 25, 2026
@gemini-code-assist gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request adds support for fp8 data types in the pynccl wrapper, which is necessary for distributed communication with fp8 tensors. The changes correctly add ncclFloat8e4m3 to the data type enum and handle torch.float8_e4m3fn in the type conversion logic.

My review includes a suggestion to also handle the torch.float8_e4m3fnuz variant, as it appears to be used in other parts of the codebase, to prevent potential runtime errors. This will make the fp8 support more robust.
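The enum-plus-conversion structure described above can be sketched as follows. This is a minimal illustration, not the actual vLLM code: dtype names are used as string keys so the example runs without torch (the real wrapper keys on `torch.dtype` objects), and the numeric values follow recent `nccl.h` conventions but should be treated as assumptions here.

```python
# Illustrative sketch of a torch-dtype -> NCCL-dtype mapping, as described
# in the review summary above. Not the actual vLLM implementation.
class ncclDataTypeEnum:
    # Values mirror NCCL's ncclDataType_t ordering (assumed from nccl.h).
    ncclInt8 = 0
    ncclUint8 = 1
    ncclInt32 = 2
    ncclUint32 = 3
    ncclInt64 = 4
    ncclUint64 = 5
    ncclFloat16 = 6
    ncclFloat32 = 7
    ncclFloat64 = 8
    ncclBfloat16 = 9
    ncclFloat8e4m3 = 10  # the entry this PR adds

    # String keys stand in for torch.dtype objects in this sketch.
    _FROM_NAME = {
        "int8": ncclInt8,
        "uint8": ncclUint8,
        "int32": ncclInt32,
        "int64": ncclInt64,
        "float16": ncclFloat16,
        "float32": ncclFloat32,
        "float64": ncclFloat64,
        "bfloat16": ncclBfloat16,
        "float8_e4m3fn": ncclFloat8e4m3,  # the case this PR handles
    }

    @classmethod
    def from_name(cls, name: str) -> int:
        try:
            return cls._FROM_NAME[name]
        except KeyError:
            raise ValueError(
                f"Unsupported dtype {name}: should be one of "
                "int8, uint8, int32, int64, float16, float32, float64, "
                "bfloat16, float8_e4m3fn."
            ) from None
```

A lookup table with an explicit error branch keeps unsupported dtypes failing loudly at the collective-op call site rather than deep inside NCCL.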

Comment on lines +96 to 102:

```diff
         if dtype == torch.float8_e4m3fn:
             return cls.ncclFloat8e4m3
         raise ValueError(
             f"Unsupported dtype {dtype}: should be one of "
-            f"int8, uint8, int32, int64, float16, float32, float64, bfloat16."
+            f"int8, uint8, int32, int64, float16, float32, float64, bfloat16,"
+            " float8e4m3."
         )
```

Severity: high

The codebase, for example in vllm/model_executor/layers/quantization/utils/fp8_utils.py, seems to use both torch.float8_e4m3fn and torch.float8_e4m3fnuz. This function should handle both types to avoid ValueError during collective communication operations with torch.float8_e4m3fnuz tensors. The error message is also updated for clarity.

For more complete FP8 support, you might also consider adding torch.float8_e5m2. This would involve adding ncclFloat8e5m2 to ncclDataTypeEnum and handling torch.float8_e5m2 in this method.

Suggested change:

```diff
-        if dtype == torch.float8_e4m3fn:
-            return cls.ncclFloat8e4m3
-        raise ValueError(
-            f"Unsupported dtype {dtype}: should be one of "
-            f"int8, uint8, int32, int64, float16, float32, float64, bfloat16,"
-            " float8e4m3."
-        )
+        if dtype in (torch.float8_e4m3fn, torch.float8_e4m3fnuz):
+            return cls.ncclFloat8e4m3
+        raise ValueError(
+            f"Unsupported dtype {dtype}: should be one of "
+            "int8, uint8, int32, int64, float16, float32, float64, bfloat16, "
+            "float8_e4m3fn, float8_e4m3fnuz."
+        )
```
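The reviewer's suggestion above amounts to mapping both torch e4m3 variants to the same NCCL wire type, with e5m2 getting its own entry. A dependency-free sketch of that idea (dtype names as string keys, enum values assumed from recent `nccl.h`; illustrative only, not the vLLM code):

```python
# Sketch of the reviewer's suggestion: both torch e4m3 variants share one
# NCCL data type, and e5m2 would need its own enum entry (ncclFloat8e5m2).
# Numeric values are assumptions based on recent nccl.h.
NCCL_FLOAT8_E4M3 = 10
NCCL_FLOAT8_E5M2 = 11

FP8_NAME_TO_NCCL = {
    "float8_e4m3fn": NCCL_FLOAT8_E4M3,    # OCP e4m3 as used on NVIDIA GPUs
    "float8_e4m3fnuz": NCCL_FLOAT8_E4M3,  # ROCm e4m3 variant, same byte width
    "float8_e5m2": NCCL_FLOAT8_E5M2,      # wider exponent, lower precision
}

def fp8_nccl_type(dtype_name: str) -> int:
    """Resolve an fp8 dtype name to its (assumed) NCCL data type value."""
    try:
        return FP8_NAME_TO_NCCL[dtype_name]
    except KeyError:
        raise ValueError(f"Unsupported fp8 dtype: {dtype_name}") from None
```

Since both e4m3 variants are one byte wide, reusing the same wire type for collective communication is plausible, though the two formats do differ in bit semantics (fnuz has no negative zero or infinities).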

@robertgshaw2-redhat robertgshaw2-redhat changed the title from "[Bugfix] Add Support for Fp8 In Pynccl Wrapper" to "[Bugfix] Fix Dtypes for Pynccl Wrapper" Jan 25, 2026
@robertgshaw2-redhat robertgshaw2-redhat added the "ready" (ONLY add when PR is ready to merge/full CI is needed) label Jan 25, 2026
@LucasWilkinson LucasWilkinson (Collaborator) left a comment

LGTM thanks for the quick fix!

Signed-off-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: Robert Shaw <robshaw@redhat.com>
@mergify mergify bot added the nvidia label Jan 25, 2026
@github-project-automation github-project-automation bot moved this to Ready in NVIDIA Jan 25, 2026
@robertgshaw2-redhat robertgshaw2-redhat enabled auto-merge (squash) January 25, 2026 15:12
@LopezCastroRoberto (Contributor) commented

LGTM too. Thanks for the fix, Rob

@robertgshaw2-redhat (Collaborator, Author) commented

updating main for CI

@robertgshaw2-redhat robertgshaw2-redhat merged commit 43a013c into main Jan 26, 2026
62 checks passed
@robertgshaw2-redhat robertgshaw2-redhat deleted the fix-fp8-dtype-sending branch January 26, 2026 20:09
@github-project-automation github-project-automation bot moved this from Ready to Done in NVIDIA Jan 26, 2026
khluu pushed a commit that referenced this pull request Jan 26, 2026
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
(cherry picked from commit 43a013c)
apd10 pushed a commit to apd10/vllm that referenced this pull request Jan 31, 2026
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>

Labels

  • bug (Something isn't working)
  • nvidia
  • ready (ONLY add when PR is ready to merge/full CI is needed)

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

[CI Failure]: MoE Integration Tests

4 participants