[Bugfix] Fix Typo from NVFP4 Refactor #31977

Merged
simon-mo merged 5 commits into main from fix-cutedsl
Jan 9, 2026
@robertgshaw2-redhat (Collaborator) commented Jan 8, 2026

Purpose

  • A typo introduced in the NVFP4 refactor broke the CuteDSL backend.
  • This PR fixes the typo and adds CI/CD test coverage for it.

Test Plan

  • CI, which runs CuteDSL.

Test Result



Signed-off-by: Robert Shaw <robshaw@redhat.com>
@robertgshaw2-redhat changed the title from "updated" to "[Bugfix] Fix Typo from NVFP4 Refactor" Jan 8, 2026
@mergify mergify bot added the nvidia label Jan 8, 2026

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request correctly fixes a bug where NvFp4MoeBackend.FLASHINFER_CUTEDSL was missing from the list of supported backends, which would have caused an assertion failure. However, I've identified a related critical omission: the new backend also needs to be added to the weight reordering logic for gated activations to prevent incorrect computations. I've left a specific comment detailing the required change.
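
To illustrate the failure mode described in this review, here is a minimal sketch, not the vLLM code itself. Only `NvFp4MoeBackend.FLASHINFER_CUTEDSL` comes from this PR; the other enum members, the two lists, and the helpers `select_backend` and `missing_backends` are hypothetical names chosen for illustration. It shows how a backend left out of a supported-backends list trips an assertion, and how a simple check over the enum could catch that kind of omission in CI.

```python
from enum import Enum


class NvFp4MoeBackend(Enum):
    # Only FLASHINFER_CUTEDSL is named in this PR; the other two members
    # are hypothetical placeholders.
    FLASHINFER_CUTEDSL = "flashinfer_cutedsl"
    HYPOTHETICAL_BACKEND_A = "hypothetical_backend_a"
    HYPOTHETICAL_BACKEND_B = "hypothetical_backend_b"


# Hypothetical list with the kind of omission the review describes.
BUGGY_SUPPORTED = [
    NvFp4MoeBackend.HYPOTHETICAL_BACKEND_A,
    NvFp4MoeBackend.HYPOTHETICAL_BACKEND_B,
]

# Hypothetical list after the fix: the missing backend is included.
FIXED_SUPPORTED = BUGGY_SUPPORTED + [NvFp4MoeBackend.FLASHINFER_CUTEDSL]


def select_backend(backend, supported):
    """Return the backend, asserting that it is in the supported list."""
    assert backend in supported, f"Unsupported NVFP4 MoE backend: {backend}"
    return backend


def missing_backends(supported):
    """List enum members absent from the supported list (a CI-style check)."""
    return [b for b in NvFp4MoeBackend if b not in supported]
```

Calling `select_backend(NvFp4MoeBackend.FLASHINFER_CUTEDSL, BUGGY_SUPPORTED)` raises an `AssertionError`, while the same call with `FIXED_SUPPORTED` returns the backend. The same pattern would apply to any other per-backend list, such as the weight reordering logic mentioned above.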

Robert Shaw added 2 commits January 8, 2026 11:22
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Robert Shaw added 2 commits January 8, 2026 11:43
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: Robert Shaw <robshaw@redhat.com>

@mgoin (Member) left a comment

LGTM

@github-project-automation github-project-automation bot moved this to Ready in NVIDIA Jan 8, 2026
@mgoin mgoin added the ready label Jan 8, 2026

@yewentao256 (Member) left a comment

LGTM, Thanks for the work!

@robertgshaw2-redhat robertgshaw2-redhat enabled auto-merge (squash) January 8, 2026 23:54
@github-project-automation github-project-automation bot moved this to Backlog in MoE Refactor Jan 9, 2026
@robertgshaw2-redhat robertgshaw2-redhat moved this from Backlog to In review in MoE Refactor Jan 9, 2026
@simon-mo simon-mo disabled auto-merge January 9, 2026 00:18
@simon-mo simon-mo merged commit 0fa8dd2 into main Jan 9, 2026
56 of 58 checks passed
@simon-mo simon-mo deleted the fix-cutedsl branch January 9, 2026 00:18
@github-project-automation github-project-automation bot moved this from Ready to Done in NVIDIA Jan 9, 2026
@github-project-automation github-project-automation bot moved this from In review to Done in MoE Refactor Jan 9, 2026
yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>