
[Bugfix] Fix FP8 online quantization premature trigger with TP sharded weights #36621

Open
AjAnubolu wants to merge 1 commit into vllm-project:main from AjAnubolu:fix/fp8-tp-empty-output-36583

Conversation

@AjAnubolu
Contributor

Use `>=` instead of `==` for the loaded-numel check to guard against edge cases in `copy_` tracking with TP > 1.

Closes #36583
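For readers following along, the failure mode is easiest to see as a sketch. The class and method names below are illustrative, not vLLM's actual API: the point is only the comparison operator in the "fully loaded" check.

```python
# Minimal sketch of the loaded-numel check this PR changes
# (CopyNumelCounter here is a simplified stand-in, not vLLM's class).
# FP8 online quantization defers weight processing until the whole
# weight has been written via copy_. With tensor parallelism (TP > 1),
# overlapping or padded shard copies can push the tracked element
# count past the weight's numel, so a strict == check never fires,
# the deferred quantization is skipped, and the model emits empty output.

class CopyNumelCounter:
    """Tracks how many elements have been copied into a weight."""

    def __init__(self, target_numel: int) -> None:
        self.target_numel = target_numel
        self.loaded_numel = 0

    def record_copy(self, copied_numel: int) -> None:
        # Called from a copy_ hook; accumulates elements written so far.
        self.loaded_numel += copied_numel

    def is_fully_loaded(self) -> bool:
        # The fix: >= instead of ==, so over-counting from duplicated
        # shard regions still triggers the deferred FP8 processing.
        return self.loaded_numel >= self.target_numel


counter = CopyNumelCounter(target_numel=1024)
counter.record_copy(512)
counter.record_copy(512)
counter.record_copy(64)   # duplicated/padded shard copy under TP
print(counter.is_fully_loaded())  # True with >=; == would return False
```

With `==`, the counter above lands at 1088 and the trigger is missed forever; with `>=`, processing fires as soon as the count reaches the target, which is the behavior this PR restores.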

@mergify mergify bot added the bug Something isn't working label Mar 10, 2026
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request addresses a bug in FP8 online quantization for models with tensor parallelism. The change modifies the condition for triggering the weight processing from an equality check to a greater-than-or-equal-to check. This is intended to make the logic more robust against edge cases with sharded weights. New regression tests have been added to cover this scenario and verify the behavior of the CopyNumelCounter.
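The regression tests mentioned above are not shown in this conversation; a self-contained sketch of what such a test could look like follows (function names are illustrative, not the actual vLLM test code):

```python
# Hedged sketch of a regression test for the == -> >= change.
# is_fully_loaded stands in for the patched condition on the counter.

def is_fully_loaded(loaded_numel: int, target_numel: int) -> bool:
    # Patched condition: >= tolerates copy_ over-counting under TP.
    return loaded_numel >= target_numel


def test_exact_load_triggers():
    # Single-GPU case: the count matches the weight's numel exactly.
    assert is_fully_loaded(loaded_numel=1024, target_numel=1024)


def test_overcounted_tp_load_still_triggers():
    # TP > 1: duplicated/padded shard copies overshoot the target;
    # the old == check would never fire here.
    assert is_fully_loaded(loaded_numel=1088, target_numel=1024)


def test_partial_load_does_not_trigger():
    # A half-written weight must not be quantized prematurely.
    assert not is_fully_loaded(loaded_numel=512, target_numel=1024)
```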

@vkuzo
Contributor

vkuzo commented Mar 12, 2026

@AjAnubolu could you share which models hit this edge case so we are aware?

note that #33814 should also take care of this

…d weights

Signed-off-by: AjAnubolu <anuboluajay@gmail.com>
@AjAnubolu AjAnubolu force-pushed the fix/fp8-tp-empty-output-36583 branch from 4635dc9 to f3d6e91 Compare March 13, 2026 03:07

Labels

bug Something isn't working

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Empty output when using FP8 + Tensor Parallel (2 GPUs) with Qwen3-8B

2 participants