[Hardware][AMD][Bugfix] Fix PTPC FP8 quantization by mawong-amd · Pull Request #32813 · vllm-project/vllm

mawong-amd · 2026-01-21T22:10:57Z

Purpose

Fixes PTPC FP8 quantization and thus AMD Quantization Tests after the refactoring done in #32189. PTPCFP8LinearMethod should now inherit from FP8OnlineLinearMethod rather than FP8LinearMethod.

Test Plan

pytest -sv quantization/test_ptpc_fp8.py
The above is implicitly run as part of AMD CI's Quantization Tests group.

Test Result

The test and test group both pass.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

gemini-code-assist

Code Review

This pull request correctly fixes a bug in the PTPC FP8 quantization implementation. By changing the base class of PTPCFp8LinearMethod from Fp8LinearMethod to Fp8OnlineLinearMethod, the method now correctly inherits the behavior for online quantization, which is its intended purpose. This aligns with the fact that PTPC performs dynamic, on-the-fly quantization of weights rather than loading pre-quantized checkpoints. The change is logical, well-contained, and directly addresses the issue described. I find no issues with this correction.

mawong-amd · 2026-01-22T03:39:22Z

Closing since PTPC FP8 is being deprecated soon: #32700

Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com>

mawong-amd requested review from mgoin, pavanimajety, robertgshaw2-redhat, tlrmchlsmth and yewentao256 as code owners January 21, 2026 22:10

mawong-amd changed the title ~~Fix PTPC quantization~~ [Hardware][AMD][Bugfix] Fix PTPC quantization Jan 21, 2026

mawong-amd changed the title ~~[Hardware][AMD][Bugfix] Fix PTPC quantization~~ [Hardware][AMD][Bugfix] Fix PTPC FP8 quantization Jan 21, 2026

mergify bot added rocm Related to AMD ROCm bug Something isn't working labels Jan 21, 2026

gemini-code-assist bot reviewed Jan 21, 2026

View reviewed changes

mawong-amd closed this Jan 22, 2026

mawong-amd deleted the fix_ptpc_quantization branch January 22, 2026 06:01

mawong-amd mentioned this pull request Jan 22, 2026

[Hardware][AMD][CI][Bugfix] Fix regressions from deprecated env vars #32837

Merged

5 tasks

mawong-amd restored the fix_ptpc_quantization branch March 2, 2026 17:50

mawong-amd reopened this Mar 2, 2026

Fix PTPC quantization

61d73ae

Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com>

mawong-amd force-pushed the fix_ptpc_quantization branch from 4e61450 to 61d73ae Compare March 2, 2026 17:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Hardware][AMD][Bugfix] Fix PTPC FP8 quantization#32813

[Hardware][AMD][Bugfix] Fix PTPC FP8 quantization#32813
mawong-amd wants to merge 1 commit intovllm-project:mainfrom
ROCm:fix_ptpc_quantization

mawong-amd commented Jan 21, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

mawong-amd commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

mawong-amd commented Jan 21, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

mawong-amd commented Jan 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mawong-amd commented Jan 21, 2026 •

edited by github-actions bot

Loading