Fix AWQ tests for GPTQModel migration by jiqing-feng · Pull Request #44654 · huggingface/transformers

jiqing-feng · 2026-03-13T07:31:19Z

This PR fixes the AWQ test suite to align with the GPTQModel migration (following #41567 and #42776).

Changes

Fix replace_with_awq_linear return value: The function now returns the model directly instead of a tuple (model, _), updated all call sites accordingly.
Use BaseQuantLinear for type checking: Replaced specific AwqGEMMQuantLinear / AwqGEMVQuantLinear isinstance checks with the unified gptqmodel.nn_modules.qlinear.BaseQuantLinear.
Remove explicit GEMM backend override in test setup: Let the model load with its default quantization config instead of forcing AwqBackend.GEMM.
Fix save and load test: Load a fresh model for save/reload testing to avoid corrupted buffers caused by in-place transforms from prior generate() calls on the shared model instance.
Update ground truth expected output: Added an additional valid expected output for bf16 inference.

@SunMarc @Qubitium

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

github-actions · 2026-03-13T08:16:18Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: autoawq

Qubitium · 2026-03-13T08:28:29Z

LGTM! Allowing gptqmodel to auto load the awq kernel will also make future tests regress less since transformers no longer need to know about format=awq to correct kernel to env mappings.

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

SunMarc

Thanks !

HuggingFaceDocBuilderDev · 2026-03-13T16:21:48Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jiqing-feng added 5 commits March 13, 2026 14:48

fix awq tests

d553d6b

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

update ground truth

bc045bf

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

fix return replace_with_awq_linear

f19fe55

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

update linear type

3285f4a

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

fix save and load

efafee2

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

jiqing-feng marked this pull request as ready for review March 13, 2026 07:32

github-actions bot requested a review from ydshieh March 13, 2026 07:33

fix format

9d495ca

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

SunMarc approved these changes Mar 13, 2026

View reviewed changes

SunMarc enabled auto-merge March 13, 2026 16:11

SunMarc added this pull request to the merge queue Mar 13, 2026

Merged via the queue into huggingface:main with commit f2f7c89 Mar 13, 2026
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix AWQ tests for GPTQModel migration#44654

Fix AWQ tests for GPTQModel migration#44654
SunMarc merged 6 commits intohuggingface:mainfrom
jiqing-feng:awq

jiqing-feng commented Mar 13, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 13, 2026

Uh oh!

Qubitium commented Mar 13, 2026

Uh oh!

SunMarc left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Mar 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

jiqing-feng commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Uh oh!

github-actions bot commented Mar 13, 2026

Uh oh!

Qubitium commented Mar 13, 2026

Uh oh!

SunMarc left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Mar 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jiqing-feng commented Mar 13, 2026 •

edited

Loading