Supports Loading Quantized Models with `from_preset()` #2367

JyotinderSingh · 2025-08-20T05:54:28Z

Description of the change

This change resolves an issue with loading quantized models from presets. Previously, the model's serialized DTypePolicyMap was not correctly passed to the backbone during loading, which caused failures during initialization of quantized layers.

The fix introduces a new _resolve_dtype utility function that determines the correct dtype for the model based on the following rules:

User-specified dtype: If a user explicitly provides a dtype in the from_preset call (e.g., from_preset("bert_tiny_en_uncased", num_classes=2, dtype="float32")), that value is used.
Float type casting: If no user dtype is provided and the saved dtype is a floating-point type (e.g., "float32"), the model will be loaded using the current Keras default dtype policy. This allows for safe casting between different floating-point precisions.
DTypePolicyMap: If no user dtype is provided and the saved dtype is a complex object (like a DTypePolicyMap for quantization), the saved type is used as is. This ensures that quantization configurations are preserved during loading.

Colab Notebook

Failing Case: This notebook demonstrates the original bug where loading the preset fails.
Successful Case: This notebook shows the corrected behavior where the quantized preset loads successfully with this change.

Checklist

I have added all the necessary unit tests for my change.
I have verified that my change does not break existing code and works with all backends (TensorFlow, JAX, and PyTorch).
My PR is based on the latest changes of the main branch (if unsure, rebase the code).
I have followed the Keras Hub Model contribution guidelines in making these changes.
I have followed the Keras Hub API design guidelines in making these changes.
I have signed the Contributor License Agreement.

mattdangerw

Thanks!

keras_hub/src/utils/preset_utils.py

keras_hub/src/models/backbone.py

keras_hub/src/utils/preset_utils.py

JyotinderSingh

Resolved comments

keras_hub/src/models/backbone.py

keras_hub/src/utils/preset_utils.py

divyashreepathihalli · 2025-08-25T22:04:00Z

/gemini review

gemini-code-assist

Code Review

This pull request effectively addresses an issue with loading quantized models from presets by introducing a _resolve_dtype utility function and ensuring dtype policies are correctly serialized. The changes are logical and well-tested. I have a couple of minor suggestions to fix a test assertion message and improve docstring formatting to align with the style guide.

keras_hub/src/models/task_test.py

keras_hub/src/utils/preset_utils.py

mattdangerw

lgtm! just a couple nits

keras_hub/src/models/task_test.py

keras_hub/src/utils/preset_utils.py

mattdangerw

Thanks!

ensures DTypePolicyMap is added to backbone kwargs during load_task

1b07517

JyotinderSingh force-pushed the save_quantized_presets branch from fd28a15 to 1b07517 Compare August 20, 2025 06:32

JyotinderSingh added 4 commits August 20, 2025 13:18

Added test for loading quantized presets

e5eff0a

marks test as large

a3b6f48

validate quantized dtypes in test

37db80d

add comments

84f26d2

JyotinderSingh marked this pull request as ready for review August 20, 2025 08:14

JyotinderSingh requested a review from mattdangerw August 20, 2025 08:15

mattdangerw reviewed Aug 20, 2025

View reviewed changes

keras_hub/src/utils/preset_utils.py Outdated Show resolved Hide resolved

JyotinderSingh changed the title ~~Fixes issue with loading quantized models from presets~~ Support Loading Quantized Models with from_preset() Aug 21, 2025

JyotinderSingh force-pushed the save_quantized_presets branch from 88e2cec to 430d7b9 Compare August 21, 2025 08:24

implements priority-based dtype resolution + tests

58dfab9

JyotinderSingh force-pushed the save_quantized_presets branch from 430d7b9 to 58dfab9 Compare August 21, 2025 08:36

JyotinderSingh requested a review from mattdangerw August 21, 2025 11:03

mattdangerw reviewed Aug 22, 2025

View reviewed changes

keras_hub/src/models/backbone.py Outdated Show resolved Hide resolved

keras_hub/src/models/backbone.py Outdated Show resolved Hide resolved

keras_hub/src/utils/preset_utils.py Outdated Show resolved Hide resolved

keras_hub/src/utils/preset_utils.py Outdated Show resolved Hide resolved

Fixes float check + improves logging

ef053d6

JyotinderSingh commented Aug 23, 2025

View reviewed changes

keras_hub/src/models/backbone.py Outdated Show resolved Hide resolved

keras_hub/src/models/backbone.py Outdated Show resolved Hide resolved

keras_hub/src/utils/preset_utils.py Outdated Show resolved Hide resolved

keras_hub/src/utils/preset_utils.py Outdated Show resolved Hide resolved

Fixes dtype serialization

6b45f05

JyotinderSingh requested a review from mattdangerw August 25, 2025 04:02

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Aug 25, 2025

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Aug 25, 2025

gemini-code-assist bot reviewed Aug 25, 2025

View reviewed changes

keras_hub/src/models/task_test.py Outdated Show resolved Hide resolved

keras_hub/src/utils/preset_utils.py Show resolved Hide resolved

mattdangerw reviewed Aug 26, 2025

View reviewed changes

keras_hub/src/models/task_test.py Outdated Show resolved Hide resolved

keras_hub/src/utils/preset_utils.py Show resolved Hide resolved

improves float check + adds tests

7eb8f1e

JyotinderSingh force-pushed the save_quantized_presets branch from 0161fb9 to 7eb8f1e Compare August 26, 2025 04:17

removes types not supported by standardize_dtypes

aa78876

mattdangerw approved these changes Aug 26, 2025

View reviewed changes

JyotinderSingh merged commit 3bfa89f into keras-team:master Aug 26, 2025
7 checks passed

JyotinderSingh deleted the save_quantized_presets branch August 26, 2025 19:37

JyotinderSingh mentioned this pull request Aug 27, 2025

Keras Fails to load quantized model keras-team/keras#21378

Closed

JyotinderSingh changed the title ~~Support Loading Quantized Models with from_preset()~~ Supports Loading Quantized Models with from_preset() Sep 12, 2025

JyotinderSingh mentioned this pull request Oct 27, 2025

fix quantization save and load error keras-team/keras#21504

Closed

Supports Loading Quantized Models with from_preset() #2367

Supports Loading Quantized Models with from_preset() #2367

Uh oh!

Conversation

JyotinderSingh commented Aug 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of the change

Colab Notebook

Checklist

Uh oh!

mattdangerw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JyotinderSingh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

divyashreepathihalli commented Aug 25, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

mattdangerw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mattdangerw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Supports Loading Quantized Models with `from_preset()` #2367

Supports Loading Quantized Models with `from_preset()` #2367

JyotinderSingh commented Aug 20, 2025 •

edited

Loading