Fix accessing final norm for Gemma-3 models by kunal-vaishnavi · Pull Request #1687 · microsoft/onnxruntime-genai

kunal-vaishnavi · 2025-08-18T03:00:01Z

Description

This PR fixes how the final norm is identified for the Gemma-3 models. It works with the latest version of Hugging Face's transformers (v4.55.2).

Motivation and Context

Previous versions of transformers would modify the class structure for the Gemma-3 models as breaking changes. Since transformers has landed on a stable way to load multi-modal models with AutoModelForCausalLM for now, the current approach is to identify the path to model.model.language_model.norm for the Gemma-3 models that are multi-modal.

Gemma-3 1B's final norm is accessible at model.model.norm while Gemma-3 4B's final norm is accessible at model.model.language_model.norm. For PEFT's decoder-only models, the core model is accessible at model.base_model.model and the final norm is usually accessible at model.base_model.model.model.norm.

We can read the parent-most class name to identify whether a model is from PEFT or not. One advantage with this approach is that it allows any adaptations in the path to the final norm of a Transformers model to still be found in the PEFT version of that model.

### Description This PR replaces any references to `NvTensorRtRtx` with `trt-rtx` except in the GenAI config file. ### Motivation and Context This abbreviation reduces [potential bugs](#1687 (comment)) that can be raised while using the EP name in the model builder. It also preserves the original intention to keep EP names short and brief.

### Description This PR adds a tutorial to show how to create the ONNX models for Gemma-3 vision (4B, 12B, 27B). ### Motivation and Context This PR requires the changes from the following PRs. - #1374 - #1687 - #1701 - #1786 This PR also resolves the following issues. - #1329 - #1536 - #1655 - #1698

Fix accessing final norm for Gemma-3 models

0cbb5b1

kunal-vaishnavi added the 0.9.1 label Aug 18, 2025

github-advanced-security AI found potential problems Aug 18, 2025

View reviewed changes

Comment thread src/python/py/models/builder.py Fixed

nenad1002 previously approved these changes Aug 19, 2025

View reviewed changes

Comment thread src/python/py/models/builder.py

Comment thread src/python/py/models/builder.py

Update documentation for final norm paths

18914e5

kunal-vaishnavi dismissed nenad1002’s stale review via 18914e5 August 19, 2025 20:37

kunal-vaishnavi enabled auto-merge (squash) August 20, 2025 04:33

nenad1002 approved these changes Aug 20, 2025

View reviewed changes

kunal-vaishnavi merged commit bee5dca into main Aug 20, 2025
14 checks passed

kunal-vaishnavi deleted the kvaishnavi/gemma3-v4.55 branch August 20, 2025 16:43

kunal-vaishnavi mentioned this pull request Sep 16, 2025

Use abbreviation for TensorRT RTX EP #1763

Merged

kunal-vaishnavi mentioned this pull request Sep 25, 2025

Add Gemma-3 vision tutorial to ONNX Runtime GenAI #1793

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix accessing final norm for Gemma-3 models#1687

Fix accessing final norm for Gemma-3 models#1687
kunal-vaishnavi merged 2 commits into
mainfrom
kvaishnavi/gemma3-v4.55

kunal-vaishnavi commented Aug 18, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

kunal-vaishnavi commented Aug 18, 2025

Description

Motivation and Context

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants