
update model_config for granite 4 models#821

Merged
joerunde merged 2 commits intotorch-spyre:mainfrom
tjohnson31415:granite-4-dense
Mar 11, 2026

Conversation

Collaborator

@tjohnson31415 tjohnson31415 commented Mar 10, 2026

Description

Updates the model configuration for the Granite 4 dense models, including support for checkpoints that use the `granite` model type (instead of `granitemoehybrid`).
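For context, a minimal sketch of the kind of model-type-keyed override table such a change touches. The dict name and the override key are illustrative assumptions, not the torch-spyre plugin's actual API:

```python
# Hypothetical sketch of a model-type-keyed override table. The dict name
# and the override key are illustrative only, not this plugin's actual API.
MODEL_CONFIG_OVERRIDES = {
    # Granite 4 MoE-hybrid checkpoints
    "granitemoehybrid": {"attn_implementation": "sdpa"},
}

# Granite 4 dense checkpoints now report model type "granite", so they get
# their own entry carrying the same overrides as the hybrid variant.
MODEL_CONFIG_OVERRIDES["granite"] = dict(MODEL_CONFIG_OVERRIDES["granitemoehybrid"])


def overrides_for(model_type: str) -> dict:
    """Return the config overrides for a model type, or an empty dict."""
    return MODEL_CONFIG_OVERRIDES.get(model_type, {})
```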

Related Issues

Test Plan

Checklist

  • I have read the contributing guidelines
  • My code follows the project's code style (run bash format.sh)
  • I have added tests for my changes (if applicable)
  • I have updated the documentation (if applicable)
  • My commits include a Signed-off-by: line (DCO compliance)

Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
@github-actions

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: make sure that your code passes all the linting checks, otherwise your PR cannot be merged. To do so, run ./format.sh.
Now you are good to go 🚀.

We also recommend installing prek and configuring it to check your code before every local commit.

Signed-off-by: Joe Runde <joe@joerun.de>
@joerunde joerunde marked this pull request as ready for review March 10, 2026 22:36
```diff
 {
   "architectures": [
-    "GraniteMoeHybridForCausalLM"
+    "GraniteForCausalLM"
```
Collaborator


This is the diff between the old and new checkpoint configs

```diff
   "tie_word_embeddings": true,
   "torch_dtype": "bfloat16",
-  "transformers_version": "4.56.0",
+  "transformers_version": "4.53.3",
```
Collaborator


Would the different transformers versions cause any issues between the 2 variants?

Collaborator


I don't know the answer :/

It's probably not too relevant for us as we're not using transformers to load the model
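That point can be illustrated with a hypothetical helper (not a function from this repo): if the checkpoint's config.json is read directly rather than through transformers, the pinned transformers_version is informational only.

```python
import json
from pathlib import Path


def read_checkpoint_config(checkpoint_dir: str) -> dict:
    """Read a Hugging Face-style config.json directly, without transformers.

    Loading the config this way means the "transformers_version" recorded in
    the checkpoint is purely informational for the consumer. (Hypothetical
    helper for illustration; not a function from this repo.)
    """
    return json.loads((Path(checkpoint_dir) / "config.json").read_text())
```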

Comment on lines +66 to +67

```python
# This is really a dense model, but it has model type "granitemoehybrid"
# It has the same overrides as the regular dense variant
```
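As a hypothetical illustration of how a loader could guard against such mislabeled checkpoints. The field names (`num_local_experts` / `num_experts`) follow common Hugging Face MoE configs and are assumptions here, not fields confirmed for these checkpoints:

```python
def is_effectively_dense(config: dict) -> bool:
    """Heuristic check for a dense model hiding behind an MoE model type.

    Field names ("num_local_experts" / "num_experts") follow common Hugging
    Face MoE configs; they are assumptions for illustration, not fields
    confirmed by this PR's checkpoints.
    """
    experts = config.get("num_local_experts", config.get("num_experts", 1))
    return experts <= 1
```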
Collaborator


This makes me think that there might have been a mistake when creating the new checkpoint 🤔

Collaborator


yeah it's kinda unclear how things evolved here, probably part of the reason why these configs haven't quite yet landed on hf hub

Collaborator

@joerunde joerunde left a comment


Confirmed this loads new checkpoints correctly

@joerunde joerunde merged commit 906db8a into torch-spyre:main Mar 11, 2026
12 checks passed


3 participants