
Conversation

@remi-or remi-or (Collaborator) commented May 28, 2025

This PR fixes several failing Janus model tests on MI3XX:

  • an AttributeError raised when retrieving the BOI token from a generation_kwargs attribute that is not guaranteed to exist on the config;
  • three multi-device errors (the device map started with the wrong module, a module was split when it should not be, and inputs ended up on different devices);
  • a test failure caused by numerical differences, fixed with Expectations.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Comment on lines 1239 to 1244
# Get BOI token ID
if hasattr(generation_config, "generation_kwargs"):
boi_token_id = generation_config.generation_kwargs.get("boi_token_id", generation_config.bos_token_id)
else:
boi_token_id = kwargs.get("boi_token_id", generation_config.bos_token_id)
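For illustration, the fallback behavior of this guard can be sketched in isolation. The FakeGenerationConfig below is a hypothetical minimal stub, not the real transformers class; only the attributes the guard touches are modeled.

```python
# Minimal stub standing in for transformers' GenerationConfig.
class FakeGenerationConfig:
    def __init__(self, bos_token_id, generation_kwargs=None):
        self.bos_token_id = bos_token_id
        # Only set the attribute when the checkpoint actually carried it,
        # mirroring configs where `generation_kwargs` is absent entirely.
        if generation_kwargs is not None:
            self.generation_kwargs = generation_kwargs

def get_boi_token_id(generation_config, **kwargs):
    # Same logic as the diff: prefer generation_kwargs, then explicit
    # kwargs, falling back to bos_token_id in both branches.
    if hasattr(generation_config, "generation_kwargs"):
        return generation_config.generation_kwargs.get(
            "boi_token_id", generation_config.bos_token_id
        )
    return kwargs.get("boi_token_id", generation_config.bos_token_id)

# Checkpoint with the key present: the stored id wins.
print(get_boi_token_id(FakeGenerationConfig(100000, {"boi_token_id": 100016})))  # 100016
# Checkpoint without generation_kwargs: falls back to bos_token_id.
print(get_boi_token_id(FakeGenerationConfig(100000)))  # 100000
```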

Contributor

Hi @remi-or, the other changes look logical to me, thanks for fixing them 🤗. Can you expand on why boi_token_id won't be present in generation_kwargs? AFAIK I added it explicitly in the conversion file, so it should be present in the checkpoints 🤔

Collaborator Author

Hi @yaswanth19, it seems it is missing from the checkpoint used in testing (deepseek-community/Janus-Pro-1B).


Member

same question

Collaborator Author

It seems like generation_config is either (1) not loaded, which would be odd because guidance_scale is equal to 5 (though I don't know if that is the default), or (2) its generation_kwargs attribute is dropped at some point. Before model.generate is called in the test, checking the value of model.generation_config gives:

model.generation_config = GenerationConfig {
  "bos_token_id": 100000,
  "eos_token_id": 100001,
  "pad_token_id": 100002
}
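To see why that state crashes the pre-PR code, here is a minimal reproduction with a SimpleNamespace standing in for the loaded config shown above (a hypothetical stand-in, not the real GenerationConfig class):

```python
from types import SimpleNamespace

# Stand-in mirroring the loaded config above: only the three token ids
# are present; `generation_kwargs` never made it through loading.
cfg = SimpleNamespace(bos_token_id=100000, eos_token_id=100001, pad_token_id=100002)

# Unguarded access, as in the pre-PR code, raises AttributeError.
try:
    cfg.generation_kwargs.get("boi_token_id", cfg.bos_token_id)
except AttributeError as err:
    print(type(err).__name__)  # AttributeError

# The hasattr guard from the diff degrades gracefully instead.
if hasattr(cfg, "generation_kwargs"):
    boi_token_id = cfg.generation_kwargs.get("boi_token_id", cfg.bos_token_id)
else:
    boi_token_id = cfg.bos_token_id
print(boi_token_id)  # 100000
```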

Member

@remi-or do you have a snippet reproducing the issue? It would be nice to add it to the PR body as well 🤗

I wonder whether the issue you describe appears only on certain hardware (which is very unlikely) or the inference script is doing something unexpected.

Collaborator Author

Sure! It's taken from tests/models/janus/test_modeling_janus.py::JanusIntegrationTest::test_model_generate_image:

from transformers import AutoProcessor, JanusForConditionalGeneration

if __name__ == "__main__":

    model_id = "deepseek-community/Janus-Pro-1B"
    model = JanusForConditionalGeneration.from_pretrained(model_id, device_map="auto")
    processor = AutoProcessor.from_pretrained(model_id)

    inputs = processor(
        text=["A portrait of young girl. masterpiece, film grained, best quality."],
        padding=True, generation_mode="image", return_tensors="pt",
    ).to(model.device)

    out = model.generate(**inputs, generation_mode="image", do_sample=False)

I tried changing device_map to "cpu" and it still crashed with AttributeError: 'GenerationConfig' object has no attribute 'generation_kwargs', so I don't think it's device-related.

Member

Indeed, weird, since I'd assume the config from the Hub would be picked up; at least that was true for Whisper in the past. Let me check why this isn't loaded. We'd better make sure the pre-saved config values are used when running inference.

Member

Fixed, the issue was in the config saved on the Hub: one of the flags was set to True, thus overwriting the config values from scratch.

I think the only remaining issue now is multi-device inference. @remi-or can you update the PR so we can merge?

Collaborator Author

Done!

@yaswanth19 (Contributor)

CC: @zucchini-nlp


@zucchini-nlp (Member) left a comment

Great thanks!

@zucchini-nlp zucchini-nlp merged commit 037acf1 into huggingface:main Jun 4, 2025
14 checks passed
@remi-or remi-or mentioned this pull request Jun 4, 2025
mht-sharma pushed a commit that referenced this pull request Jun 4, 2025
* [janus] Fix failing tests on mi3XX (#38426)

* Fix multiple devices error on Janus

* Fix AttributeError on Janus BOI token

* Initialize lm first in Janus to get correct device map

* Added expectations for Janus test_model_generate_images

* Fixed JanusVisionEncoderLayer being split across devices

* Code formatting

* Adding modeling file

* Reverted changes out of scope for this PR

* [seamless_m4t] Skip some tests when speech is not available (#38430)

* Added the require_speech decorator

* Added require_speech to some seamless_m4t tests

* Changed skip message
bvantuan pushed a commit to bvantuan/transformers that referenced this pull request Jun 12, 2025