fix(flaky): `test_generate_with_and_without_position_ids` in GLM ORC by tarekziade · Pull Request #44173 · huggingface/transformers

tarekziade · 2026-02-20T09:28:48Z

Fixes flaky GLM OCR generation behavior when 2D position_ids are passed explicitly.

Reproducible locally with:

pytest tests/models/glm_ocr/test_modeling_glm_ocr.py::GlmOcrModelTest::test_generate_with_and_without_position_ids --flake-finder --flake-runs=500

Fix

We skip 2D from GenerationTesterMixin.test_generate_with_and_without_position_ids when model uses get_rope_index

15 models impacted

glm_ocr
glm4v
glm46v,
glm_image
qwen2_vl
qwen2_5_vl
qwen2_5_omni
qwen3_vl
qwen3_vl_moe
qwen3_5
qwen3_5_moe
qwen3_omni_moe
paddleocr_vl
ernie4_5_vl_moe
glm4v_moe

HuggingFaceDocBuilderDev · 2026-02-20T09:38:00Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

tarekziade · 2026-02-20T10:13:05Z

run-slow: glm_ocr

github-actions · 2026-02-20T10:14:18Z

Workflow Run ⚙️

This comment contains run-slow, running the specified jobs:

models: ["models/glm_ocr"]
quantizations: []

github-actions · 2026-02-20T11:26:11Z

CI Results

Workflow Run ⚙️

Commit Info

Context	Commit	Description
RUN	969af025	workflow commit (merge commit)
PR	7d35dffb	branch commit (from PR)
main	00cc937c	base commit (on `main`)

✅ No failing test specific to this PR 🎉 👏 !

Changes: 1. src/transformers/models/glm_ocr/modeling_glm_ocr.py:1431 - In prepare_inputs_for_generation, normalize user-provided 2D position_ids into packed multimodal format. - On cache continuation, only reuse rope_deltas when cache length is non-zero (fixes stale-delta reuse across sequential generate() calls). 2. tests/models/glm_ocr/test_modeling_glm_ocr.py:166 - Avoid random accidental pad tokens in synthetic input IDs by remapping pad_token_id in random portions (0 -> 1), reducing test instability from pad-derived masks.

github-actions · 2026-02-20T13:17:00Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: glm_ocr

tarekziade · 2026-02-20T13:23:52Z

run-slow: glm_ocr

github-actions · 2026-02-20T13:25:05Z

Workflow Run ⚙️

This comment contains run-slow, running the specified jobs:

models: ["models/glm_ocr"]
quantizations: []

…rate_with_and_without_position_ids`

tarekziade · 2026-02-20T13:55:45Z

run-slow: glm_ocr now, glm4v, glm46v, glm_image, qwen2_vl, qwen2_5_vl, qwen2_5_omni, qwen3_vl, qwen3_vl_moe, qwen3_5, qwen3_5_moe, qwen3_omni_moe, paddleocr_vl, ernie4_5_vl_moe, glm4v_moe

tarekziade · 2026-02-20T13:59:36Z

run-slow: glm_ocr, glm4v, glm46v, glm_image, qwen2_vl, qwen2_5_vl, qwen2_5_omni, qwen3_vl, qwen3_vl_moe, qwen3_5, qwen3_5_moe, qwen3_omni_moe, paddleocr_vl, ernie4_5_vl_moe, glm4v_moe

tarekziade · 2026-02-20T14:07:40Z

note that the idefics failure is yet another different flake I will fix in a separate branch

tarekziade · 2026-02-20T14:14:44Z

run-slow: glm_ocr, glm4v, glm46v

zucchini-nlp

Thanks!

zucchini-nlp · 2026-02-20T14:15:22Z

+            if has_3d_rope_positions:
+                continue


nit: we can skip right away because usually all models in the set will be the same arch

github-actions · 2026-02-20T14:29:08Z

CI Results

Workflow Run ⚙️

Commit Info

Context	Commit	Description
RUN	32d35e03	workflow commit (merge commit)
PR	893ab53a	branch commit (from PR)
main	8151000f	base commit (on `main`)

⚠️ No test being reported (jobs are skipped or cancelled)!

github-actions · 2026-02-20T14:30:26Z

Workflow Run ⚙️

This comment contains run-slow, running the specified jobs:

models: ["models/glm46v", "models/glm4v", "models/glm_ocr"]
quantizations: []

github-actions · 2026-02-20T16:02:39Z

CI Results

Workflow Run ⚙️

Commit Info

Context	Commit	Description
RUN	25f42d53	workflow commit (merge commit)
PR	2a4e58e4	branch commit (from PR)
main	32deb4c3	base commit (on `main`)

✅ No failing test specific to this PR 🎉 👏 !

tarekziade self-assigned this Feb 20, 2026

tarekziade requested a review from Rocketknight1 February 20, 2026 09:28

tarekziade marked this pull request as draft February 20, 2026 09:31

tarekziade marked this pull request as ready for review February 20, 2026 10:00

tarekziade mentioned this pull request Feb 20, 2026

chore(typing): initial ty integration #44167

Merged

zucchini-nlp reviewed Feb 20, 2026

View reviewed changes

Comment thread src/transformers/models/glm_ocr/modeling_glm_ocr.py Outdated

tarekziade added 4 commits February 20, 2026 13:39

moved the fix to modular

af2ea60

simplify code

875c8a0

dont tweak position ids here

cfc1321

tarekziade marked this pull request as draft February 20, 2026 12:57

improve patch so we don't alter position ids in the wrong path

893ab53

tarekziade force-pushed the tarekziade-fix-glm-ocr-flakiness branch from 7d35dff to 893ab53 Compare February 20, 2026 13:15

zucchini-nlp reviewed Feb 20, 2026

View reviewed changes

Comment thread src/transformers/models/glm_ocr/modeling_glm_ocr.py Outdated

reverted initial fix and skip 2D from GenerationTesterMixin.test_gene…

34e24ba

…rate_with_and_without_position_ids`

tarekziade marked this pull request as ready for review February 20, 2026 13:55

Merge branch 'main' into tarekziade-fix-glm-ocr-flakiness

2a4e58e

tarekziade requested a review from zucchini-nlp February 20, 2026 13:59

zucchini-nlp approved these changes Feb 20, 2026

View reviewed changes

Merge branch 'main' into tarekziade-fix-glm-ocr-flakiness

ec02c63

tarekziade merged commit a3bfc8b into main Feb 20, 2026
26 of 27 checks passed

tarekziade deleted the tarekziade-fix-glm-ocr-flakiness branch February 20, 2026 19:06

Conversation

tarekziade commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Fix

15 models impacted

Uh oh!

HuggingFaceDocBuilderDev commented Feb 20, 2026

Uh oh!

tarekziade commented Feb 20, 2026

Uh oh!

github-actions bot commented Feb 20, 2026

Uh oh!

github-actions bot commented Feb 20, 2026

CI Results

Commit Info

Uh oh!

Uh oh!

github-actions bot commented Feb 20, 2026

Uh oh!

tarekziade commented Feb 20, 2026

Uh oh!

github-actions bot commented Feb 20, 2026

Uh oh!

Uh oh!

tarekziade commented Feb 20, 2026

Uh oh!

tarekziade commented Feb 20, 2026

Uh oh!

tarekziade commented Feb 20, 2026

Uh oh!

tarekziade commented Feb 20, 2026

Uh oh!

zucchini-nlp left a comment

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Feb 20, 2026

CI Results

Commit Info

Uh oh!

github-actions bot commented Feb 20, 2026

Uh oh!

github-actions bot commented Feb 20, 2026

CI Results

Commit Info

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tarekziade commented Feb 20, 2026 •

edited

Loading