Fixes configuration default values by zucchini-nlp · Pull Request #43592 · huggingface/transformers

zucchini-nlp · 2026-01-29T12:42:02Z

What does this PR do?

Adds missing pad_token_id and tie_word_embeddings to config classes with their defaults

HuggingFaceDocBuilderDev · 2026-01-29T12:51:34Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp · 2026-01-29T14:09:07Z

Adding fix to tie_word_embeddings, don't merge!

zucchini-nlp · 2026-01-29T14:48:15Z

run-slow: cohere2, deformable_detr, emu3, exaone4, falcon_mamba, fast_vlm, flava, florence2, glm46v, got_ocr2, gpt_bigcode, gpt_neox, gptj, internvl, jetmoe, mamba

github-actions · 2026-01-29T14:49:42Z

This comment contains run-slow, running the specified jobs:

models: ["models/cohere2", "models/deformable_detr", "models/emu3", "models/exaone4", "models/falcon_mamba", "models/fast_vlm", "models/flava", "models/florence2", "models/glm46v", "models/got_ocr2", "models/gpt_bigcode", "models/gpt_neox", "models/gptj", "models/internvl", "models/jetmoe", "models/mamba"]
quantizations: []

Rocketknight1

LGTM with one comment!

src/transformers/models/cohere2/configuration_cohere2.py

github-actions · 2026-01-29T15:23:47Z

CI Results

Workflow Run ⚙️

✅ No failing test specific to this PR 🎉 !

zucchini-nlp · 2026-01-29T16:22:01Z

@bot /style

zucchini-nlp · 2026-01-29T16:58:13Z

Deformable detr is flaky now, apparently related to the random order of tests 😢 Not reproducible locally if I run a single testcase

zucchini-nlp · 2026-01-29T16:58:19Z

@bot /repo

github-actions · 2026-01-29T16:58:53Z

Repo. Consistency bot fixed some files and pushed the changes.

github-actions · 2026-01-30T10:18:57Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: cohere2, cohere2_vision, deepseek_vl, deepseek_vl_hybrid, deformable_detr, emu3, exaone4, falcon_mamba, fast_vlm, flava, florence2, glm46v, got_ocr2, gpt_bigcode, gpt_neox, gptj

zucchini-nlp · 2026-01-30T10:40:41Z

run-slow: llava_onevision, llava_next_video

github-actions · 2026-01-30T10:41:52Z

This comment contains run-slow, running the specified jobs:

models: ["models/llava_next_video", "models/llava_onevision"]
quantizations: []

github-actions · 2026-01-30T10:58:51Z

CI Results

Workflow Run ⚙️

✅ No failing test specific to this PR 🎉 !

fixes it finally I hope for all models!

239a652

zucchini-nlp requested a review from Rocketknight1 January 29, 2026 12:50

Merge branch 'main' into pad-token-ids

002c656

zucchini-nlp added the for patch Tag issues / labels that should be included in the next patch label Jan 29, 2026

hmellor mentioned this pull request Jan 29, 2026

Update to transformers v5 vllm-project/vllm#30566

Open

zucchini-nlp changed the title ~~Fixes 'pad_token_id' issues~~ Fixes configuration default values Jan 29, 2026

tie word embeddings!

e01b712

Rocketknight1 approved these changes Jan 29, 2026

View reviewed changes

src/transformers/models/cohere2/configuration_cohere2.py Show resolved Hide resolved

zucchini-nlp mentioned this pull request Jan 29, 2026

Fix: Add missing pad_token_id to StableLmConfig #43573

Closed

3 tasks

zucchini-nlp added 2 commits January 29, 2026 16:57

vision LLMs

64d77ef

Merge branch 'main' into pad-token-ids

aed3cef

Apply repo consistency fixes

1361169

github-actions bot and others added 5 commits January 29, 2026 17:04

Apply repo consistency fixes

3ae0ac3

revert VLMs

4359584

unpleasant bug

9309c0a

Merge branch 'main' into pad-token-ids

13e58e5

fix repo

de2fcb6

zucchini-nlp enabled auto-merge (squash) January 30, 2026 09:55

skip it

b6246d1

Merge branch 'main' into pad-token-ids

818607f

zucchini-nlp disabled auto-merge January 30, 2026 10:35

llava mismatch

18548e5

zucchini-nlp mentioned this pull request Jan 30, 2026

Fix tie_word_embedding issue for llava_onevision model #43617

Closed

zucchini-nlp enabled auto-merge (squash) January 30, 2026 11:02

zucchini-nlp merged commit 562106f into huggingface:main Jan 30, 2026
26 checks passed

This was referenced Feb 1, 2026

[PR] Fixes configuration default values Sandgarden-Demo/transformers#49

Closed

[PR] Fix tie_word_embedding issue for llava_onevision model Sandgarden-Demo/transformers#61

Closed

Conversation

zucchini-nlp commented Jan 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Jan 29, 2026

Uh oh!

zucchini-nlp commented Jan 29, 2026

Uh oh!

zucchini-nlp commented Jan 29, 2026

Uh oh!

github-actions bot commented Jan 29, 2026

Uh oh!

Rocketknight1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Jan 29, 2026

CI Results

Uh oh!

zucchini-nlp commented Jan 29, 2026

Uh oh!

zucchini-nlp commented Jan 29, 2026

Uh oh!

zucchini-nlp commented Jan 29, 2026

Uh oh!

github-actions bot commented Jan 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 30, 2026

Uh oh!

zucchini-nlp commented Jan 30, 2026

Uh oh!

github-actions bot commented Jan 30, 2026

Uh oh!

github-actions bot commented Jan 30, 2026

CI Results

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zucchini-nlp commented Jan 29, 2026 •

edited

Loading

github-actions bot commented Jan 29, 2026 •

edited

Loading