Skip to content

Fix tied weights in weight mapping test for Transformers v5#36788

Merged
DarkLight1337 merged 1 commit intovllm-project:mainfrom
hmellor:fix-weight-mapping-test
Mar 11, 2026
Merged

Fix tied weights in weight mapping test for Transformers v5#36788
DarkLight1337 merged 1 commit intovllm-project:mainfrom
hmellor:fix-weight-mapping-test

Conversation

@hmellor
Copy link
Copy Markdown
Member

@hmellor hmellor commented Mar 11, 2026

Remove the workaround in create_dummy_model as huggingface/transformers#43522 is now resolved

Add a new workaround for tied weights in Transformers v5 when:

  • The tied weight is in the checkpoint (and therefore in the ref_weight_names derived from the safetensors file)
  • tie_word_embeddings was not present in the config in Transformers v4
  • tie_word_embeddings is now present in the config in Transformers v5 and is True
  • The newly tied weights are no longer present in named_parameters, leading to missing_weights containing whatever was not correctly tied before

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@mergify mergify bot added the multi-modality Related to multi-modality (#4194) label Mar 11, 2026
@DarkLight1337 DarkLight1337 enabled auto-merge (squash) March 11, 2026 13:19
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 11, 2026
@DarkLight1337 DarkLight1337 merged commit d5816c8 into vllm-project:main Mar 11, 2026
18 checks passed
@hmellor hmellor deleted the fix-weight-mapping-test branch March 11, 2026 15:12
wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
fxdawnn pushed a commit to fxdawnn/vllm that referenced this pull request Mar 19, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
khairulkabir1661 pushed a commit to khairulkabir1661/vllm that referenced this pull request Mar 27, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
vrdn-23 pushed a commit to vrdn-23/vllm that referenced this pull request Mar 30, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Vinay Damodaran <vrdn@hey.com>
EricccYang pushed a commit to EricccYang/vllm that referenced this pull request Apr 1, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: EricccYang <yangyang4991@gmail.com>
liuchenbing2026 pushed a commit to liuchenbing2026/vllm that referenced this pull request Apr 4, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
big-yellow-duck pushed a commit to EmbeddedLLM/vllm that referenced this pull request Apr 8, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
mtparet pushed a commit to blackfuel-ai/vllm that referenced this pull request Apr 9, 2026
…ject#36788)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

multi-modality Related to multi-modality (#4194) ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants