-
-
Notifications
You must be signed in to change notification settings - Fork 11.1k
Update transformers to v4.55
#21931
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
WoosukKwon
merged 32 commits into
vllm-project:main
from
hmellor:update-transformers-4-54
Aug 6, 2025
Merged
Update transformers to v4.55
#21931
Changes from all commits
Commits
Show all changes
32 commits
Select commit
Hold shift + click to select a range
a742162
Update transformers to `v4.54.1`
hmellor 97d7f25
Use public method to set attn implementation in Transformers backend
hmellor fa697f5
Fix MPT
hmellor 24bb2c4
Exaone is a remote model
hmellor ecebd0c
Fix solar
hmellor ad38ae2
Fix telechat
hmellor d439137
Fix skywork
hmellor 1dcf9f4
Fix hunyuan
hmellor 30bdcde
spaces
hmellor 3ce689f
Merge branch 'main' into update-transformers-4-54
DarkLight1337 36621f4
Drop `min_transformers_version="4.53"`
DarkLight1337 c305846
Fix duplicated code
DarkLight1337 59cd39e
Revert telechat2 to how it is on main
hmellor 0af4810
Revert public method as it's too brittle to use for our purposes righ…
hmellor f286825
Merge branch 'main' into update-transformers-4-54
hmellor a6d56b9
Merge branch 'main' into update-transformers-4-54
hmellor 1c2570d
Merge branch 'main' into update-transformers-4-54
hmellor d5ab6f9
fix ovis
Isotr0py ce19024
fix tarsier2 processing
Isotr0py 7f32eb6
Fix type hint in `replace_linear_class`
hmellor 462ebc7
Fix `tp_plan` retrieval in Transformers backend
hmellor 3d9754a
Fix basic Models Test
hmellor 800edac
Fix pipeline parallel test
hmellor afe9f80
Handle `base_model_tp_plan` being explicitly `None`
hmellor d819ce4
Cap transformers version for custom models which are now broken
hmellor ba55881
Merge branch 'main' into update-transformers-4-54
hmellor f306e75
disable fuyu temporarily
Isotr0py bfa2f3f
syntax error
hmellor 05240fd
Ficx quantization tests
hmellor 3b61cf7
Update to 4.55
hmellor 4104f9d
Remove 4.55 min version as that's what we're on now
hmellor 0a6ff09
Merge branch 'main' into update-transformers-4-54
WoosukKwon File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My point is that vllm correctly loads and executes these models, but when performing accuracy tests compared to hf, it requires hf Implementation 4.53
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I understand, but vLLM's CI is not set up to arbitrarily change dependency versions at test time for reference models which are unmaintained.
The best course of action would be to make PRs in
Qwen/Qwen2.5-Math-RM-72BandQwen/Qwen2.5-Math-PRM-7Bto fix them, but we're not going to block upgrading the Transformers pin waiting for those.