Use Xet high performance mode for Transformers v5 by hmellor · Pull Request #35098 · vllm-project/vllm

hmellor · 2026-02-23T12:59:16Z

Enable high performance mode for Xet when Transformers v5 is installed, falling back to hf_transfer for Transformers v4. This change optimizes performance based on the installed version of the Transformers library.

…s v5 installed Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

gemini-code-assist

Code Review

This pull request updates the logic for enabling high-performance download modes in huggingface_hub to support both Transformers v4 and v5. The change correctly detects the huggingface_hub version and enables either Xet high-performance mode (for v5) or hf_transfer (for v4). However, the implementation for enabling Xet mode unconditionally sets HF_XET_HIGH_PERFORMANCE to True, which overrides any user-defined environment variable setting. I've suggested a change to respect the user's environment configuration, making it consistent with the existing logic for hf_transfer.

vllm/model_executor/model_loader/weight_utils.py

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

DarkLight1337

Though I'm not sure whether we should force users to enable this mode

hmellor · 2026-02-23T13:18:53Z

In v4 we would force hf_transfer if it was installed, but didn't want to force the extra dependency.

For v5 I think this is safe because all it does is boost the number of CPU cores used in order to saturate network/disk bandwidth while downloading models. (docs). Since it doesn't require an extra dependency and can still be disabled by users who really don't want it, it should be ok.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Andrii Skliar <askliar@nvidia.com>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Use Xet high performance mode instead of hf_transfer if Transformer…

2ade747

…s v5 installed Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor requested a review from 22quinn as a code owner February 23, 2026 12:59

gemini-code-assist bot reviewed Feb 23, 2026

View reviewed changes

vllm/model_executor/model_loader/weight_utils.py Show resolved Hide resolved

Gemini comment

cf3ddcd

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor mentioned this pull request Feb 23, 2026

Improve Transformers v4/v5 compatibility in tokenizers and processors #34768

Closed

DarkLight1337 approved these changes Feb 23, 2026

View reviewed changes

DarkLight1337 enabled auto-merge (squash) February 23, 2026 13:13

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 23, 2026

hmellor mentioned this pull request Feb 23, 2026

Update to transformers v5 #30566

Open

vllm-bot merged commit c4f3869 into vllm-project:main Feb 23, 2026
49 of 51 checks passed

hmellor deleted the v5-xet-high-perf branch February 23, 2026 16:23

llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026

Use Xet high performance mode for Transformers v5 (vllm-project#35098)

91e321d

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026

Use Xet high performance mode for Transformers v5 (vllm-project#35098)

c696923

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Copilot AI pushed a commit to machov/vllm that referenced this pull request Mar 10, 2026

Use Xet high performance mode for Transformers v5 (vllm-project#35098)

e632cb3

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use Xet high performance mode for Transformers v5#35098

Use Xet high performance mode for Transformers v5#35098
vllm-bot merged 2 commits intovllm-project:mainfrom
hmellor:v5-xet-high-perf

hmellor commented Feb 23, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

DarkLight1337 left a comment •

edited

Loading

Uh oh!

hmellor commented Feb 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

hmellor commented Feb 23, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

DarkLight1337 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hmellor commented Feb 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

DarkLight1337 left a comment •

edited

Loading