[Chore] Try remove `init_cached_hf_modules` (#31786)
Conversation
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Code Review
This pull request refactors the initialization of WorkerWrapperBase by removing the vllm_config parameter from its constructor and instead passing it during the init_worker method call. This change is propagated across various executor implementations (UniProcExecutor, MultiprocExecutor, RayExecutor, and test executors) where WorkerWrapperBase is instantiated. Additionally, the init_cached_hf_modules function and its calls are removed from vllm/utils/import_utils.py and worker initialization logic in gpu_worker.py and tpu_worker.py, indicating a change in how Hugging Face modules are handled. The execute_model method in WorkerWrapperBase is also simplified by removing *args and **kwargs.
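To make the shape of the refactor concrete, here is a minimal, self-contained sketch (not vLLM's actual code — the `VllmConfig` stand-in and method bodies are illustrative) of moving a config argument out of a wrapper's constructor and into its `init_worker` call, so executors can construct the wrapper before the config is available:

```python
# Hypothetical sketch of the refactor described above; class and field
# names mirror the PR summary, but the bodies are placeholders.

class VllmConfig:
    """Stand-in for vLLM's engine configuration object."""

    def __init__(self, model: str = "dummy-model") -> None:
        self.model = model


class WorkerWrapperBase:
    # Before the refactor the constructor took vllm_config directly:
    #     def __init__(self, vllm_config): self.vllm_config = vllm_config
    def __init__(self, rpc_rank: int = 0) -> None:
        self.rpc_rank = rpc_rank
        self.vllm_config = None  # now supplied later, via init_worker()

    def init_worker(self, vllm_config: VllmConfig) -> None:
        # The config arrives with the init call, so executors can build
        # the wrapper first and pass the config when the worker starts.
        self.vllm_config = vllm_config

    def execute_model(self, scheduler_output):
        # Simplified signature: no *args/**kwargs pass-through.
        assert self.vllm_config is not None, "call init_worker() first"
        return f"ran {self.vllm_config.model} on {scheduler_output}"


wrapper = WorkerWrapperBase(rpc_rank=0)
wrapper.init_worker(VllmConfig(model="opt-125m"))
print(wrapper.execute_model("batch-0"))
```

Each executor that previously did `WorkerWrapperBase(vllm_config)` would instead construct the wrapper bare and forward the config through its worker-initialization RPC.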
Seems ok to remove
### What this PR does / why we need it?

Upgrade vllm commit to 0109 (bde38c11df0ea066a740efe9b77fff5418be45df):

1. Remove `init_cached_hf_modules` due to vllm-project/vllm#31786
2. Fix spec_decode e2e test due to vllm-project/vllm#29821 breakage
3. Fix `vllm.v1.attention.backends.utils` due to vllm-project/vllm#31891
4. Fix `self.seq_lens - query_lens` to be on the same device due to vllm-project/vllm#31773
5. Skip model_runner_v2 e2e test due to `'_OpNamespace' '_C' object has no attribute 'get_cuda_view_from_cpu_tensor'`

- vLLM version: v0.13.0
- vLLM main: vllm-project/vllm@2f4e654

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
Purpose

See if we can remove `init_cached_hf_modules` now, which would simplify the initialization of the worker wrapper inside the executor.

Test Plan

Test Result
Essential Elements of an Effective PR Description Checklist

Update `supported_models.md` and `examples` for a new model.