[Bugfix] Implement multimodal_cpu_fields in model runner by zhangxinyuehfad · Pull Request #5196 · vllm-project/vllm-ascend

zhangxinyuehfad · 2025-12-19T08:44:46Z

What this PR does / why we need it?

Related to #4084. Implements multimodal_cpu_fields in the model runner.

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

This pull request introduces a mechanism to ensure specific multimodal data fields are on the CPU before being used by the model, which is a good refactoring of model-specific logic into a more general component. My main feedback is to make the list of CPU-bound fields configurable rather than hardcoded to improve maintainability and support for future models.
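The mechanism described in this review can be illustrated with a minimal sketch. It is not the code from this PR: `FakeTensor`, `keep_fields_on_cpu`, and the field names `pixel_values` and `image_grid_thw` are assumptions chosen for illustration, and a real model runner would presumably operate on `torch.Tensor` objects rather than a stand-in class.

```python
from dataclasses import dataclass, replace
from typing import Any, Dict, Iterable


@dataclass(frozen=True)
class FakeTensor:
    """Stand-in for a tensor (assumption); only tracks its device."""
    name: str
    device: str

    def to(self, device: str) -> "FakeTensor":
        # Mirrors the tensor.to(device) calling convention.
        return replace(self, device=device)


def keep_fields_on_cpu(mm_kwargs: Dict[str, Any],
                       cpu_fields: Iterable[str]) -> Dict[str, Any]:
    """Return a copy of mm_kwargs with the listed fields moved to the CPU."""
    cpu_fields = set(cpu_fields)
    out: Dict[str, Any] = {}
    for key, value in mm_kwargs.items():
        # Only touch fields that were requested and that can be moved.
        if key in cpu_fields and hasattr(value, "to"):
            value = value.to("cpu")
        out[key] = value
    return out


# Hypothetical multimodal inputs that start out on an NPU device.
mm_kwargs = {
    "pixel_values": FakeTensor("pixel_values", "npu:0"),
    "image_grid_thw": FakeTensor("image_grid_thw", "npu:0"),
}
moved = keep_fields_on_cpu(mm_kwargs, cpu_fields={"image_grid_thw"})
print({k: v.device for k, v in moved.items()})
```

Passing the field list as an argument, rather than hardcoding it inside the function, is one way to address the maintainability concern raised in the review.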

github-actions · 2025-12-19T12:13:49Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

shen-shanshan · 2025-12-19T14:46:08Z

Please delete the whole file patch_qwen3_vl.py.

shen-shanshan · 2025-12-19T14:52:04Z

According to vllm-project/vllm#28168, I suppose multimodal_cpu_fields is different for every model, so we should not use a single multimodal_cpu_fields for all MM models.

Besides, we should avoid letting users manually configure this by passing an additional_config, which could lead to an awful user experience.

shen-shanshan · 2025-12-19T14:56:56Z

Currently, multimodal_cpu_fields is only used by the Qwen series VL models, and its value is the same for all of them. Thus, your solution could work temporarily, but I don't think implementing the function this way is a good choice for the long term.
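The per-model alternative raised in this thread could look roughly like the sketch below. It assumes each model class declares its own `multimodal_cpu_fields` as a class attribute; the class names, the field names, and the helper `cpu_fields_for` are hypothetical and are not taken from vllm or vllm-ascend.

```python
from typing import ClassVar, FrozenSet


class QwenLikeVLModel:
    # Hypothetical model that declares which inputs must stay on the CPU.
    multimodal_cpu_fields: ClassVar[FrozenSet[str]] = frozenset(
        {"image_grid_thw", "video_grid_thw"})


class OtherMMModel:
    # Hypothetical model that declares nothing.
    pass


def cpu_fields_for(model: object) -> FrozenSet[str]:
    """Look up the model's own CPU field set, defaulting to empty."""
    return frozenset(getattr(model, "multimodal_cpu_fields", ()))


print(sorted(cpu_fields_for(QwenLikeVLModel())))
print(sorted(cpu_fields_for(OtherMMModel())))
```

With a lookup like this, the runner would need neither a single shared list nor a user-supplied additional_config entry, which matches the two concerns above.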

Signed-off-by: hfadzxy <starmoon_zhang@163.com>

shen-shanshan · 2025-12-22T03:49:11Z

LGTM if all CI passes.

Note that we need to remove this logic after this field is deprecated in vLLM.

zhangxinyuehfad · 2025-12-22T06:35:46Z

> Please delete the whole file patch_qwen3_vl.py.

Done.

zhangxinyuehfad · 2025-12-22T06:41:18Z

> According to vllm-project/vllm#28168, I suppose multimodal_cpu_fields is different for every model, so we should not use a single multimodal_cpu_fields for all MM models.
>
> Besides, we should avoid letting users manually configure this by passing an additional_config, which could lead to an awful user experience.

Yes, I deleted multimodal_cpu_fields from additional_config.

shen-shanshan · 2025-12-22T09:48:32Z

CC @wangxiyuan All CI passed.

…t#5196)

### What this PR does / why we need it?
Related to vllm-project#4084
Implement multimodal_cpu_fields in model runner

- vLLM version: v0.12.0
- vLLM main: vllm-project/vllm@ad32e3e

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

vllm-ascend-ci added the ready and ready-for-test labels Dec 19, 2025

gemini-code-assist bot reviewed Dec 19, 2025

Comment thread vllm_ascend/worker/model_runner_v1.py Outdated

zhangxinyuehfad force-pushed the zxy_fix_cpu_tens branch from 0c72cb7 to a57d5f8 December 19, 2025 09:09

github-actions bot added the module:core label Dec 19, 2025

shen-shanshan mentioned this pull request Dec 19, 2025

[RFC]: Remove VL Modeling Files #4084

Closed

17 tasks

zhangxinyuehfad force-pushed the zxy_fix_cpu_tens branch from a57d5f8 to 70a2ddc December 19, 2025 14:51

zhangxinyuehfad force-pushed the zxy_fix_cpu_tens branch from 70a2ddc to 451667a December 22, 2025 02:34

[Bugfix] Implement multimodal_cpu_fields in model runner

721287c

Signed-off-by: hfadzxy <starmoon_zhang@163.com>

zhangxinyuehfad force-pushed the zxy_fix_cpu_tens branch from 451667a to 721287c December 22, 2025 02:50

shen-shanshan added the ready and ready-for-test labels and removed the ready and ready-for-test labels Dec 22, 2025

wangxiyuan approved these changes Dec 22, 2025

wangxiyuan merged commit 61efaff into vllm-project:main Dec 22, 2025
46 of 59 checks passed

wangxiyuan mentioned this pull request Dec 22, 2025

[CustomOp] Register AscendApplyRotaryEmb CustomOp and remove related patch #4667

Merged

zhangxinyuehfad deleted the zxy_fix_cpu_tens branch March 19, 2026 02:08

wangxiyuan merged 1 commit into vllm-project:main from zhangxinyuehfad:zxy_fix_cpu_tens

4 participants
