Conversation
Signed-off-by: 李少鹏 <lishaopeng21@huawei.com>
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
Code Review
This pull request addresses a precision issue with the mrope fusion operator on Ascend NPUs. The changes involve removing a workaround that disabled the operator on x86 platforms, indicating the underlying precision problem has been resolved for the specific qwen3vl model configuration. Additionally, an unused import is cleaned up, and a call to .contiguous() is added for the positions tensor before passing it to the npu_mrope operator. This is a good defensive measure to ensure correctness by providing a contiguous tensor as expected by the kernel. The changes are logical and well-contained.
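To illustrate why the added `.contiguous()` call matters: a view such as a transpose or slice can share storage with the original tensor while its strides no longer describe row-major memory, and a fused kernel that reads raw contiguous memory would then see elements in the wrong order. The sketch below demonstrates the general pattern with NumPy for portability (the actual change calls `Tensor.contiguous()` on the torch `positions` tensor before `npu_mrope`); `np.ascontiguousarray` plays the role of `.contiguous()` here.

```python
import numpy as np

# A transposed view shares storage with the original array but is not
# C-contiguous: its strides no longer match a row-major layout.
positions = np.arange(12, dtype=np.int64).reshape(3, 4).T

assert not positions.flags["C_CONTIGUOUS"]

# A kernel that walks raw row-major memory would read elements in the
# wrong order, so we materialize a contiguous copy first (the NumPy
# analogue of torch's Tensor.contiguous()).
positions_c = np.ascontiguousarray(positions)

assert positions_c.flags["C_CONTIGUOUS"]
assert np.array_equal(positions_c, positions)  # same values, new layout
```

Note that `ascontiguousarray` (like `.contiguous()`) is a no-op when the input is already contiguous, so the defensive call is cheap on the common path.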
Signed-off-by: 李少鹏 <lishaopeng21@huawei.com>
Please cherry-pick this PR to branch v0.11.0-dev.
### What this PR does / why we need it?
The Qwen2.5-VL mrope precision problem will be solved once this PR is merged.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Tested on G8600 with the textVQA dataset.

- vLLM version: v0.11.2
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2

---------

Signed-off-by: 李少鹏 <lishaopeng21@huawei.com>
Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com>