[Bugfix] Fix output tensor shape in vanilla_chunked_prefill and update import paths for model_loader #773
Conversation
Signed-off-by: Yizhou Liu <[email protected]>
…oader in NPUModelRunnerBase Signed-off-by: Yizhou Liu <[email protected]>
jianzs left a comment:
We should make sure the vllm-ascend main branch works with vllm versions 0.8.5 and 0.8.5.post1.
Force-pushed from 3e53cb9 to 87badbf.
Looks like the CI run is incomplete; can you trigger it again with a new commit?
Force-pushed from 10a64bc to ef7fb88.
Force-pushed from 046deab to eb8ccd2.
Signed-off-by: Yizhou Liu <[email protected]>
) -> None:
    from vllm.model_executor.model_loader.loader import ShardedStateLoader
    if vllm_version_is("0.8.5") or vllm_version_is("0.8.5.post1"):
        from vllm.model_executor.model_loader.loader import ShardedStateLoader  # type: ignore[import]  # isort: skip  # noqa
I think this is a reasonable way to skip static code checks. We cannot install two versions of vllm in the same Python environment, so I agree with skipping the check for v0.8.5.
Should we ensure the vllm-ascend main branch is compatible with both vllm versions 0.8.5 and 0.8.5.post1?
Yeah, but this only skips the static code-format check for 0.8.5 in the main branch CI. It has no impact on functionality.
Without this check for v0.8.5, running against vLLM v0.8.5 would fall into the else branch and cause a problem.
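For readers following along, here is a minimal sketch of the version-gated import pattern being discussed. Only the 0.8.5/0.8.5.post1 branch is taken from the diff above; the location of `vllm_version_is` and the newer import path in the else branch are assumptions for illustration, not necessarily the exact code in this PR.

```python
# Sketch only: illustrates the version-gated import pattern under discussion.
# The vllm_version_is location and the else-branch import path are assumptions.
from vllm_ascend.utils import vllm_version_is  # assumed helper location

if vllm_version_is("0.8.5") or vllm_version_is("0.8.5.post1"):
    # vllm 0.8.5 / 0.8.5.post1 keep ShardedStateLoader in model_loader.loader;
    # the lint suppressions mirror the snippet in the diff above.
    from vllm.model_executor.model_loader.loader import ShardedStateLoader  # type: ignore[import]  # isort: skip  # noqa
else:
    # Assumed newer location after the model_loader refactor.
    from vllm.model_executor.model_loader import ShardedStateLoader
```

This is the pattern the comment above refers to: without the version check, an import against the newer path would fail on 0.8.5.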
Please merge this pull request ASAP; many other PRs are blocked on it. @MengqingCao @Yikun
What this PR does / why we need it?
Fix the output tensor shape in the vanilla_chunked_prefill function.
Does this PR introduce any user-facing change?
None.
How was this patch tested?
Ran offline inference on DeepSeek models.
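For illustration only, below is the kind of shape mismatch such a fix typically addresses; the tensor names and dimensions are assumptions and this is not the actual vanilla_chunked_prefill code.

```python
# Hypothetical illustration of an output-shape fix; not the PR's actual code.
import torch

num_tokens, num_heads, head_dim = 16, 8, 64
attn_out = torch.randn(num_tokens, num_heads, head_dim)  # per-head attention output

# A caller expecting a flat hidden dimension would reject the 3-D tensor,
# so the output is reshaped to [num_tokens, num_heads * head_dim] before returning.
output = attn_out.reshape(num_tokens, num_heads * head_dim)
assert output.shape == (num_tokens, num_heads * head_dim)
```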