Upgrade to 0.11.1 newest vllm commit by wxsIcey · Pull Request #3762 · vllm-project/vllm-ascend

wxsIcey · 2025-10-25T08:33:22Z

What this PR does / why we need it?

vllm-project/vllm@83f478b

Fix the spec decode rejection sampler, caused by vllm-project/vllm#26060
Fix some imports, caused by vllm-project/vllm#27374
Fix scheduler_config.send_delta_data, caused by #3719
Fix init_with_cudagraph_sizes, caused by vllm-project/vllm#26016
Fix the VL model to account for PatchEmbed's conv3d being replaced with a linear layer, caused by vllm-project/vllm#27418
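
For readers wondering why a conv3d in a patch-embedding layer can be swapped for a linear layer at all: when the convolution kernel covers exactly one patch (kernel size equal to stride, no padding), each output value is a dot product between the flattened kernel and the flattened patch. The sketch below illustrates that equivalence in pure Python on small integer data. It assumes the kernel-equals-stride layout; the helper names (`conv3d_single_patch`, `linear`, `rand_tensor`) are hypothetical and are not taken from vLLM or vllm-ascend.

```python
import random


def rand_tensor(shape, rng):
    """Build a nested list of small random integers with the given shape."""
    if not shape:
        return rng.randint(-3, 3)
    return [rand_tensor(shape[1:], rng) for _ in range(shape[0])]


def flatten(nested):
    """Flatten a nested list in row-major order."""
    if not isinstance(nested, list):
        return [nested]
    flat = []
    for item in nested:
        flat.extend(flatten(item))
    return flat


def conv3d_single_patch(patch, kernel, bias):
    """Conv3d whose kernel covers the whole patch (kernel == stride).

    patch:  nested list shaped [C][T][H][W]
    kernel: nested list shaped [O][C][T][H][W]
    Returns one value per output channel.
    """
    out = []
    for out_kernel, b in zip(kernel, bias):
        acc = b
        for c, channel in enumerate(patch):
            for t, frame in enumerate(channel):
                for h, row in enumerate(frame):
                    for w, value in enumerate(row):
                        acc += out_kernel[c][t][h][w] * value
        out.append(acc)
    return out


def linear(x, weight, bias):
    """Plain linear layer: weight is [O][K], x is [K]."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weight, bias)]


rng = random.Random(0)
C, T, H, W, O = 3, 2, 4, 4, 5  # illustrative sizes only
patch = rand_tensor((C, T, H, W), rng)
kernel = rand_tensor((O, C, T, H, W), rng)
bias = rand_tensor((O,), rng)

# Reshape the conv kernel [O, C, T, H, W] into a linear weight [O, C*T*H*W].
weight = [flatten(k) for k in kernel]

conv_out = conv3d_single_patch(patch, kernel, bias)
linear_out = linear(flatten(patch), weight, bias)
print(conv_out == linear_out)
```

Integers are used so the two results compare exactly; with floating-point weights the outputs would agree only up to rounding.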

Does this PR introduce any user-facing change?

N/A

How was this patch tested?

CI passed with newly added and existing tests.

vLLM version: v0.11.0rc3
vLLM main: vllm-project/vllm@83f478b

github-actions · 2025-10-25T08:38:04Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by other future PRs.
Write the commit message by filling in the PR description to help reviewers and future developers understand.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

Signed-off-by: Icey <1790571317@qq.com>

wxsIcey · 2025-10-27T11:03:07Z

Here we fix spec decoding first; returning logprobs for spec decoding can be future work.

Signed-off-by: Icey <1790571317@qq.com>

wxsIcey · 2025-10-27T13:17:25Z

test_embedding_aclgraph.py is skipped

Signed-off-by: Icey <1790571317@qq.com>

wangxiyuan · 2025-10-28T06:45:17Z

    return max(layer_counts)


+# Update cudagraph capture sizes for vllm config


This may not be correct. I'll look into it further.

wangxiyuan · 2025-10-28T06:54:52Z

+            if vllm_version_is("0.11.0"):
+                if not model_config.is_multimodal_model and \
+                    structured_outputs_config.backend == "auto" and \
+                    not scheduler_config.send_delta_data and \


getattr(scheduler_config, "send_delta_data", False)
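
The suggestion above replaces direct attribute access with `getattr` and a default, so the check still works if the config object no longer defines `send_delta_data`. A minimal sketch of the pattern follows; the `SimpleNamespace` objects are hypothetical stand-ins for a scheduler config, and the helper name `delta_data_enabled` is an assumption for illustration, not part of vLLM or vllm-ascend.

```python
from types import SimpleNamespace


def delta_data_enabled(scheduler_config) -> bool:
    """Return the flag if present, otherwise fall back to False."""
    return bool(getattr(scheduler_config, "send_delta_data", False))


# Hypothetical config that still carries the attribute.
with_flag = SimpleNamespace(send_delta_data=True)
# Hypothetical config where the attribute has been removed.
without_flag = SimpleNamespace()

print(delta_data_enabled(with_flag))
print(delta_data_enabled(without_flag))
```

Direct access such as `scheduler_config.send_delta_data` would raise `AttributeError` on the second object, which is the failure mode the default guards against.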

### What this PR does / why we need it?

vllm-project/vllm@c9461e0

- Fix `spec decode rejection sampler`, caused by vllm-project/vllm#26060
- Fix some `import`, caused by vllm-project/vllm#27374
- Fix `scheduler_config.send_delta_data`, caused by vllm-project#3719
- Fix `init_with_cudagraph_sizes`, caused by vllm-project/vllm#26016
- Fix `vl model` to account for PatchEmbed's conv3d being replaced with a linear layer, caused by vllm-project/vllm#27418

### Does this PR introduce _any_ user-facing change?

N/A

### How was this patch tested?

CI passed with newly added and existing tests.

- vLLM version: v0.11.0rc3
- vLLM main: vllm-project/vllm@c9461e0

---------

Signed-off-by: Icey <1790571317@qq.com>
Signed-off-by: luolun <luolun1995@cmbchina.com>


wxsIcey marked this pull request as ready for review October 25, 2025 08:43

wxsIcey added the ready and ready-for-test labels Oct 25, 2025

github-actions bot added the module:core label Oct 25, 2025

wangxiyuan reviewed Oct 25, 2025


Comment thread .github/workflows/format_pr_body.yaml

github-actions bot added module:tests and removed module:tests labels Oct 27, 2025

wxsIcey added 6 commits October 27, 2025 08:38

Upgrade to 0.11.1 newest vllm commit

7c50b3e

Signed-off-by: Icey <1790571317@qq.com>

change commit and fix send_delta_data

94c9125

Signed-off-by: Icey <1790571317@qq.com>

fix init_with_cudagraph_sizes

c2dc165

Signed-off-by: Icey <1790571317@qq.com>

skit embed aclgraph e2e

6ba3f39

Signed-off-by: Icey <1790571317@qq.com>

fix init_with_cudagraph_sizes

e8849b4

Signed-off-by: Icey <1790571317@qq.com>

change commit id to 0.11.1

0ca98f5

Signed-off-by: Icey <1790571317@qq.com>

wxsIcey force-pushed the 0.11.1_1025 branch from ecc96bb to 0ca98f5 Compare October 27, 2025 08:38

tiny fix

e8f87f6

Signed-off-by: Icey <1790571317@qq.com>

wxsIcey added 3 commits October 27, 2025 11:03

fix eagle

8e82843

Signed-off-by: Icey <1790571317@qq.com>

fix aclgraph

c166742

Signed-off-by: Icey <1790571317@qq.com>

skip test_embedding_aclgraph test

89db007

Signed-off-by: Icey <1790571317@qq.com>

wxsIcey added 3 commits October 27, 2025 13:30

tiny fix

28b1306

Signed-off-by: Icey <1790571317@qq.com>

fix vl

445650b

Signed-off-by: Icey <1790571317@qq.com>

tiny fix

5b09cc0

Signed-off-by: Icey <1790571317@qq.com>

wangxiyuan approved these changes Oct 28, 2025


wangxiyuan reviewed Oct 28, 2025


wangxiyuan merged commit a7450db into vllm-project:main Oct 28, 2025
24 checks passed

wxsIcey mentioned this pull request Oct 28, 2025

[Wip] fix problems introduced by vllm #26016 #3826

Closed

wangxiyuan merged 13 commits into vllm-project:main from wxsIcey:0.11.1_1025


2 participants
