Skip to content

Revert "[Refactor] Unify full-graph parameter update logic (#6041)"#6227

Merged
wangxiyuan merged 1 commit intovllm-project:mainfrom
wangxiyuan:revert_main
Jan 25, 2026
Merged

Revert "[Refactor] Unify full-graph parameter update logic (#6041)"#6227
wangxiyuan merged 1 commit intovllm-project:mainfrom
wangxiyuan:revert_main

Conversation

@wangxiyuan
Copy link
Copy Markdown
Collaborator

@wangxiyuan wangxiyuan commented Jan 25, 2026

This reverts commit 8966a99.

It breaks the test tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]

@wangxiyuan wangxiyuan added ready read for review ready-for-test start test by label for PR labels Jan 25, 2026
@github-actions
Copy link
Copy Markdown
Contributor

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

  • A PR should do only one thing, smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
  • Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request reverts commit 8966a99, which refactored the full-graph parameter update logic. The revert moves the parameter update logic from static methods within attention implementation classes back to standalone functions in vllm_ascend/compilation/acl_graph.py. The changes appear to correctly restore the previous implementation across all affected files, including tests. My main feedback is regarding code duplication for the graph parameter update dispatch logic, which is now present in three different locations. Consolidating this into a helper method would improve maintainability.

Comment on lines +1145 to +1164
if self.vllm_config.model_config.use_mla:
if self.pcp_size * self.dcp_size > 1:
# FIXME: Try using `auto_dispatch_capture=True`
update_mla_attn_dcp_pcp_params(self.update_stream,
forward_context,
maybe_padded_num_tokens)
else:
# FIXME: Try using `auto_dispatch_capture=True`
update_mla_attn_params(self.update_stream, forward_context,
maybe_padded_num_tokens,
self.speculative_config)
else:
if self.pcp_size * self.dcp_size > 1:
update_attn_dcp_pcp_params(self.update_stream,
forward_context,
maybe_padded_num_tokens)
else:
update_attn_params(self.update_stream, forward_context,
maybe_padded_num_tokens,
self.vllm_config)
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

This dispatch logic for updating graph parameters is duplicated in _generate_dummy_run_hidden_states (lines 2041-2059) and also in vllm_ascend/spec_decode/eagle_proposer.py (lines 1184-1198). This introduces a maintainability issue, as changes to this logic must be replicated in three places, increasing the risk of inconsistencies.

To address this, consider refactoring this logic into a helper method within the NPUModelRunner class. This would centralize the dispatch logic, making the code cleaner and less prone to errors.

For example, a new method _update_graph_params could be created to encapsulate this logic, and then called from all three locations.

@gcanlin
Copy link
Copy Markdown
Collaborator

gcanlin commented Jan 25, 2026

CI all pass after reverting #6041. Do we plan to revert it directly or fix the bug it introduced?

@wangxiyuan
Copy link
Copy Markdown
Collaborator Author

let's merge this first

@wangxiyuan wangxiyuan merged commit 9564934 into vllm-project:main Jan 25, 2026
43 checks passed
wangxiyuan added a commit to wangxiyuan/vllm-ascend that referenced this pull request Jan 25, 2026
wangxiyuan added a commit that referenced this pull request Jan 26, 2026
…6227) (#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
@wangxiyuan wangxiyuan deleted the revert_main branch January 26, 2026 11:31
starmountain1997 pushed a commit to starmountain1997/vllm-ascend that referenced this pull request Jan 31, 2026
…ect#6041)" (vllm-project#6227)

This reverts commit 8966a99.

It breaks the test
`tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]`

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
starmountain1997 pushed a commit to starmountain1997/vllm-ascend that referenced this pull request Jan 31, 2026
…ject#6041)" (vllm-project#6227) (vllm-project#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
ZRJ026 pushed a commit to ZRJ026/vllm-ascend that referenced this pull request Feb 28, 2026
…ect#6041)" (vllm-project#6227)

This reverts commit 8966a99.

It breaks the test
`tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]`

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094

Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>
ZRJ026 pushed a commit to ZRJ026/vllm-ascend that referenced this pull request Feb 28, 2026
…ject#6041)" (vllm-project#6227) (vllm-project#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094

Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>
maoxx241 pushed a commit to maoxx241/vllm-ascend that referenced this pull request Mar 2, 2026
…ect#6041)" (vllm-project#6227)

This reverts commit 8966a99.

It breaks the test
`tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]`

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
maoxx241 pushed a commit to maoxx241/vllm-ascend that referenced this pull request Mar 2, 2026
…ject#6041)" (vllm-project#6227) (vllm-project#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
ZRJ026 pushed a commit to ZRJ026/vllm-ascend that referenced this pull request Mar 4, 2026
…ect#6041)" (vllm-project#6227)

This reverts commit 8966a99.

It breaks the test
`tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]`

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094

Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>
ZRJ026 pushed a commit to ZRJ026/vllm-ascend that referenced this pull request Mar 4, 2026
…ject#6041)" (vllm-project#6227) (vllm-project#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094

Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>
LCAIZJ pushed a commit to LCAIZJ/vllm-ascend that referenced this pull request Mar 7, 2026
…ect#6041)" (vllm-project#6227)

This reverts commit 8966a99.

It breaks the test
`tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]`

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
LCAIZJ pushed a commit to LCAIZJ/vllm-ascend that referenced this pull request Mar 7, 2026
…ject#6041)" (vllm-project#6227) (vllm-project#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
jiangyunfan1 pushed a commit to jiangyunfan1/vllm-ascend that referenced this pull request Apr 9, 2026
…ect#6041)" (vllm-project#6227)

This reverts commit 8966a99.

It breaks the test
`tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]`

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
jiangyunfan1 pushed a commit to jiangyunfan1/vllm-ascend that referenced this pull request Apr 9, 2026
…ject#6041)" (vllm-project#6227) (vllm-project#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
yangzhe-2026 pushed a commit to yangzhe-2026/vllm-ascend that referenced this pull request May 6, 2026
…ect#6041)" (vllm-project#6227)

This reverts commit 8966a99.

It breaks the test
`tests/e2e/singlecard/spec_decode/test_mtp_eagle_correctness.py::test_deepseek_mtp_correctness[True-FULL_DECODE_ONLY-2-wemaster/deepseek_mtp_main_random_bf16]`

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
yangzhe-2026 pushed a commit to yangzhe-2026/vllm-ascend that referenced this pull request May 6, 2026
…ject#6041)" (vllm-project#6227) (vllm-project#6231)

This reverts commit 9564934.

The CI failure doesn't related to this change. Let's reapply it.

- vLLM version: v0.14.0
- vLLM main:
vllm-project/vllm@d682094
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

module:tests ready read for review ready-for-test start test by label for PR

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants