[Refactor][EAGLE] 1/N delete __init__ in mtp_proposer by slippersss · Pull Request #5176 · vllm-project/vllm-ascend

slippersss · 2025-12-18T13:50:52Z

What this PR does / why we need it?

This PR aims to refactor eagle-related modules in vllm-ascend.

This is the starting PR of eagle refactoring. Provided with vllm-eagle, ascend-eagle and ascend-mtp, we first let ascend-mtp inherit from ascend-eagle and let ascend-eagle inherit from vllm-eagle. As a initialization, we just delete __init__ in mtp_proposer and simplify the corresponding logic in eagle_proposer.

Based on "vllm-eagle <----- ascend-eagle <----- ascend-mtp", our target is to gradually delete ascend-mtp and enable ascend-eagle to converge to vllm-eagle. So the main workspace is eagle_proposer. In this way, we hope that contributors can concurrently refactor eagle.

Incoming changes:

delete common methods in vllm-eagle & ascend-eagle & ascend-mtp
delete load_model in mtp_proposer
delete dummy_run and propose in mtp_proposer
......

RFC: #5467

Does this PR introduce any user-facing change?

N/A

How was this patch tested?

by ci

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

This pull request refactors the speculative decoding proposers by making MtpProposer inherit from EagleProposer, which in turn inherits from the base vllm EagleProposer. This is a good step towards simplifying the code and reducing duplication. The changes are mostly about moving initialization logic to the parent classes and updating attribute access (e.g., from self.name to self.method).

However, I've found a critical issue where the refactoring has inadvertently removed support for M-RoPE in MtpProposer. The initialization logic for M-RoPE was deleted but not moved to the new parent class, which will lead to runtime errors for models using this feature. I've provided a comment with a suggested fix to restore this functionality.

gemini-code-assist · 2025-12-18T13:53:11Z

-        self.hidden_size = vllm_config.speculative_config.draft_model_config.get_hidden_size(
-        )
+        super().__init__(vllm_config, device, runner)



This refactoring appears to have dropped support for M-RoPE in MtpProposer. The initialization logic for self.uses_mrope and self.mrope_positions was removed from MtpProposer.__init__ but not moved to this parent class. This will cause a runtime error when using a model with M-RoPE.

Please add the M-RoPE initialization logic back into the __init__ method.

Additionally, other methods in MtpProposer that depend on this, such as _propose and dummy_run, will also need to be updated to correctly handle M-RoPE based on self.uses_mrope. For example, _propose should use self.mrope_positions when M-RoPE is enabled, but this logic seems to be missing from the current implementation in the branch.

self.uses_mrope = self.vllm_config.model_config.uses_mrope if self.uses_mrope: self.mrope_positions = torch.zeros((3, self.max_num_tokens), dtype=torch.int64, device=device)

github-actions · 2025-12-18T14:13:52Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

github-actions · 2025-12-22T06:10:57Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

realliujiaxu · 2025-12-26T08:55:25Z

        self.vllm_config.scheduler_config.max_num_seqs = 32
        self.vllm_config.model_config.dtype = torch.float16
        self.vllm_config.model_config.max_model_len = 2048
+        self.vllm_config.model_config.uses_mrope = False


why is mrope disabled?

Related ut mainly focuses on testing language model instead of multimodal model. Since the following assertion involves positions, we have to disable uses_mrope here, otherwise mrope_positions will be initialized replacing positions. By the way, we should and will complement this ut in the near feature.

realliujiaxu · 2025-12-26T09:02:44Z

could you create a RFC to record this PR and further PR? This will help community know the process of refactoring mtp.

slippersss · 2025-12-26T11:01:07Z

could you create a RFC to record this PR and further PR? This will help community know the process of refactoring mtp.

Thank you for the suggestion. We are working on it and will release very soon.

github-actions · 2025-12-28T02:39:24Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2025-12-29T02:03:29Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: Zetong Li <slippersss@126.com>

…to FIA_rebase * 'main' of https://github.com/vllm-project/vllm-ascend: (88 commits) [1/N] Refactor nightly test structure (vllm-project#5479) Docs: Remove deprecated --task parameter for embedding models (vllm-project#5257) Revert "moe_gating_top_k" (vllm-project#5512) [Doc] Fix issue link for 0.12.0 (vllm-project#5500) [CI]update triton ascend version (vllm-project#5392) moe_gating_top_k (vllm-project#5271) [refactor] refactor model runner capture model (vllm-project#5230) Update corresponding vllm commit ID to 12 29 (vllm-project#5475) [Kernel]update csrc cmakelist for open-source cann (vllm-project#5458) [OP] add custom op aclnnMoeInitRoutingCustom (vllm-project#5251) [Refactor][EAGLE] 1/N delete __init__ in mtp_proposer (vllm-project#5176) [Refactor][Triton] Move reject sample triton kernels into ops/triton (vllm-project#5324) [Feature] support eager mode in model runner v2 (vllm-project#5210) [feature] fia support sliding windows (vllm-project#5239) Optimize some rejectsampler functions to make npu op launch non-blocking (vllm-project#4587) [Feature] Support to use fullgraph with eagle (vllm-project#5118) [EPLB][refactor] Modification of the initialization logic for expert_map and log2phy（depend on pr5285） (vllm-project#5311) [Refactor]6/N Extract common code of class AscendMLAImpl (vllm-project#5314) [Refactor] cache cos/sin in mla & remove parameter model in builder. (vllm-project#5277) update vllm pin to 12.27 (vllm-project#5412) ...

) ### What this PR does / why we need it? This PR aims to refactor eagle-related modules in vllm-ascend. This is the starting PR of eagle refactoring. Provided with vllm-eagle, ascend-eagle and ascend-mtp, we first let ascend-mtp inherit from ascend-eagle and let ascend-eagle inherit from vllm-eagle. As a initialization, we just delete `__init__` in mtp_proposer and simplify the corresponding logic in eagle_proposer. Based on "vllm-eagle <----- ascend-eagle <----- ascend-mtp", our target is to gradually delete ascend-mtp and enable ascend-eagle to converge to vllm-eagle. So the main workspace is eagle_proposer. In this way, we hope that contributors can concurrently refactor eagle. Incoming changes: 1. delete common methods in vllm-eagle & ascend-eagle & ascend-mtp 2. delete `load_model` in mtp_proposer 3. delete `dummy_run` and `propose` in mtp_proposer 4. ...... RFC: vllm-project#5467 ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? by ci - vLLM version: v0.12.0 - vLLM main: vllm-project/vllm@ad32e3e --------- Signed-off-by: Zetong Li <slippersss@126.com>

) ### What this PR does / why we need it? This PR aims to refactor eagle-related modules in vllm-ascend. This is the starting PR of eagle refactoring. Provided with vllm-eagle, ascend-eagle and ascend-mtp, we first let ascend-mtp inherit from ascend-eagle and let ascend-eagle inherit from vllm-eagle. As a initialization, we just delete `__init__` in mtp_proposer and simplify the corresponding logic in eagle_proposer. Based on "vllm-eagle <----- ascend-eagle <----- ascend-mtp", our target is to gradually delete ascend-mtp and enable ascend-eagle to converge to vllm-eagle. So the main workspace is eagle_proposer. In this way, we hope that contributors can concurrently refactor eagle. Incoming changes: 1. delete common methods in vllm-eagle & ascend-eagle & ascend-mtp 2. delete `load_model` in mtp_proposer 3. delete `dummy_run` and `propose` in mtp_proposer 4. ...... RFC: vllm-project#5467 ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? by ci - vLLM version: v0.12.0 - vLLM main: vllm-project/vllm@ad32e3e --------- Signed-off-by: Zetong Li <slippersss@126.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

) ### What this PR does / why we need it? This PR aims to refactor eagle-related modules in vllm-ascend. This is the starting PR of eagle refactoring. Provided with vllm-eagle, ascend-eagle and ascend-mtp, we first let ascend-mtp inherit from ascend-eagle and let ascend-eagle inherit from vllm-eagle. As a initialization, we just delete `__init__` in mtp_proposer and simplify the corresponding logic in eagle_proposer. Based on "vllm-eagle <----- ascend-eagle <----- ascend-mtp", our target is to gradually delete ascend-mtp and enable ascend-eagle to converge to vllm-eagle. So the main workspace is eagle_proposer. In this way, we hope that contributors can concurrently refactor eagle. Incoming changes: 1. delete common methods in vllm-eagle & ascend-eagle & ascend-mtp 2. delete `load_model` in mtp_proposer 3. delete `dummy_run` and `propose` in mtp_proposer 4. ...... RFC: vllm-project#5467 ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? by ci - vLLM version: v0.12.0 - vLLM main: vllm-project/vllm@ad32e3e --------- Signed-off-by: Zetong Li <slippersss@126.com>

) ### What this PR does / why we need it? This PR aims to refactor eagle-related modules in vllm-ascend. This is the starting PR of eagle refactoring. Provided with vllm-eagle, ascend-eagle and ascend-mtp, we first let ascend-mtp inherit from ascend-eagle and let ascend-eagle inherit from vllm-eagle. As a initialization, we just delete `__init__` in mtp_proposer and simplify the corresponding logic in eagle_proposer. Based on "vllm-eagle <----- ascend-eagle <----- ascend-mtp", our target is to gradually delete ascend-mtp and enable ascend-eagle to converge to vllm-eagle. So the main workspace is eagle_proposer. In this way, we hope that contributors can concurrently refactor eagle. Incoming changes: 1. delete common methods in vllm-eagle & ascend-eagle & ascend-mtp 2. delete `load_model` in mtp_proposer 3. delete `dummy_run` and `propose` in mtp_proposer 4. ...... RFC: vllm-project#5467 ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? by ci - vLLM version: v0.12.0 - vLLM main: vllm-project/vllm@ad32e3e --------- Signed-off-by: Zetong Li <slippersss@126.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

gemini-code-assist bot reviewed Dec 18, 2025

View reviewed changes

slippersss force-pushed the refactor_1 branch from 6f4ef53 to 5ae54a7 Compare December 19, 2025 02:29

github-actions bot added the merge-conflicts label Dec 22, 2025

slippersss force-pushed the refactor_1 branch from f6ba1d5 to 18321f7 Compare December 25, 2025 14:06

github-actions bot removed the merge-conflicts label Dec 25, 2025

slippersss force-pushed the refactor_1 branch from a735460 to f145706 Compare December 26, 2025 03:37

wangxiyuan approved these changes Dec 26, 2025

View reviewed changes

realliujiaxu reviewed Dec 26, 2025

View reviewed changes

slippersss force-pushed the refactor_1 branch from f145706 to d9820d2 Compare December 27, 2025 01:28

linfeng-yuan added ready-for-test start test by label for PR ready read for review labels Dec 27, 2025

github-actions bot added the merge-conflicts label Dec 28, 2025

slippersss force-pushed the refactor_1 branch from d9820d2 to 784de33 Compare December 28, 2025 09:26

github-actions bot added merge-conflicts and removed merge-conflicts labels Dec 28, 2025

slippersss added 2 commits December 29, 2025 11:16

[Refactor][EAGLE] 1/N delete __init__ in mtp_proposer

108b57c

Signed-off-by: Zetong Li <slippersss@126.com>

fix ut errors

07d0f9b

Signed-off-by: Zetong Li <slippersss@126.com>

slippersss force-pushed the refactor_1 branch from 784de33 to 07d0f9b Compare December 29, 2025 03:33

github-actions bot removed the merge-conflicts label Dec 29, 2025

slippersss mentioned this pull request Dec 29, 2025

[RFC]: Refactor and unify eagle_proposer and mtp_proposer #5467

Closed

wangxiyuan merged commit 92353c0 into vllm-project:main Dec 29, 2025
19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Refactor][EAGLE] 1/N delete init in mtp_proposer#5176

[Refactor][EAGLE] 1/N delete init in mtp_proposer#5176
wangxiyuan merged 2 commits intovllm-project:mainfrom
slippersss:refactor_1

slippersss commented Dec 18, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 18, 2025

Uh oh!

github-actions bot commented Dec 18, 2025

Uh oh!

github-actions bot commented Dec 22, 2025

Uh oh!

realliujiaxu Dec 26, 2025 •

edited

Loading

Uh oh!

slippersss Dec 26, 2025

Uh oh!

realliujiaxu commented Dec 26, 2025

Uh oh!

slippersss commented Dec 26, 2025

Uh oh!

github-actions bot commented Dec 28, 2025

Uh oh!

github-actions bot commented Dec 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

slippersss commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Dec 18, 2025

Uh oh!

github-actions bot commented Dec 22, 2025

Uh oh!

realliujiaxu Dec 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

slippersss Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

realliujiaxu commented Dec 26, 2025

Uh oh!

slippersss commented Dec 26, 2025

Uh oh!

github-actions bot commented Dec 28, 2025

Uh oh!

github-actions bot commented Dec 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

slippersss commented Dec 18, 2025 •

edited

Loading

realliujiaxu Dec 26, 2025 •

edited

Loading