[Core] Remove unused num_tokens parameter from _init_model_kwargs #31517

vllm-bot merged 1 commit into vllm-project:main

Conversation
Signed-off-by: maang <maang_h@163.com>
Code Review
This pull request correctly removes the unused num_tokens parameter from the _init_model_kwargs method in GPUModelRunner. This is a good refactoring that improves code clarity by removing a redundant parameter. All five call sites have been correctly updated to reflect this change. The change is purely stylistic and has no functional impact, as confirmed by the PR description and my review. The changes look good and are approved.
DarkLight1337
left a comment
cc @maxdebayser @noooop do you remember why this argument was there in the first place?
I can't find a reason for the existence of this parameter now. PR #21985, which introduced this function, was the result of a lot of refactoring. I think the code that needed the parameter was removed in that process, and it seems I forgot to remove the parameter itself.
### What this PR does / why we need it?
Upgrade vllm commit to 0105 (8be6432bdaf6275664d857b1e5e9bf8ed1ce299e)
1. Remove the `maybe_padded_num_tokens` arg in `model_runner_v1.py`, since vllm-project/vllm#31517 deleted the unused arg
2. Remove dense `Qwen/Qwen3-0.6B` from `tests/e2e/multicard/test_aclgraph_capture_replay.py` and `tests/e2e/multicard/test_data_parallel.py` due to vllm-project/vllm#30739, where offline data parallel mode will not be supported/useful for dense models
3. Adapt `vllm_ascend/worker/worker.py` due to vllm-project/vllm#31584
4. Adapt the `self.block_size` calls due to vllm-project/vllm#31540
5. Modify `test_mla_v1.py` due to vllm-project/vllm#28454, which refactored `get_head_size()`

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?
- vLLM version: v0.13.0
- vLLM main: vllm-project/vllm@7157596

Signed-off-by: wjunLu <wjunlu217@gmail.com>
…vllm-project#31517) Signed-off-by: maang <maang_h@163.com>
…vllm-project#31517) Signed-off-by: maang <maang_h@163.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
Purpose

Remove the unused `num_tokens` parameter from the `GPUModelRunner._init_model_kwargs()` method.

Changes

- Remove the `num_tokens: int` parameter from the method signature

Motivation

The `num_tokens` parameter is passed to `_init_model_kwargs()` but never used.

Test Plan

None

Test Result

None
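The change described above can be sketched as follows. This is a minimal, hypothetical illustration of dropping an unused parameter, not the actual `GPUModelRunner` implementation from vLLM; the method body and return value here are placeholders.

```python
# Hypothetical sketch of the refactoring in this PR: an unused
# parameter is removed from a method signature, and call sites
# are updated to match. The real GPUModelRunner body is not shown.

class GPUModelRunner:
    # Before (illustrative):
    #
    #     def _init_model_kwargs(self, num_tokens: int) -> dict:
    #         # num_tokens was accepted but never read
    #         return {}
    #
    # After: the unused parameter is gone, so each of the call
    # sites simply stops passing it.
    def _init_model_kwargs(self) -> dict:
        return {}


runner = GPUModelRunner()
# Call sites previously wrote runner._init_model_kwargs(num_tokens);
# after the change they call it with no argument.
kwargs = runner._init_model_kwargs()
print(kwargs)
```

Because the parameter was never read, removing it is behavior-preserving; the only required follow-up is updating every caller, which is why all five call sites changed in the same commit.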