[BugFix] Don’t compute reorder threshold when there are no attention groups by hl475 · Pull Request #27861 · vllm-project/vllm

hl475 · 2025-10-31T06:40:30Z

Purpose

This PR fixes a startup crash in the v1 runtime for attention‑free models (e.g., Terratorch) introduced after #27809. The engine unconditionally computed the batch reorder threshold even when no attention backends were created, leading to:

TypeError: reduce() of empty iterable with no initial value

from the nightly run (https://buildkite.com/vllm/ci/builds/37041/steps/canvas?sid=019a386d-1b25-4c07-9a9b-085c1e07ea05, https://buildkite.com/vllm/ci/builds/37041/steps/canvas?sid=019a386d-1b26-4f42-b55f-f0125da20368)

This PR (1) skip the calculation when there are no attention groups, and (2) make calculate_reorder_batch_threshold() defensive by resolving an empty list to None.

Test Plan

CI

Test Result

Basic Models Tests (Extra Initialization) 1 + Basic Models Tests (Extra Initialization) 2
https://buildkite.com/vllm/ci/builds/37052/steps/canvas?sid=019a3901-5ee6-45ba-bace-2ccb858b53a1
Basic Models Tests (Initialization)
https://buildkite.com/vllm/ci/builds/37052/steps/canvas?sid=019a3901-5ee5-48b7-ac1d-0cf8db1b054d

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

LucasWilkinson

Thanks for the contribution! Overall LGTM, left a couple nits

vllm/v1/worker/gpu_model_runner.py

Signed-off-by: Huamin Li <3ericli@gmail.com>

hl475 · 2025-10-31T08:37:49Z

Thanks @LucasWilkinson for reviewing!

I just updated this PR to address the comments, please take another look!

…groups (vllm-project#27861)

mergify bot added the v1 label Oct 31, 2025

hl475 changed the title ~~fix_attention_free_models~~ v1: Don’t compute reorder threshold when there are no attention groups Oct 31, 2025

hl475 marked this pull request as ready for review October 31, 2025 07:59

LucasWilkinson changed the title ~~v1: Don’t compute reorder threshold when there are no attention groups~~ [BugFix] Don’t compute reorder threshold when there are no attention groups Oct 31, 2025

LucasWilkinson approved these changes Oct 31, 2025

View reviewed changes

vllm/v1/worker/gpu_model_runner.py Outdated Show resolved Hide resolved

vllm/v1/worker/gpu_model_runner.py Outdated Show resolved Hide resolved

fix_attention_free_models

2e0dc6a

Signed-off-by: Huamin Li <3ericli@gmail.com>

hl475 force-pushed the fix_attention_free_models branch from a92ec36 to 2e0dc6a Compare October 31, 2025 08:36

LucasWilkinson enabled auto-merge (squash) October 31, 2025 09:22

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 31, 2025

LucasWilkinson merged commit 933cdea into vllm-project:main Oct 31, 2025
47 checks passed

This was referenced Oct 31, 2025

[Metrics] Enable sleep state metric outside of dev mode #27867

Merged

Add ORCA endpoint load metrics support #24905

Merged

[Misc] Refactor Attention kv transfer methods into decorator #27816

Merged

ZhengHongming888 pushed a commit to ZhengHongming888/vllm that referenced this pull request Nov 8, 2025

[BugFix] Don’t compute reorder threshold when there are no attention …

74709b5

…groups (vllm-project#27861)

rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025

[BugFix] Don’t compute reorder threshold when there are no attention …

9f97263

…groups (vllm-project#27861)

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

[BugFix] Don’t compute reorder threshold when there are no attention …

4f36970

…groups (vllm-project#27861)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BugFix] Don’t compute reorder threshold when there are no attention groups#27861

[BugFix] Don’t compute reorder threshold when there are no attention groups#27861
LucasWilkinson merged 1 commit intovllm-project:mainfrom
hl475:fix_attention_free_models

hl475 commented Oct 31, 2025 •

edited by github-actions bot

Loading

Uh oh!

LucasWilkinson left a comment

Uh oh!

Uh oh!

Uh oh!

hl475 commented Oct 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

hl475 commented Oct 31, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

LucasWilkinson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

hl475 commented Oct 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hl475 commented Oct 31, 2025 •

edited by github-actions bot

Loading