removed lens calculation for qwen2_5 #906

Closed
slokesha wants to merge 3 commits into vllm-project:releases/v0.14.1 from slokesha:qwen2_5_perf_fix


@slokesha
Contributor

@slokesha slokesha commented Jan 30, 2026

Remove the `lens = (cu_seqlens[1:] - cu_seqlens[:-1]).tolist()` computation from the Qwen2.5 path.

This calculation is not required for Qwen2.5 and was causing a performance regression after PR #884. Removing it restores the previous performance without changing model behavior.
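For readers unfamiliar with the expression, here is a minimal pure-Python sketch of what it computes: `cu_seqlens` holds cumulative sequence boundaries, and the difference of neighbouring entries gives each sequence's length. The function names `sequence_lengths` and `is_single_sequence` are hypothetical and do not appear in the PR. In the real code `cu_seqlens` is a tensor, and `.tolist()` copies its values into Python objects on the host, which may be where the cost came from; the PR itself does not state the cause.

```python
def sequence_lengths(cu_seqlens):
    """Pure-Python equivalent of (cu_seqlens[1:] - cu_seqlens[:-1]).tolist().

    cu_seqlens is a list of cumulative sequence boundaries, e.g. [0, 4, 9, 15].
    """
    return [end - start for start, end in zip(cu_seqlens[:-1], cu_seqlens[1:])]


def is_single_sequence(cu_seqlens):
    """Hypothetical helper: same result as len(lens) == 1, without building lens.

    N sequences produce N + 1 boundaries, so one sequence means two entries.
    """
    return len(cu_seqlens) == 2


cu_seqlens = [0, 4, 9, 15]
print(sequence_lengths(cu_seqlens))    # [4, 5, 6]
print(is_single_sequence(cu_seqlens))  # False
print(is_single_sequence([0, 7]))      # True
```

The second helper only illustrates that a check such as `len(lens) == 1` depends on the number of boundaries and not on their values; whether the PR relies on this is an assumption.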

Signed-off-by: slokesha <slokeshappa@habana.ai>
Signed-off-by: slokesha <slokeshappa@habana.ai>
@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
d7de043d55d1dd629554467e23874097e1c48993

Signed-off-by: slokesha <slokeshappa@habana.ai>
@github-actions

✅ CI Passed

All checks passed successfully against the following vllm commit:
d7de043d55d1dd629554467e23874097e1c48993

lens = (cu_seqlens[1:] - cu_seqlens[:-1]).tolist()
if mask is not None or len(lens) == 1:
if not qwen2_5_vl or (qwen2_5_vl and q_len < 65536):
if qwen2_5_vl:
Collaborator


Is this needed on main?

@adobrzyn
Collaborator

adobrzyn commented Feb 4, 2026

@slokesha - is this PR still needed? Similar changes were already merged into 0.14.1.

@adobrzyn adobrzyn closed this Feb 6, 2026