[FIX] Fix shape mismatch for swapped sequences when logprobs > 0 by derange-alembic · Pull Request #1971 · vllm-project/vllm

derange-alembic · 2023-12-07T23:26:48Z

When the logprobs of a sequence is set to be larger than 0 and it was swapped out using recomputation policy, the _get_logprobs() in sampler.py will encounter IndexError: shape mismatch: indexing tensors could not be broadcast together with shapes due to the obliviousness of the output_token_ids.

derange-alembic · 2023-12-07T23:31:43Z

This issue is also reported in #1847.

Yard1

This looks good to me. cc @WoosukKwon @zhuohan123

WoosukKwon · 2023-12-15T21:09:02Z

~~BTW, I found another bug when running a batch with n = 2.~~

  File "/home/wskwon/workspace/vllm/vllm/core/scheduler.py", line 284, in schedule
    scheduler_outputs = self._schedule()
  File "/home/wskwon/workspace/vllm/vllm/core/scheduler.py", line 142, in _schedule
    assert seq_group.num_seqs() == 1, (
AssertionError: Waiting sequence group should have only one prompt sequence.

~~I think this happens when one of the two sequences finish earlier and the other one is preempted and resumed using recomputation.~~

This was fixed by #2186

WoosukKwon · 2023-12-21T21:02:03Z

vllm/model_executor/layers/sampler.py

+            # Swapped seqs have output tokens.
+            output_tokens = sampling_metadata.seq_data[
+                seq_ids[0]].output_token_ids
            group_prompt_logprobs: PromptLogprobs = [None]
-            for token_id in prompt_tokens[1:]:
+            for token_id in prompt_tokens[1:] + output_tokens:


Why is output_tokens used for prompt logprobs?

richardzhuang0412 · 2024-02-22T07:52:43Z

For prompt_logprobs I think the issue still remains

AetherPrior · 2024-02-26T10:37:26Z

Hi, I've raised a new issue (#3032), as the codebase has changed since then, and the issue still persists.
CC: @derange-alembic , @Yard1 , @WoosukKwon

github-actions · 2024-10-30T02:04:28Z

This pull request has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this pull request should remain open. Thank you!

mergify · 2024-10-30T02:05:13Z

This pull request has merge conflicts that must be resolved before it can be
merged. @derange-alembic please rebase it. https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

…1971) Signed-off-by: Youlei Yang <youlei.yang@intel.com>

derange-alembic added 2 commits December 7, 2023 23:14

Fix shape mismatch of swapped sequences when logprobs > 0

2dc7101

Format the code.

4838080

derange-alembic marked this pull request as ready for review December 7, 2023 23:31

Yard1 approved these changes Dec 8, 2023

View reviewed changes

WoosukKwon self-requested a review December 10, 2023 03:25

WoosukKwon reviewed Dec 21, 2023

View reviewed changes

AetherPrior mentioned this pull request Feb 26, 2024

[BUG] Prompt logprobs causing tensor broadcast issue in sampler.py #3032

Closed

github-actions bot added the stale Over 90 days of inactivity label Oct 30, 2024

mergify bot added the needs-rebase label Oct 30, 2024

github-actions bot added unstale Recieved activity after being labelled stale and removed stale Over 90 days of inactivity labels Nov 3, 2024

simon-mo requested review from alexm-redhat, comaniac, njhill, youkaichao and zhuohan123 as code owners November 26, 2024 05:49

hmellor added stale Over 90 days of inactivity and removed unstale Recieved activity after being labelled stale labels Jan 28, 2025

hmellor closed this Jan 28, 2025

jinyouzhi pushed a commit to jinyouzhi/vllm that referenced this pull request Sep 26, 2025

add -q to the scripts to specify the QUANT_CONFIG file (vllm-project#…

05770c7

…1971) Signed-off-by: Youlei Yang <youlei.yang@intel.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FIX] Fix shape mismatch for swapped sequences when logprobs > 0#1971

[FIX] Fix shape mismatch for swapped sequences when logprobs > 0#1971
derange-alembic wants to merge 2 commits intovllm-project:mainfrom
derange-alembic:fix_swap_seq_logprobs_shape_mismatch

derange-alembic commented Dec 7, 2023

Uh oh!

derange-alembic commented Dec 7, 2023

Uh oh!

Yard1 left a comment

Uh oh!

WoosukKwon commented Dec 15, 2023 •

edited

Loading

Uh oh!

WoosukKwon Dec 21, 2023

Uh oh!

richardzhuang0412 commented Feb 22, 2024

Uh oh!

AetherPrior commented Feb 26, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Oct 30, 2024

Uh oh!

mergify bot commented Oct 30, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

Conversation

derange-alembic commented Dec 7, 2023

Uh oh!

derange-alembic commented Dec 7, 2023

Uh oh!

Yard1 left a comment

Choose a reason for hiding this comment

Uh oh!

WoosukKwon commented Dec 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

WoosukKwon Dec 21, 2023

Choose a reason for hiding this comment

Uh oh!

richardzhuang0412 commented Feb 22, 2024

Uh oh!

AetherPrior commented Feb 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 30, 2024

Uh oh!

mergify bot commented Oct 30, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

WoosukKwon commented Dec 15, 2023 •

edited

Loading

AetherPrior commented Feb 26, 2024 •

edited

Loading