Skip to content

Enables GQA support in the prefix prefill kernels#3007

Merged
WoosukKwon merged 1 commit intovllm-project:mainfrom
sighingnow:ht/prefix-gqa
Feb 27, 2024
Merged

Enables GQA support in the prefix prefill kernels#3007
WoosukKwon merged 1 commit intovllm-project:mainfrom
sighingnow:ht/prefix-gqa

Conversation

@sighingnow
Copy link
Copy Markdown
Collaborator

No description provided.

@sighingnow
Copy link
Copy Markdown
Collaborator Author

sighingnow commented Feb 23, 2024

The failure in CI's "Model Test" shouldn't be caused by this pull request, and I have noticed the same failure in other PR as well as main.

Copy link
Copy Markdown
Collaborator

@WoosukKwon WoosukKwon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@sighingnow Awesome! Thanks for submitting the PR! Left a minor comment on a variable name.

Signed-off-by: Tao He <sighingnow@gmail.com>
@WoosukKwon
Copy link
Copy Markdown
Collaborator

@sighingnow Thanks for the fix! I will merge the PR once it passes the CI tests.

@sighingnow
Copy link
Copy Markdown
Collaborator Author

@sighingnow Thanks for the fix! I will merge the PR once it passes the CI tests.

Thank you!

@sighingnow
Copy link
Copy Markdown
Collaborator Author

@sighingnow Thanks for the fix! I will merge the PR once it passes the CI tests.

Hi @WoosukKwon, CI turns green now. (Just a polite reminding).

@WoosukKwon WoosukKwon merged commit 71bcaf9 into vllm-project:main Feb 27, 2024
@sighingnow sighingnow deleted the ht/prefix-gqa branch February 27, 2024 14:06
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024
Signed-off-by: Tao He <sighingnow@gmail.com>


@pytest.mark.parametrize("num_heads", NUM_HEADS)
@pytest.mark.parametrize("num_queries_per_kv", NUM_HEADS)
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this parameter be NUM_QUERIES_PER_KV?

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed in #3246.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants