Skip to content

[Attention] Remove max cudagraph size limit of 992#27840

Merged
22quinn merged 2 commits intovllm-project:mainfrom
22quinn:remove-992
Nov 8, 2025
Merged

[Attention] Remove max cudagraph size limit of 992#27840
22quinn merged 2 commits intovllm-project:mainfrom
22quinn:remove-992

Conversation

@22quinn
Copy link
Collaborator

@22quinn 22quinn commented Oct 30, 2025

This is to support cuda graph capturing beyond 992. Tested working for larger size

Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
@mergify mergify bot added the v1 label Oct 30, 2025
@22quinn 22quinn added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 30, 2025
@22quinn 22quinn marked this pull request as ready for review November 8, 2025 00:05
@22quinn 22quinn requested a review from zhuohan123 November 8, 2025 00:26
Copy link
Member

@zhuohan123 zhuohan123 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@22quinn 22quinn merged commit 608bb14 into vllm-project:main Nov 8, 2025
49 checks passed
@22quinn 22quinn deleted the remove-992 branch November 8, 2025 06:33
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants