Skip to content

[Model Runner V2] Simplify BlockTables with UVA#31965

Merged
WoosukKwon merged 2 commits intomainfrom
woosuk/uva-block-table
Jan 8, 2026
Merged

[Model Runner V2] Simplify BlockTables with UVA#31965
WoosukKwon merged 2 commits intomainfrom
woosuk/uva-block-table

Conversation

@WoosukKwon
Copy link
Copy Markdown
Collaborator

@WoosukKwon WoosukKwon commented Jan 8, 2026

This PR simplifies the append_block_ids method in BlockTables by using UVA buffer to store the block tables.

Previously, the block table was stored in GPU, and we send the "diff" to GPU every step. While efficient, this approach complicated the code quite a bit to pack the "diff"s into contiguous GPU tensors and perform async-copy for them.

Using the UVA tensor eliminates the need for such packing and async copies, at the cost of transferring the block tables from CPU to GPU every step.

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
@mergify mergify bot added the v1 label Jan 8, 2026
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
@WoosukKwon WoosukKwon merged commit 7508243 into main Jan 8, 2026
8 of 9 checks passed
@WoosukKwon WoosukKwon deleted the woosuk/uva-block-table branch January 8, 2026 18:24
yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant