[MRV2] Fix for DS v3.2 by WoosukKwon · Pull Request #38030 · vllm-project/vllm

WoosukKwon · 2026-03-24T18:37:54Z

No description provided.

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

gemini-code-assist

Code Review

This pull request modifies the _reshape_kv_cache function to accommodate more flexible KV cache specifications, allowing for both uniform and layer-specific AttentionSpec configurations. A review comment highlights a potential issue where the assert statement used for type checking kv_cache_spec could be bypassed in production if assertions are disabled, suggesting a more robust type validation using an explicit TypeError or ValueError to ensure consistent error handling.

gemini-code-assist · 2026-03-24T18:43:33Z

vllm/v1/worker/gpu/attn_utils.py

+            kv_cache_spec = kv_cache_group_spec.kv_cache_spec
+            if isinstance(kv_cache_spec, UniformTypeKVCacheSpecs):
+                kv_cache_spec = kv_cache_spec.kv_cache_specs[layer_name]
+            assert isinstance(kv_cache_spec, AttentionSpec)


The assert statement on this line performs a critical type check. If assertions are disabled in a production environment, this check will be skipped, potentially leading to AttributeError or TypeError in subsequent operations if kv_cache_spec is not an AttentionSpec. For robust error handling, consider replacing this assert with an explicit TypeError or ValueError to ensure type validation always occurs, regardless of assertion settings.

Suggested change

assert isinstance(kv_cache_spec, AttentionSpec)

if not isinstance(kv_cache_spec, AttentionSpec):

raise TypeError(f"Expected kv_cache_spec to be AttentionSpec, but got {type(kv_cache_spec)}")

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Michel Belleau <michel.belleau@malaiwah.com>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Monishver Chandrasekaran <monishverchandrasekaran@gmail.com>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Nithin Chalapathi <nithin.ch10@gmail.com>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Vinay Damodaran <vrdn@hey.com>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: EricccYang <yangyang4991@gmail.com>

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>

[MRV2] Fix for DS v3.2

db5a852

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

WoosukKwon requested a review from njhill as a code owner March 24, 2026 18:37

WoosukKwon added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 24, 2026

mergify bot added the v1 label Mar 24, 2026

gemini-code-assist bot reviewed Mar 24, 2026

View reviewed changes

njhill approved these changes Mar 24, 2026

View reviewed changes

WoosukKwon merged commit 4b53740 into main Mar 24, 2026
62 checks passed

WoosukKwon deleted the woosuk/mrv2-fix-ds32 branch March 24, 2026 21:03

RhizoNymph pushed a commit to RhizoNymph/vllm that referenced this pull request Mar 26, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

51e3742

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

HenryTangDev pushed a commit to HenryTangMain/vllm that referenced this pull request Mar 27, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

dbd8a23

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

malaiwah pushed a commit to malaiwah/vllm that referenced this pull request Mar 27, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

e74fdce

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Michel Belleau <michel.belleau@malaiwah.com>

khairulkabir1661 pushed a commit to khairulkabir1661/vllm that referenced this pull request Mar 27, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

a3620b3

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

Monishver11 pushed a commit to Monishver11/vllm that referenced this pull request Mar 27, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

22bfa22

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Monishver Chandrasekaran <monishverchandrasekaran@gmail.com>

nithinvc pushed a commit to nithinvc/vllm that referenced this pull request Mar 27, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

c2a9a64

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Nithin Chalapathi <nithin.ch10@gmail.com>

JiantaoXu pushed a commit to JiantaoXu/vllm that referenced this pull request Mar 28, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

5dae327

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

vrdn-23 pushed a commit to vrdn-23/vllm that referenced this pull request Mar 30, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

e49d818

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: Vinay Damodaran <vrdn@hey.com>

EricccYang pushed a commit to EricccYang/vllm that referenced this pull request Apr 1, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

fcb38bb

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: EricccYang <yangyang4991@gmail.com>

bhargav-patel-29 pushed a commit to Bharatgen-Tech/vllm that referenced this pull request Apr 1, 2026

[MRV2] Fix for DS v3.2 (vllm-project#38030)

cd02b74

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai> Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MRV2] Fix for DS v3.2#38030

[MRV2] Fix for DS v3.2#38030
WoosukKwon merged 1 commit intomainfrom
woosuk/mrv2-fix-ds32

WoosukKwon commented Mar 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Mar 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	assert isinstance(kv_cache_spec, AttentionSpec)
	if not isinstance(kv_cache_spec, AttentionSpec):
	raise TypeError(f"Expected kv_cache_spec to be AttentionSpec, but got {type(kv_cache_spec)}")

Uh oh!

Conversation

WoosukKwon commented Mar 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants