[Bugfix] Remove incorrect assertion in causal_conv1d_update for Qwen3.5 GDN layers by Rks2302 · Pull Request #36324 · vllm-project/vllm

Rks2302 · 2026-03-07T11:21:43Z

Summary

Removes an incorrect assertion in causal_conv1d_update that raises an AssertionError
when running Qwen3.5 models with GDN (Gated Delta Networks) layers.

Root Cause

Line 1161 in causal_conv1d.py:

assert num_cache_lines >= batch

This assertion is incorrect when conv_state_indices is provided. In that case:

conv_state is a shared cache pool, not a per-batch tensor
num_cache_lines is the state length (typically 4-6), NOT the batch dimension
The actual batch validation is already correctly handled in the if/else block above

The correct validations are already present:

Without indices: assert conv_state.size(0) >= batch ✅
With indices: assert batch == conv_state_indices.shape[0] ✅

The removed assertion is therefore redundant and incorrect.
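
To illustrate the two validations listed above, here is a minimal sketch in plain Python. It is not the vLLM implementation: `validate_batch` is a hypothetical helper, and shapes are passed as tuples instead of tensors so the example runs without a GPU or PyTorch. The example shapes are made up for illustration only.

```python
from typing import Optional, Tuple


def validate_batch(
    conv_state_shape: Tuple[int, ...],
    batch: int,
    conv_state_indices_shape: Optional[Tuple[int, ...]] = None,
) -> None:
    """Hypothetical helper mirroring the two checks quoted above."""
    if conv_state_indices_shape is None:
        # Without indices: conv_state must hold at least one entry per sequence.
        assert conv_state_shape[0] >= batch
    else:
        # With indices: there must be exactly one index per sequence.
        # The first dimension of conv_state is not compared against batch here.
        assert batch == conv_state_indices_shape[0]


# Non-indexed path: 8 state entries cover a batch of 4.
validate_batch((8, 64, 4), batch=4)

# Indexed path: only the number of indices has to match the batch size.
validate_batch((2, 64, 4), batch=4, conv_state_indices_shape=(4,))
```

In this sketch the second call passes only because the indexed path never compares the first dimension of `conv_state` with `batch`. Adding that comparison unconditionally after the if/else would reject this indexed call, since 2 is less than 4.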

Error Reproduced On

GPU: NVIDIA RTX 5090 (Blackwell, sm_120)
vLLM: 0.17.0
Model: Qwen3.5-35B-A3B-AWQ, Qwen3.5-27B-AWQ
CUDA: 12.8

Related Issues

[Bug]: AssertionError in causal_conv1d_update when capturing CUDA graphs for Qwen3.5/GDN layers #35945 (same root cause, closed without merge)
[Bug]: Qwen 3.5 27B AWQ 4bit capturing CUDA graph fails #35743 (Qwen3.5 AWQ CUDA graph failures)
[Bugfix] Cap FULL decode cudagraph sizes for Mamba/hybrid models (#34094) #34571 (related fix for Mamba models, does not cover GDN path)

gemini-code-assist

Code Review

This pull request removes an incorrect assertion in causal_conv1d_update that causes an AssertionError when running Qwen3.5 models with Gated Delta Networks (GDN) layers. The assertion assert num_cache_lines >= batch is invalid when conv_state_indices is provided, as conv_state is a shared cache pool in that scenario, and num_cache_lines represents the cache size, not a per-batch dimension. The removal of this assertion is correct as the necessary batch validation is already handled by other checks.

Rks2302 requested a review from tdoublep as a code owner March 7, 2026 11:21

mergify bot added the qwen (Related to Qwen models) and bug (Something isn't working) labels Mar 7, 2026

gemini-code-assist bot reviewed Mar 7, 2026

vllmellm and others added 2 commits March 7, 2026 18:40

[Bugfix] Fix compressed-tensors quantization failure for DeepSeek-R1 …

f091b96

…on MI300x (vllm-project#36247) Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com> Signed-off-by: Rks2302 <rahulksharma2302@gmail.com>

fix: remove incorrect assertion in causal_conv1d_update for GDN index…

444783f

…ed conv_state Signed-off-by: Rks2302 <rahulksharma2302@gmail.com>

Rks2302 force-pushed the fix/gdn-causal-conv1d-assertion branch from 70f1470 to 444783f March 7, 2026 13:11

mergify bot added the deepseek (Related to DeepSeek models) label Mar 7, 2026

Rks2302 added 2 commits March 7, 2026 18:45

Merge branch 'main' into fix/gdn-causal-conv1d-assertion

69929f5

Merge branch 'main' into fix/gdn-causal-conv1d-assertion

275bc8a

Rks2302 wants to merge 4 commits into vllm-project:main from Rks2302:fix/gdn-causal-conv1d-assertion
