Add Qwen3NextForCausalLM to mamba_like_arch by rsmyrek · Pull Request #1450 · vllm-project/vllm-gaudi

rsmyrek · 2026-05-15T16:43:14Z

Qwen3Next uses a hybrid GDN+attention architecture that requires separate KV cache groups for GDN vs standard attention layers. Add it to the mamba_like_arch list so maybe_set_mamba_kv_cache_groups_ids() sets up the cache groups correctly.

Qwen3Next uses a hybrid GDN+attention architecture that requires separate KV cache groups for GDN vs standard attention layers. Add it to the mamba_like_arch list so maybe_set_mamba_kv_cache_groups_ids() sets up the cache groups correctly. Signed-off-by: Radoslaw Smyrek <radoslawx.smyrek@intel.com>

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Updates Gaudi HPU model runner logic to treat an additional Qwen model architecture as “mamba-like” for KV-cache group ID configuration.

Changes:

Add Qwen3NextForCausalLM to the mamba_like_arch allowlist used by maybe_set_mamba_kv_cache_groups_ids.

github-actions · 2026-05-15T22:23:31Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
54f548e9e58087f0155e4e164e416ad7efdfde6d

Copilot AI review requested due to automatic review settings May 15, 2026 16:43

rsmyrek requested review from PatrykWo, adobrzyn, afierka-intel, iboiko-habana, jbyczkow, kamil-kaczor, ksmusz, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners May 15, 2026 16:43

Copilot AI reviewed May 15, 2026

View reviewed changes

github-actions Bot mentioned this pull request May 15, 2026

🚦 Team Review Dashboard #701

Open

jbyczkow approved these changes May 19, 2026

View reviewed changes

iboiko-habana approved these changes May 19, 2026

View reviewed changes

jbyczkow merged commit d999b2e into vllm-project:main May 19, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Qwen3NextForCausalLM to mamba_like_arch#1450

Add Qwen3NextForCausalLM to mamba_like_arch#1450
jbyczkow merged 1 commit into
vllm-project:mainfrom
rsmyrek:qwen3-next-mamba-like-arch

rsmyrek commented May 15, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

github-actions Bot commented May 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

rsmyrek commented May 15, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

github-actions Bot commented May 15, 2026

✅ CI Passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants