Skip to content

[Bugfix] Split attention groups by num_heads_q for spec-decode drafts#43543

Merged
Isotr0py merged 3 commits into
vllm-project:mainfrom
lucianommartins:lucianommartins/gemma4-fix-mtp
May 27, 2026
Merged

[Bugfix] Split attention groups by num_heads_q for spec-decode drafts#43543
Isotr0py merged 3 commits into
vllm-project:mainfrom
lucianommartins:lucianommartins/gemma4-fix-mtp

Commits

Commits on May 25, 2026