[PERF] Speed-up of GDN attention decode part (Qwen3-Next) #31722

Merged
mgoin merged 3 commits into vllm-project:main from CentML:vadim/speedup-gdn-dec
Jan 6, 2026

Commits

Commits on Dec 31, 2025

Commits on Jan 5, 2026

Commits on Jan 6, 2026