[Kernel] Gemma4 MoE decode GEMV optimization — up to 46% TPOT improvement at BS=1-8 #41379
Closed
kailashbuki wants to merge 1 commit into