webgpu: merge batchA into M dimension when batchB==1 by xhcao · Pull Request #28197 · microsoft/onnxruntime

xhcao · 2026-04-23T06:18:10Z

When M is small and batchA is large, each tile contains some invalid elements; merging batchA into the M dimension would reduce the workgroup count.

Description

Motivation and Context

When M is small and batchA is large, each tile contains some invalid elements; merging batchA into the M dimension would reduce the workgroup count.
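The effect on workgroup count can be illustrated with a little arithmetic. The following is a minimal sketch, not the onnxruntime dispatch code: the 32x32 tile size and the function names are assumptions made for illustration, and the real kernels choose their own tile sizes.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical tile sizes, chosen only for this illustration.
const uint32_t kTileM = 32;
const uint32_t kTileN = 32;

inline uint32_t CeilDiv(uint32_t a, uint32_t b) {
  assert(b != 0);
  return (a + b - 1) / b;
}

// One workgroup per (M tile, N tile) pair, repeated for every batch of A.
inline uint32_t WorkgroupsBatched(uint32_t batch_a, uint32_t m, uint32_t n) {
  return CeilDiv(m, kTileM) * CeilDiv(n, kTileN) * batch_a;
}

// batchA folded into M: a single batch with batch_a * m rows.
inline uint32_t WorkgroupsMerged(uint32_t batch_a, uint32_t m, uint32_t n) {
  return CeilDiv(batch_a * m, kTileM) * CeilDiv(n, kTileN);
}
```

Under these assumed tile sizes, batchA=64, M=4, N=128 needs 256 workgroups when dispatched per batch (each M tile holds only 4 valid rows out of 32) and 32 workgroups once batchA is merged into M.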

Copilot

Pull request overview

This PR updates the WebGPU MatMul implementation to flatten A’s batch dimensions into the effective M dimension when B has no batching (batchB==1), aiming to reduce workgroup overhead for cases with small M and large batchA. It also adds WebGPU-specific regression tests for additional 3D batched MatMul shapes.
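The reshape described in the overview can be sketched as a small shape helper. This is a hedged illustration rather than the code in `matmul.cc`: `MatMulDims`, `FlattenedShapes`, `ShouldMergeBatchIntoM`, and `Flatten` are hypothetical names, and the non-merged branch assumes the two batch counts are already broadcast-compatible.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstdint>

using Shape3 = std::array<int64_t, 3>;

// Hypothetical description of a 3D MatMul: A is {batch_a, m, k}, B is {batch_b, k, n}.
struct MatMulDims {
  int64_t batch_a, batch_b, m, k, n;
};

struct FlattenedShapes {
  Shape3 a, b, out;
};

// Condition stated in the PR: A is batched and B is not.
inline bool ShouldMergeBatchIntoM(const MatMulDims& d) {
  return d.batch_a != 1 && d.batch_b == 1;
}

inline FlattenedShapes Flatten(const MatMulDims& d) {
  assert(d.batch_a > 0 && d.batch_b > 0 && d.m > 0 && d.k > 0 && d.n > 0);
  if (ShouldMergeBatchIntoM(d)) {
    const int64_t rows = d.batch_a * d.m;  // batchA folded into M
    return FlattenedShapes{Shape3{1, rows, d.k}, Shape3{1, d.k, d.n}, Shape3{1, rows, d.n}};
  }
  // Assumption: batch counts are broadcast-compatible, so the output batch is the larger one.
  const int64_t batch_out = std::max(d.batch_a, d.batch_b);
  return FlattenedShapes{Shape3{d.batch_a, d.m, d.k}, Shape3{d.batch_b, d.k, d.n},
                         Shape3{batch_out, d.m, d.n}};
}
```

For batchA=3, batchB=1, M=2, N=3 (with an assumed K=4), the helper yields an output shape of {1, 6, 3}; when B is batched as well, the shapes are left unchanged.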

Changes:

- WebGPU MatMul: reshape A/B and treat output as {1, batchA*M, N} when batchA != 1 && batchB == 1 (applies to both the generic and Intel subgroup paths).
- Add WebGPU-only MatMul test cases covering 3D inputs with batchA=3, M=2 and N in {3,4}.
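Why the reshape preserves results can be checked on the CPU with a reference implementation. The sketch below is an illustration under stated assumptions, not the test added in `matmul_test.cc`: the function names are hypothetical, B is taken as a single {K, N} matrix shared by all batches, and any K may be used since the source does not state the K of the new tests.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Reference batched MatMul: A is row-major {batch, m, k}; B is row-major {k, n}
// and is shared by every batch (the batchB == 1 case).
inline std::vector<float> MatMulBatched(const std::vector<float>& a, const std::vector<float>& b,
                                        size_t batch, size_t m, size_t k, size_t n) {
  assert(a.size() == batch * m * k);
  assert(b.size() == k * n);
  std::vector<float> out(batch * m * n, 0.0f);
  for (size_t bi = 0; bi < batch; ++bi)
    for (size_t i = 0; i < m; ++i)
      for (size_t j = 0; j < n; ++j)
        for (size_t p = 0; p < k; ++p)
          out[(bi * m + i) * n + j] += a[(bi * m + i) * k + p] * b[p * n + j];
  return out;
}

// Same data viewed as one {batch * m, k} matrix: row (bi, i) becomes row bi * m + i,
// so the row-major buffers of A and the output do not need to be rearranged.
inline std::vector<float> MatMulMerged(const std::vector<float>& a, const std::vector<float>& b,
                                       size_t batch, size_t m, size_t k, size_t n) {
  return MatMulBatched(a, b, 1, batch * m, k, n);
}
```

Because A and the output are contiguous in row-major order, the batched view {batchA, M, K} and the merged view {1, batchA*M, K} address the same memory, which is why only the dispatch geometry changes.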

Reviewed changes

Copilot reviewed 2 out of 3 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| onnxruntime/test/providers/cpu/math/matmul_test.cc | Adds WebGPU-only test cases for 3D batched MatMul with `batchA>1` and `M>1`. |
| onnxruntime/core/providers/webgpu/vendor/intel/math/matmul.cc | Extends the Intel subgroup MatMul reshape optimization from `M==1` to all `batchA!=1 && batchB==1` cases. |
| onnxruntime/core/providers/webgpu/math/matmul.cc | Extends the generic WebGPU MatMul reshape optimization similarly, flattening batch dims into M when `batchB==1`. |

xhcao · 2026-04-24T02:54:06Z

@qjia7 @jchen10 PTAL. Do you think it is necessary to add a threshold, as Copilot suggested? Are there any correctness issues when changing the dispatch geometry here?

qjia7 · 2026-04-24T09:24:01Z

> @qjia7 @jchen10 PTAL. Do you think it is necessary to add a threshold, as Copilot suggested? Are there any correctness issues when changing the dispatch geometry here?

No. It should be a general optimization. Just update the comments in your code.

qjia7

LGTM. Please also update your PR description. Thanks.

webgpu: merge batchA into M dimension when batchB==1

c133531

When M is small and batchA is large, each tile contains some invalid elements; merging batchA into the M dimension would reduce the workgroup count.

guschmue added the ep:WebGPU ort-web webgpu provider label Apr 23, 2026

guschmue requested a review from Copilot April 23, 2026 16:09

Copilot started reviewing on behalf of guschmue April 23, 2026 16:13

Copilot AI reviewed Apr 23, 2026

Comment thread onnxruntime/core/providers/webgpu/math/matmul.cc Outdated

Comment thread onnxruntime/core/providers/webgpu/math/matmul.cc

qjia7 reviewed Apr 24, 2026

Comment thread onnxruntime/core/providers/webgpu/math/matmul.cc Outdated

Address Jiajia's comments

75377d7

qjia7 approved these changes Apr 29, 2026

guschmue enabled auto-merge (squash) April 29, 2026 16:43

guschmue merged commit 6f47410 into microsoft:main Apr 29, 2026
85 of 87 checks passed

BrewTestBot mentioned this pull request May 8, 2026

onnxruntime 1.26.0 Homebrew/homebrew-core#281672

Merged

guschmue merged 2 commits into microsoft:main from xhcao:merge-all-outer-dims
