@qjia7 commented Mar 28, 2025

This PR uses a 1D dispatch group size and uses workgroup_idx instead of workgroup_id.x|workgroup_id.y in case they are normalized.
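The idea is that some backends may normalize the per-dimension workgroup IDs, so the shader instead dispatches a single flattened 1D grid and derives the logical 2D coordinates from the flat index itself. A minimal Python sketch of that mapping (the helper names here are hypothetical, not ONNX Runtime APIs):

```python
def flatten_dispatch(dispatch_x: int, dispatch_y: int) -> int:
    """Total number of workgroups for a 1D dispatch covering an x*y grid."""
    return dispatch_x * dispatch_y

def logical_coords(workgroup_idx: int, dispatch_x: int) -> tuple[int, int]:
    """Recover the logical (x, y) from the flat index.

    Mirrors what a WGSL kernel can compute from workgroup_idx instead of
    reading workgroup_id.x / workgroup_id.y, which may be normalized.
    """
    return workgroup_idx % dispatch_x, workgroup_idx // dispatch_x

# Every flat index maps to a unique (x, y) cell and the whole grid is covered.
coords = {logical_coords(i, 4) for i in range(flatten_dispatch(4, 3))}
assert coords == {(x, y) for y in range(3) for x in range(4)}
```

Because the mapping is a plain modulo/division, the 1D dispatch covers exactly the same logical grid as the original 2D dispatch, with no dependence on how the runtime reports the per-dimension IDs.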

@guschmue added the ep:WebGPU ort-web webgpu provider label Mar 28, 2025
@snnn closed this Apr 3, 2025
@snnn reopened this Apr 3, 2025
@fs-eire previously approved these changes Apr 4, 2025
@fs-eire merged commit a1186f6 into main Apr 7, 2025
87 of 89 checks passed
@fs-eire deleted the attention_1d_dispatch_groups branch April 7, 2025 13:36
zhaoxul-qti pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Apr 17, 2025
This PR uses a 1D dispatch group size and uses workgroup_idx instead of workgroup_id.x|workgroup_id.y in case they are normalized.
qjia7 added a commit that referenced this pull request Apr 25, 2025
Fixed the bug in #24228 which caused incorrect results for phi models when flash attention is disabled.
vraspar pushed a commit that referenced this pull request Apr 28, 2025
Fixed the bug in #24228 which caused incorrect results for phi models when flash attention is disabled.
ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request May 12, 2025
Fixed the bug in microsoft#24228 which caused incorrect results for phi models when flash attention is disabled.
