[WebGPU] allow async shader compilation #25941

fs-eire · 2025-09-03T22:56:00Z

Description

Reduce the time blocked waiting for the shader to be compiled.

Motivation and Context

Try to optimize the responsiveness of the application when running ort-web in main thread. See #25882

grazder · 2025-09-04T04:27:14Z

Try to optimize the responsiveness of the application when running ort-web in main thread

I actually launch ORT-WEB in worker, so these GPU blocks appear regardless of whether it is launched in worker or in main thread

fs-eire · 2025-09-04T20:53:03Z

Try to optimize the responsiveness of the application when running ort-web in main thread

I actually launch ORT-WEB in worker, so these GPU blocks appear regardless of whether it is launched in worker or in main thread

Do you mean that the UI responsiveness problem mentioned in #25882 is caused by GPU exhausted but not caused by the UI threads running JavaScript?

grazder · 2025-09-05T07:01:03Z

Do you mean that the UI responsiveness problem mentioned in #25882 is caused by GPU exhausted but not caused by the UI threads running JavaScript?

Yes, the main problem is that when the model initialized, it causes large GPU operations (not CPU operations in the main thread) that lock up the GPU and prevent the user interface from being rendered, which is also rendered using the GPU.

The image shows that during large GPU-based operations, frames were not rendered.

qjia7 · 2025-09-05T08:26:22Z

I think the async compilation is resolving the cpu issue that gpu process is occupied a long time due to shader compilation. The UI threads' render commands have to wait on gpu process until one CreateComputePipeline is finished. So with this change, the CreateComputePipeline is moved into a gpu thread and won't block the gpu main thread so that the ui commands can send to gpu in time.
GPU busy is another issue that one ort task is too big and the ui task has to be wait on gpu. Currently we batch 16 dispatches and submit once to minimize the submit overhead. Too frequently submit will bring gpu bubbles and not friendly for small operations or models. It's challenging to determine an optimal batch size that suits all models. Maybe we could consider exposing the batch size as a session option, allowing users to customize this value to better fit their needs.

grazder · 2025-09-05T08:57:14Z

Maybe we could consider exposing the batch size as a session option, allowing users to customize this value to better fit their needs.

Yeah, that would be great

…-shader-async

Copilot

Pull Request Overview

This PR refactors the WebGPU shader compilation to use asynchronous pipeline creation, improving application responsiveness when running in the main thread. The change replaces synchronous CreateComputePipeline with CreateComputePipelineAsync to avoid blocking while waiting for shader compilation to complete.

Key Changes

ProgramManager constructor now accepts a WebGpuContext reference instead of separate device and limits parameters
Shader compilation changed from synchronous to asynchronous using CreateComputePipelineAsync with callback-based completion handling
Error handling added for async pipeline creation failures

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File	Description
onnxruntime/core/providers/webgpu/webgpu_context.cc	Updated ProgramManager instantiation to pass WebGpuContext reference
onnxruntime/core/providers/webgpu/program_manager.h	Modified constructor to accept WebGpuContext reference and updated member variables
onnxruntime/core/providers/webgpu/program_manager.cc	Implemented async shader compilation with CreateComputePipelineAsync and callback handling

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

onnxruntime/core/providers/webgpu/program_manager.cc

fs-eire · 2025-10-24T18:04:04Z

@microsoft-github-policy-service rerun

### Description Reduce the time blocked waiting for the shader to be compiled. ### Motivation and Context Try to optimize the responsiveness of the application when running ort-web in main thread. See microsoft#25882

[WebGPU] allow async shader compilation

dae0d38

fs-eire mentioned this pull request Sep 3, 2025

[Web] WebGPU first‑run model warm‑up causes long GPU‑blocking operations (maxDispatchNumber & synchronous pipeline creation) #25882

Closed

fix build break

71441fe

guschmue added the ep:WebGPU ort-web webgpu provider label Sep 4, 2025

Merge remote-tracking branch 'origin/main' into fs-eire/allow-compile…

401a744

…-shader-async

fs-eire requested a review from Copilot October 22, 2025 00:38

Copilot AI reviewed Oct 22, 2025

View reviewed changes

onnxruntime/core/providers/webgpu/program_manager.cc Show resolved Hide resolved

onnxruntime/core/providers/webgpu/program_manager.cc Show resolved Hide resolved

fs-eire requested a review from guschmue October 22, 2025 00:43

guschmue previously approved these changes Oct 22, 2025

View reviewed changes

fix build break

4fd4bcf

fs-eire dismissed guschmue’s stale review via 4fd4bcf October 22, 2025 06:11

guschmue approved these changes Oct 24, 2025

View reviewed changes

fs-eire merged commit 954bb7b into main Oct 27, 2025
93 of 94 checks passed

fs-eire deleted the fs-eire/allow-compile-shader-async branch October 27, 2025 19:10

ambroser53 mentioned this pull request Nov 7, 2025

Enable graph capture for webgpu microsoft/onnxruntime-genai#1848

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WebGPU] allow async shader compilation #25941

[WebGPU] allow async shader compilation #25941

Uh oh!

fs-eire commented Sep 3, 2025

Uh oh!

grazder commented Sep 4, 2025

Uh oh!

fs-eire commented Sep 4, 2025

Uh oh!

grazder commented Sep 5, 2025 •

edited

Loading

Uh oh!

qjia7 commented Sep 5, 2025

Uh oh!

grazder commented Sep 5, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

fs-eire commented Oct 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[WebGPU] allow async shader compilation #25941

[WebGPU] allow async shader compilation #25941

Uh oh!

Conversation

fs-eire commented Sep 3, 2025

Description

Motivation and Context

Uh oh!

grazder commented Sep 4, 2025

Uh oh!

fs-eire commented Sep 4, 2025

Uh oh!

grazder commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qjia7 commented Sep 5, 2025

Uh oh!

grazder commented Sep 5, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Key Changes

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

fs-eire commented Oct 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

grazder commented Sep 5, 2025 •

edited

Loading