ggml-webgpu: Support GPU profiling beyond the maximum query count by yomaytk · Pull Request #22995 · ggml-org/llama.cpp

yomaytk · 2026-05-13T00:42:36Z

Overview

This PR fixes the bug described in the Additional Information section.

It flushes the timestamp slots and resets the timestamp state when nearly all of the timestamp slots are in use.

I confirmed that GPU profiles can now be collected for Qwen3.5-35B-A3B-GGUF and several other models (Qwen3.5, Qwen3.6, Gemma 4, and Llama 3).

Additional Information

I noticed that unsloth/Qwen3.5-35B-A3B-GGUF overflowed the timestamp QuerySet when I tried to collect a GPU profile:

llama.cpp/ggml/src/ggml-webgpu/ggml-webgpu.cpp:571: GGML_ASSERT(ctx->profile_timestamp_query_count + 2 <= WEBGPU_MAX_PROFILE_QUERY_COUNT) failed

This suggests that we need logic to allow profile collection even when a model requires more than 4096 timestamp queries.
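To make the flush-before-overflow idea concrete, here is a minimal, self-contained sketch. It does not use the WebGPU API and is not the code from this PR: `TimestampProfiler`, `record`, `flush`, and `MAX_QUERY_COUNT` are hypothetical names, and the assumption that each profiled operation consumes two slots (begin and end) is inferred from the `+ 2` in the assertion above. Where a real backend would resolve the QuerySet and read the results back, the sketch simply computes durations on the CPU.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the backend's profiling state; these are not the
// actual ggml-webgpu data structures.
constexpr std::size_t MAX_QUERY_COUNT = 4096;  // mirrors the 4096-query limit

struct TimestampProfiler {
    std::vector<std::uint64_t> slots = std::vector<std::uint64_t>(MAX_QUERY_COUNT, 0);
    std::size_t used = 0;                  // slots written since the last flush
    std::size_t flush_count = 0;           // number of non-empty flushes so far
    std::vector<std::uint64_t> durations;  // accumulated (end - begin) results

    // Drain every completed begin/end pair and reset the slot counter.
    // A real backend would resolve the QuerySet and read it back here.
    void flush() {
        for (std::size_t i = 0; i + 1 < used; i += 2) {
            durations.push_back(slots[i + 1] - slots[i]);
        }
        if (used > 0) {
            ++flush_count;
        }
        used = 0;
    }

    // Record one profiled operation, which needs two slots (assumption).
    void record(std::uint64_t begin, std::uint64_t end) {
        if (used + 2 > MAX_QUERY_COUNT) {
            flush();  // make room instead of failing an assertion
        }
        slots[used++] = begin;
        slots[used++] = end;
    }
};
```

With this shape, a workload that needs more than 4096 timestamps is processed in batches of at most 2048 operations, and a final `flush()` call collects whatever remains in the slots.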

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES - I used AI to investigate the WebGPU specification

reeselevine · 2026-05-13T16:30:06Z

thanks, this is a nice clean addition!

flush the gpu profile timestamp before the queryset is overflowed

5576c7d

yomaytk requested a review from a team as a code owner May 13, 2026 00:42

github-actions Bot added the ggml (changes relating to the ggml tensor library for machine learning) and WebGPU labels May 13, 2026

reeselevine approved these changes May 13, 2026

reeselevine requested a review from CISC May 13, 2026 16:30

CISC approved these changes May 13, 2026

reeselevine merged commit 527045b into ggml-org:master May 13, 2026
46 checks passed

xxmustafacooTR pushed a commit to xxPlayground/llama-cpp-turboquant that referenced this pull request May 13, 2026

flush the gpu profile timestamp before the queryset is overflowed (gg…

141b1df

…ml-org#22995)

yomaytk deleted the new-flush-gpu-profile branch May 18, 2026 13:18

rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 19, 2026

flush the gpu profile timestamp before the queryset is overflowed (gg…

9c59549

…ml-org#22995)

ArberSephirotheca pushed a commit to ArberSephirotheca/llama.cpp that referenced this pull request May 19, 2026

flush the gpu profile timestamp before the queryset is overflowed (gg…

9cb54a2

…ml-org#22995)

baramofme pushed a commit to baramofme/llama-cpp-turboquant that referenced this pull request May 23, 2026

flush the gpu profile timestamp before the queryset is overflowed (gg…

d863d26

…ml-org#22995)

carlosfundora pushed a commit to carlosfundora/llama.cpp-1-bit-turbo that referenced this pull request May 24, 2026

flush the gpu profile timestamp before the queryset is overflowed (gg…

1341a93

…ml-org#22995) (cherry picked from commit 527045b)

winstonma pushed a commit to winstonma/llama.cpp that referenced this pull request May 27, 2026

flush the gpu profile timestamp before the queryset is overflowed (gg…

f711259

…ml-org#22995)

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

flush the gpu profile timestamp before the queryset is overflowed (gg…

09f4c4b

…ml-org#22995)

reeselevine merged 1 commit into ggml-org:master from yomaytk:new-flush-gpu-profile
