opencl: fix crash when warming up MoE on Adreno by lhez · Pull Request #22876 · ggml-org/llama.cpp

lhez · 2026-05-09T17:23:08Z

Overview

When warming up MoE models on Adreno (in this case, gpt-oss-20b-mxfp4), it crashes with invalid workgroup size.

This is because the warmup run ne20 = 128 (use all experts) and the workgroup size ends up exceeding the max workgroup size of 1024. During a normal run, ne20 is the number of used experts and the workgroup size does not exceed the max workgroup size.

llama.cpp/ggml/src/ggml-opencl/ggml-opencl.cpp

Lines 12788 to 12790 in 1e5ad35

    
           size_t histogram_global_size[] = {(size_t)(((ne21 + 63) / 64) * 64), static_cast<size_t>(ne20), 1}; 
        
           size_t histogram_local_size[] = {64, static_cast<size_t>(ne20), 1}; 
        
           backend_ctx->enqueue_ndrange_kernel(kernel, 3, histogram_global_size, histogram_local_size, src);

Additional information

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: Asked AI to investigate

lhez · 2026-05-13T04:48:24Z

@ggml-org/maintainers Can I get another approval?

(cherry picked from commit 1e4579f)

opencl: fix crash when warming up MoE on Adreno

cc90d49

github-actions Bot added ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend labels May 9, 2026

lhez marked this pull request as ready for review May 12, 2026 06:35

lhez requested a review from a team as a code owner May 12, 2026 06:35

max-krasnyansky approved these changes May 12, 2026

View reviewed changes

CISC approved these changes May 13, 2026

View reviewed changes

lhez merged commit 1e4579f into ggml-org:master May 13, 2026
77 of 78 checks passed

xxmustafacooTR pushed a commit to xxPlayground/llama-cpp-turboquant that referenced this pull request May 13, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

2f1d1c8

dandm1 pushed a commit to dandm1/llama.cpp that referenced this pull request May 16, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

3fe064e

rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 19, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

7752e60

ArberSephirotheca pushed a commit to ArberSephirotheca/llama.cpp that referenced this pull request May 19, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

b7b670e

baramofme pushed a commit to baramofme/llama-cpp-turboquant that referenced this pull request May 23, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

91877a3

carlosfundora pushed a commit to carlosfundora/llama.cpp-1-bit-turbo that referenced this pull request May 24, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

605b578

(cherry picked from commit 1e4579f)

winstonma pushed a commit to winstonma/llama.cpp that referenced this pull request May 27, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

e50f084

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

opencl: fix crash when warming up MoE on Adreno (ggml-org#22876)

10e4d65

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

opencl: fix crash when warming up MoE on Adreno#22876

opencl: fix crash when warming up MoE on Adreno#22876
lhez merged 1 commit into
ggml-org:masterfrom
qualcomm:lh/fix-moe-warmup-crash

lhez commented May 9, 2026

Uh oh!

lhez commented May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	size_t histogram_global_size[] = {(size_t)(((ne21 + 63) / 64) * 64), static_cast<size_t>(ne20), 1};
	size_t histogram_local_size[] = {64, static_cast<size_t>(ne20), 1};
	backend_ctx->enqueue_ndrange_kernel(kernel, 3, histogram_global_size, histogram_local_size, src);

Conversation

lhez commented May 9, 2026

Overview

Additional information

Requirements

Uh oh!

lhez commented May 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants