opencl: batch profiling to prevent resource exhaustion by shaofeiqi · Pull Request #23495 · ggml-org/llama.cpp

shaofeiqi · 2026-05-21T20:07:26Z

Overview

With GGML_OPENCL_PROFILING enabled, events accumulate until shutdown, causing resource exhaustion and slowdown.

This PR refactors the OpenCL profiling logic to batch and flush profiling data, preventing memory leak, while also improving overall profiling efficiency.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: No

…ml-org#23495)

shaofeiqi requested a review from a team as a code owner May 21, 2026 20:07

github-actions Bot added ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend labels May 22, 2026

opencl: batch profiling to improve speed and prevent memory leaks

0bd24d9

lhez force-pushed the sq/opencl-batch-profiling branch from 30ac728 to 0bd24d9 Compare May 24, 2026 04:06

lhez approved these changes May 24, 2026

View reviewed changes

lhez requested a review from max-krasnyansky May 24, 2026 04:07

max-krasnyansky approved these changes May 24, 2026

View reviewed changes

lhez merged commit f306111 into ggml-org:master May 24, 2026
60 of 61 checks passed

spiritbuun pushed a commit to spiritbuun/buun-llama-cpp that referenced this pull request May 25, 2026

opencl: batch profiling to improve speed and prevent memory leaks (gg…

829357a

…ml-org#23495)

a-ghorbani mentioned this pull request May 25, 2026

chore(deps): upgrade llama.rn to 0.12.4 a-ghorbani/pocketpal-ai#743

Merged

7 tasks

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

opencl: batch profiling to improve speed and prevent memory leaks (gg…

5fe0831

…ml-org#23495)

turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026

opencl: batch profiling to improve speed and prevent memory leaks (gg…

0ccf942

…ml-org#23495)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

opencl: batch profiling to prevent resource exhaustion#23495

opencl: batch profiling to prevent resource exhaustion#23495
lhez merged 1 commit into
ggml-org:masterfrom
qualcomm:sq/opencl-batch-profiling

shaofeiqi commented May 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

shaofeiqi commented May 21, 2026

Overview

Requirements

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants