opencl: batch profiling to prevent resource exhaustion #23495

Merged: lhez merged 1 commit into ggml-org:master from qualcomm:sq/opencl-batch-profiling on May 24, 2026

@shaofeiqi
Contributor

Overview

With GGML_OPENCL_PROFILING enabled, events accumulate until shutdown, causing resource exhaustion and slowdown.

This PR refactors the OpenCL profiling logic to batch and flush profiling data, which prevents the memory leak and also improves overall profiling efficiency.
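The batch-and-flush idea described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `BatchedProfiler`, `PendingEvent`, `ProfilingRecord`, and the `release` callback are hypothetical names, and the timestamps are plain integers standing in for values that a real backend would read with `clGetEventProfilingInfo` before calling `clReleaseEvent` on each event.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a cl_event together with the kernel it timed.
struct PendingEvent {
    std::string   kernel_name;
    std::uint64_t start_ns;  // assumption: would come from CL_PROFILING_COMMAND_START
    std::uint64_t end_ns;    // assumption: would come from CL_PROFILING_COMMAND_END
};

// Small, plain record kept after the underlying event has been released.
struct ProfilingRecord {
    std::string   kernel_name;
    std::uint64_t duration_ns;
};

class BatchedProfiler {
public:
    // `release` models freeing the driver-side event (e.g. clReleaseEvent).
    BatchedProfiler(std::size_t batch_size,
                    std::function<void(const PendingEvent &)> release)
        : batch_size_(batch_size), release_(std::move(release)) {
        assert(batch_size_ > 0);
    }

    // Queue an event; flush once the batch is full so that live events
    // never exceed batch_size_ at any point in the run.
    void record(PendingEvent ev) {
        pending_.push_back(std::move(ev));
        if (pending_.size() >= batch_size_) {
            flush();
        }
    }

    // Convert pending events into records and release them. Call this once
    // more at shutdown to drain a final, partially filled batch.
    void flush() {
        for (const auto & ev : pending_) {
            records_.push_back({ev.kernel_name, ev.end_ns - ev.start_ns});
            release_(ev);
        }
        pending_.clear();
    }

    std::size_t pending_count() const { return pending_.size(); }
    const std::vector<ProfilingRecord> & records() const { return records_; }

private:
    std::size_t batch_size_;
    std::function<void(const PendingEvent &)> release_;
    std::vector<PendingEvent>    pending_;
    std::vector<ProfilingRecord> records_;
};
```

The point of the sketch is that only the lightweight records grow with the length of the run, while the number of outstanding event handles stays bounded by the batch size. The batch size itself is an assumed tuning parameter; the PR description does not state what value, if any, the actual change uses.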

Requirements

@shaofeiqi requested a review from a team as a code owner May 21, 2026 20:07
@github-actions (bot) added the ggml and OpenCL labels May 22, 2026
@lhez force-pushed the sq/opencl-batch-profiling branch from 30ac728 to 0bd24d9 May 24, 2026 04:06
@lhez requested a review from max-krasnyansky May 24, 2026 04:07
@lhez merged commit f306111 into ggml-org:master May 24, 2026
60 of 61 checks passed
spiritbuun pushed a commit to spiritbuun/buun-llama-cpp that referenced this pull request May 25, 2026
fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026
turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026
Labels

ggml (changes relating to the ggml tensor library for machine learning), OpenCL (issues specific to the OpenCL backend)

3 participants