
metal : reduce rset heartbeat from 500ms -> 5ms #24074

Merged
ggerganov merged 1 commit into master from gg/metal-reduce-rset-heart-beat on Jun 4, 2026


ggerganov commented Jun 3, 2026

Overview

Continuation of #17766.

Make the heartbeat loop of the Metal backend, which keeps the residency sets alive, run more frequently: every 5 ms instead of every 500 ms. This reduces the time it takes a ggml application to exit on macOS.
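To illustrate why the interval matters for exit time, here is a minimal C++ sketch of a generic sleep-based heartbeat thread. It is not the ggml Metal code: the `Heartbeat` class, the `shutdown_latency_ms` helper, and the assumption that shutdown joins a loop which only checks its stop flag once per sleep interval are all hypothetical. Under that assumption, the wait at shutdown is bounded by roughly one interval, so a shorter interval means a faster exit.

```cpp
#include <atomic>
#include <chrono>
#include <thread>

// Hypothetical stand-in for a backend heartbeat loop (not the ggml-metal code).
class Heartbeat {
public:
    explicit Heartbeat(std::chrono::milliseconds interval)
        : interval_(interval), worker_([this] { run(); }) {}

    ~Heartbeat() { stop(); }

    // Ask the loop to exit, wait for it, and return how long the wait took.
    std::chrono::milliseconds stop() {
        const auto t0 = std::chrono::steady_clock::now();
        stop_.store(true);
        if (worker_.joinable()) {
            worker_.join();
        }
        return std::chrono::duration_cast<std::chrono::milliseconds>(
            std::chrono::steady_clock::now() - t0);
    }

    long ticks() const { return ticks_.load(); }

private:
    void run() {
        while (!stop_.load()) {
            ticks_.fetch_add(1);                     // placeholder for the keep-alive work
            std::this_thread::sleep_for(interval_);  // stop flag is not observed during this sleep
        }
    }

    // worker_ is declared last so the other members are initialized
    // before the thread starts running.
    std::chrono::milliseconds interval_;
    std::atomic<bool> stop_{false};
    std::atomic<long> ticks_{0};
    std::thread worker_;
};

// Hypothetical helper: start a heartbeat, wait for its first tick,
// then measure how long stopping it takes (in milliseconds).
long long shutdown_latency_ms(int interval_ms) {
    Heartbeat hb{std::chrono::milliseconds(interval_ms)};
    while (hb.ticks() < 1) {
        std::this_thread::sleep_for(std::chrono::milliseconds(1));
    }
    return hb.stop().count();
}
```

Calling `shutdown_latency_ms(500)` and `shutdown_latency_ms(5)` in this sketch should show the stop call blocking for close to the full interval in the first case and only a few milliseconds in the second. The trade-off is that the loop wakes up more often; a condition variable with a timed wait would be another way to get a prompt exit without shortening the interval.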


ggerganov requested a review from a team as a code owner Jun 3, 2026 12:52
github-actions bot added the "ggml" and "Apple Metal" labels Jun 3, 2026
ggerganov merged commit 3d19986 into master Jun 4, 2026
26 checks passed
ggerganov deleted the gg/metal-reduce-rset-heart-beat branch Jun 4, 2026 05:05
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Jun 4, 2026
* origin/master: (57 commits)
server : disable on-device spec checkpoints (ggml-org#24108)
arg: fix double mtp downloads (ggml-org#24128)
webui: [a11y] fix keyboard navigation issues in chat interface and sidebar (ggml-org#23132)
Move duplicated imatrix code into single common imatrix-loader.cpp (ggml-org#22445)
ui: Fixed packages (ggml-org#24119)
ui: added single line reasoning preview (ggml-org#23601)
return filter to save memory (ggml-org#24125)
convert: Fix Gemma 4 Unified conversion (ggml-org#24118)
ggml: vectorize ggml_vec_dot_q4_1_q8_1 with WASM SIMD128 (ggml-org#22209)
server: avoid unnecessary checkpoint restore when new tokens are present (ggml-org#24110)
agents: refactor, include more guidelines (ggml-org#24111)
webui: fix tool selector toggle/counter, key tools by stable identity (ggml-org#24065)
build : use umbrella Headers directory for XCFramework module map (ggml-org#23974)
server : add header to tools/server/server-http.h (ggml-org#24089)
cmake: skip cvector-generator and export-lora when CPU backend is disabled (ggml-org#24053)
fix(mtmd): handle Gemma 4 audio projector embedding size (ggml-org#24091)
readme : add status badges (ggml-org#24104)
tests : refactor test-save-load-state to accept token input (ggml-org#24073)
metal : reduce rset heartbeat from 500ms -> 5ms (ggml-org#24074)
ggml-webgpu: FlashAttention refactor + standardize quantization support (ggml-org#23834)
...