server: (router) alloc tmp buffer on heap by ngxson · Pull Request #23159 · ggml-org/llama.cpp

ngxson · 2026-05-16T18:47:17Z

Overview

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: no

(cherry picked from commit b64739e)

* turboquant/HEAD: (82 commits)
  docs(readme): credit Google's original TurboQuant + explain the '+'
  docs(readme): fix turbo ladder ordering + cite K-compression paper
  docs(readme): reorder KV configs as a ladder + 'start light' guidance
  docs(readme): add Chronara to deployments + AtomicChat link
  docs: restructure README — professional layout, deployments, paper links
  docs: tighten README — add turbo2, missing features, paper links
  docs: keep upstream README, prepend fork-specific summary
  docs: replace upstream README with fork-specific summary
  fix(xxd.cmake): handle missing input file (not just empty)
  fix(ci): 4 cross-vendor -Werror failures + defensive xxd.cmake
  cmake : fix LLAMA_BUILD_UI logic (ggml-org#23190)
  fix(ggml-cuda): HIP nodiscard + MUSA cudaMemcpyToSymbol alias
  fix(turbo-quant): add forward declaration for turbo_cpu_fwht_inverse
  fix(metal): set ne12/ne13/r2/r3 function constants in mul_mm_tq_rotated pipeline
  webui: support video files as input (ggml-org#22830)
  server: (router) alloc tmp buffer on heap (ggml-org#23159)
  server: skip device enumeration in router mode to avoid creating CUDA primary context (ggml-org#23137)
  vulkan: removed duplicate #include <memory> in headers (ggml-org#23144)
  ui: Add request timeout for MCP tool calls (ggml-org#23138)
  sync : ggml
  ...

server: (router) alloc tmp buffer on heap

bc29dcc

ngxson requested a review from a team as a code owner May 16, 2026 18:47

Merge branch 'master' into xsn/server_nits_heap_alloc

5f1d625

ServeurpersoCom approved these changes May 16, 2026

github-actions bot added the examples and server labels May 16, 2026

ggerganov approved these changes May 16, 2026

ngxson merged commit b64739e into ggml-org:master May 16, 2026
44 of 49 checks passed

kgrama pushed a commit to kgrama/llama.cpp that referenced this pull request May 19, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

28ff23d

xxmustafacooTR pushed a commit to xxPlayground/llama-cpp-turboquant that referenced this pull request May 19, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

4ea622d

rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 19, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

b67d98b

ArberSephirotheca pushed a commit to ArberSephirotheca/llama.cpp that referenced this pull request May 19, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

425ba5f

baramofme pushed a commit to baramofme/llama-cpp-turboquant that referenced this pull request May 23, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

08f68a0

srossitto79 pushed a commit to srossitto79/llama.cpp that referenced this pull request May 23, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

07656b2

carlosfundora pushed a commit to carlosfundora/llama.cpp-1-bit-turbo that referenced this pull request May 24, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

9189d30

(cherry picked from commit b64739e)

winstonma pushed a commit to winstonma/llama.cpp that referenced this pull request May 27, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

6b58db2

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

server: (router) alloc tmp buffer on heap (ggml-org#23159)

16fbc96

ngxson merged 2 commits into ggml-org:master from ngxson:xsn/server_nits_heap_alloc

3 participants
