app : show version by angt · Pull Request #23426 · ggml-org/llama.cpp

angt · 2026-05-20T15:19:11Z

Overview

Additional information

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: NO

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* origin/master: (138 commits)
  fix(flash-attn): replace f32 with kv_type and q_type (ggml-org#23372)
  tests : move save-load-state from examples to tests (ggml-org#23336)
  server: expose prompt token counts in /slots endpoint (ggml-org#23454)
  metal : optimize concat kernel and fix set kernel threads (ggml-org#23411)
  server : free draft/MTP resources on sleep to fix VRAM leak (ggml-org#23461)
  server: re-inject subcommand when router spawns children under unified binary (ggml-org#23442)
  app : add batched-bench, fit-params, quantize & perplexity (ggml-org#23459)
  mtp: use inp_out_ids for skipping logit computation (ggml-org#23433)
  vocab : add Carbon-3B (HybridDNATokenizer) support (ggml-org#23410)
  doc: fix spec mtp typo (ggml-org#23435)
  ui: Improve Git Hooks for UI development (ggml-org#23403)
  ggml : Check the right iface method before using the fallback 2d get (ggml-org#23306)
  llama-graph: fix null-buffer crash in llm_graph_input_attn_kv_iswa for SWA-only models (ggml-org#23131)
  hexagon: ssm-conv fix for large prompts (ggml-org#23307)
  app : show version (ggml-org#23426)
  mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (ggml-org#23329)
  ui: Add max image size option (ggml-org#22849)
  Move to backend sampling for MTP draft path (ggml-org#23287)
  opencl: refactor backend initilization (ggml-org#23318)
  common/speculative : fix nullptr crash in get_devices_str (ggml-org#23386)
  ...

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

* upstream/HEAD: (38 commits)
  vocab : add Carbon-3B (HybridDNATokenizer) support (ggml-org#23410)
  doc: fix spec mtp typo (ggml-org#23435)
  ui: Improve Git Hooks for UI development (ggml-org#23403)
  ggml : Check the right iface method before using the fallback 2d get (ggml-org#23306)
  llama-graph: fix null-buffer crash in llm_graph_input_attn_kv_iswa for SWA-only models (ggml-org#23131)
  hexagon: ssm-conv fix for large prompts (ggml-org#23307)
  app : show version (ggml-org#23426)
  mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (ggml-org#23329)
  ui: Add max image size option (ggml-org#22849)
  Move to backend sampling for MTP draft path (ggml-org#23287)
  opencl: refactor backend initilization (ggml-org#23318)
  common/speculative : fix nullptr crash in get_devices_str (ggml-org#23386)
  mtmd : DeepSeek-OCR image processing fixes, img_tool::resize padding refactor (ggml-org#23345)
  vulkan: optimize operations in the IM2COL shader (ggml-org#22685)
  feat: Add WAV MIME type variants and improve audio format detection (ggml-org#23396)
  hexagon: HMX quantized matmul rework (ggml-org#23368)
  Programmatic Dependent Launch (PDL) for more performance on newer NVIDIA GPUs (Hopper+) (ggml-org#22522)
  app : introduce the llama unified executable (ggml-org#23296)
  refactor: Move text attachments up before the message content in chat completions payload (ggml-org#23406)
  mtmd: fit_params now take into account mmproj (ggml-org#21489)
  ...

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

app : show version

eab5099

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

ggerganov approved these changes May 20, 2026

ngxson approved these changes May 20, 2026

angt merged commit ce02093 into ggml-org:master May 21, 2026
49 checks passed

ProTekk pushed a commit to ProTekk/buun-llama-cpp that referenced this pull request May 21, 2026

app : show version (ggml-org#23426)

17f077d

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

nyo16 mentioned this pull request May 21, 2026

Bump llama.cpp to 52fb93a2b (30 commits) nyo16/llama_cpp_ex#42

Merged

4 tasks

baramofme pushed a commit to baramofme/llama-cpp-turboquant that referenced this pull request May 23, 2026

app : show version (ggml-org#23426)

b10903c

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

srossitto79 pushed a commit to srossitto79/llama.cpp that referenced this pull request May 23, 2026

app : show version (ggml-org#23426)

f96876b

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

app : show version (ggml-org#23426)

c409100

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026

app : show version (ggml-org#23426)

41dbf66

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

app : show version #23426
angt merged 1 commit into ggml-org:master from angt:app-show-version

angt commented May 20, 2026 • edited by ggerganov

3 participants
