app : show version #23426

Merged
angt merged 1 commit into ggml-org:master from angt:app-show-version
May 21, 2026

Conversation

@angt
Member

@angt angt commented May 20, 2026

Overview

cont #23296

Additional information

Requirements

Signed-off-by: Adrien Gallouët <angt@huggingface.co>
@angt angt merged commit ce02093 into ggml-org:master May 21, 2026
49 checks passed
ProTekk pushed a commit to ProTekk/buun-llama-cpp that referenced this pull request May 21, 2026
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request May 21, 2026
* origin/master: (138 commits)
fix(flash-attn): replace f32 with kv_type and q_type (ggml-org#23372)
tests : move save-load-state from examples to tests (ggml-org#23336)
server: expose prompt token counts in /slots endpoint (ggml-org#23454)
metal : optimize concat kernel and fix set kernel threads (ggml-org#23411)
server : free draft/MTP resources on sleep to fix VRAM leak (ggml-org#23461)
server: re-inject subcommand when router spawns children under unified binary (ggml-org#23442)
app : add batched-bench, fit-params, quantize & perplexity (ggml-org#23459)
mtp: use inp_out_ids for skipping logit computation (ggml-org#23433)
vocab : add Carbon-3B (HybridDNATokenizer) support (ggml-org#23410)
doc: fix spec mtp typo (ggml-org#23435)
ui: Improve Git Hooks for UI development (ggml-org#23403)
ggml : Check the right iface method before using the fallback 2d get (ggml-org#23306)
llama-graph: fix null-buffer crash in llm_graph_input_attn_kv_iswa for SWA-only models (ggml-org#23131)
hexagon: ssm-conv fix for large prompts (ggml-org#23307)
app : show version (ggml-org#23426)
mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (ggml-org#23329)
ui: Add max image size option (ggml-org#22849)
Move to backend sampling for MTP draft path (ggml-org#23287)
opencl: refactor backend initilization (ggml-org#23318)
common/speculative : fix nullptr crash in get_devices_str (ggml-org#23386)
...
baramofme pushed a commit to baramofme/llama-cpp-turboquant that referenced this pull request May 23, 2026
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
Jcfunk added a commit to Jcfunk/llama.cpp that referenced this pull request May 23, 2026
* upstream/HEAD: (38 commits)
  vocab : add Carbon-3B (HybridDNATokenizer) support (ggml-org#23410)
  doc: fix spec mtp typo (ggml-org#23435)
  ui: Improve Git Hooks for UI development (ggml-org#23403)
  ggml : Check the right iface method before using the fallback 2d get (ggml-org#23306)
  llama-graph: fix null-buffer crash in llm_graph_input_attn_kv_iswa for SWA-only models (ggml-org#23131)
  hexagon: ssm-conv fix for large prompts (ggml-org#23307)
  app : show version (ggml-org#23426)
  mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (ggml-org#23329)
  ui: Add max image size option (ggml-org#22849)
  Move to backend sampling for MTP draft path (ggml-org#23287)
  opencl: refactor backend initilization (ggml-org#23318)
  common/speculative : fix nullptr crash in get_devices_str (ggml-org#23386)
  mtmd : DeepSeek-OCR image processing fixes, img_tool::resize padding refactor (ggml-org#23345)
  vulkan: optimize operations in the IM2COL shader (ggml-org#22685)
  feat: Add WAV MIME type variants and improve audio format detection (ggml-org#23396)
  hexagon: HMX quantized matmul rework (ggml-org#23368)
  Programmatic Dependent Launch (PDL) for more performance on newer NVIDIA GPUs (Hopper+) (ggml-org#22522)
  app : introduce the llama unified executable (ggml-org#23296)
  refactor: Move text attachments up before the message content in chat completions payload (ggml-org#23406)
  mtmd: fit_params now take into account mmproj (ggml-org#21489)
  ...
srossitto79 pushed a commit to srossitto79/llama.cpp that referenced this pull request May 23, 2026
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026
Signed-off-by: Adrien Gallouët <angt@huggingface.co>

3 participants