Conversation

@ggerganov
Member

  • Add "model_alias" to /props endpoint
  • Render the model alias when specified
llama-server -hf ggml-org/gpt-oss-20b-GGUF --jinja -c 0 --port 8033 --alias gpt-oss-20-example
(screenshot: webui rendering the model alias "gpt-oss-20-example")
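For context, a client can read the new field from the `/props` endpoint. A minimal sketch in TypeScript, assuming the server started with the command above is listening on port 8033, and assuming a fallback on the pre-existing `model_path` field when no alias is set:

```ts
// Minimal sketch: fetch /props and prefer the alias when one was set.
// Assumes the llama-server instance from the command above (port 8033).
// The fallback to model_path, and an unset alias arriving as an empty
// string, are assumptions about the response shape, not confirmed here.
async function getDisplayedModelName(baseUrl: string): Promise<string> {
  const res = await fetch(`${baseUrl}/props`);
  if (!res.ok) throw new Error(`GET /props failed: ${res.status}`);
  const props = await res.json();
  // model_alias is the field this PR adds; fall back to the model path.
  return props.model_alias || props.model_path || "unknown model";
}

getDisplayedModelName("http://localhost:8033")
  .then((name) => console.log(name)); // e.g. "gpt-oss-20-example"
```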

@allozaur
Collaborator

allozaur commented Nov 2, 2025

Looking good to me!

@CISC
Collaborator

CISC commented Nov 2, 2025

Not a massively helpful message, but I guess something needs to be fixed :)
https://github.com/ggml-org/llama.cpp/actions/runs/19014962680/job/54301478932?pr=16943#step:6:15

@allozaur
Collaborator

allozaur commented Nov 2, 2025

> Not a massively helpful message, but I guess something needs to be fixed :)
>
> https://github.com/ggml-org/llama.cpp/actions/runs/19014962680/job/54301478932?pr=16943#step:6:15

I hadn't seen that check failing in CI. @ggerganov you can just run `npm run format` locally and push.

@ggerganov ggerganov force-pushed the gg/server-use-alias branch from e63315b to 0e42f25 on November 3, 2025 09:33
@allozaur allozaur merged commit 48bd265 into master Nov 3, 2025
68 of 70 checks passed
@ggerganov ggerganov deleted the gg/server-use-alias branch November 3, 2025 13:46
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Nov 3, 2025
* origin/master: (169 commits)
opencl: support imrope (ggml-org#16914)
fix: Viewing multiple PDF attachments (ggml-org#16974)
model-conversion : pass config to from_pretrained (ggml-org#16963)
server : add props.model_alias (ggml-org#16943)
ggml: CUDA: add head size 72 for flash-attn (ggml-org#16962)
mtmd: add --image-min/max-tokens (ggml-org#16921)
mtmd: pad mask for qwen2.5vl (ggml-org#16954)
ggml : LoongArch fixes (ggml-org#16958)
sync: minja (glm 4.6 & minmax m2 templates) (ggml-org#16949)
SYCL: optimized repeat_back kernel (3× fewer asm instructions, 2× faster) (ggml-org#16869)
feat(webui): improve LaTeX rendering with currency detection (ggml-org#16508)
test-backend-ops : fix segfault in moe-expert-reduce test in support mode and coverage (ggml-org#16936)
ci : disable failing riscv cross build (ggml-org#16952)
model: add Janus Pro for image understanding (ggml-org#16906)
clip : use FA (ggml-org#16837)
server : support unified cache across slots (ggml-org#16736)
common : move gpt-oss reasoning processing to init params (ggml-org#16937)
docs: remove llama_sampler_accept reference in sampling sample usage (ggml-org#16920)
CUDA: add FLOOR, CEIL, ROUND, TRUNC unary ops (ggml-org#16917)
devops: fix failing s390x docker build (ggml-org#16918)
...
GittyBurstein pushed a commit to yael-works/llama.cpp that referenced this pull request Nov 5, 2025
* server : add props.model_alias

* webui : npm run format

5 participants