llama-server: Support `--verbose-prompt` #19752
Conversation
llama-server has the option `--verbose-prompt`, but it was ineffective. This commit recovers the logger for full prompt input, which was originally commented out (gated behind the non-default option `--verbose-prompt`).
Force-pushed from aa0925f to da2e13c
Use slots debug instead
Okay, an undocumented slot debugging feature...? How do I use it?

**Invoking with slots debug enabled**

```sh
LLAMA_SERVER_SLOTS_DEBUG=1 llama-server --port 11434 [ARGS...]
```

**How to inspect the results**

After the process finishes, inspect:

```sh
curl --silent 'http://localhost:11434/slots' | jq
```

```json
[
  ...,
  {
    "id": 7,
    "n_ctx": 2048,
    "speculative": false,
    "is_processing": false,
    "id_task": 0,
    ...,
    "prompt": "<|im_start|>system\nYou are Qwen, created by Alibaba Cloud. You are a helpful assistant.<|im_end|>\n<|im_start|>user\nHello!<|im_end|>\n<|im_start|>assistant\n",
    "generated": "Hello! How can I assist you today?"
  }
]
```

You may need to specify the model name in router mode:

```sh
curl --silent 'http://localhost:11434/slots?model=Qwen%2FQwen2.5-Coder-1.5B-Instruct' | jq
```

**Alternatives**

@ngxson Then I came up with one question: what's the point of leaving `--verbose-prompt` in place? What do you think?
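As a side note, once the `/slots` payload is in hand, pulling out just the prompt/generated pairs is straightforward. Here is a minimal sketch operating on a sample payload shaped like the response above (the field names come from that example; fetching from a live server, e.g. via `curl`, is left out so the snippet stands alone):

```python
import json

# Sample /slots payload, shaped like the response shown above.
# In practice you would fetch it first, e.g.:
#   curl --silent 'http://localhost:11434/slots'
slots_json = """
[
  {
    "id": 7,
    "n_ctx": 2048,
    "is_processing": false,
    "prompt": "<|im_start|>user\\nHello!<|im_end|>\\n",
    "generated": "Hello! How can I assist you today?"
  }
]
"""

def extract_exchanges(payload: str) -> list[tuple[int, str, str]]:
    """Return (slot id, prompt, generated) triples from a /slots response."""
    return [
        (slot["id"], slot["prompt"], slot["generated"])
        for slot in json.loads(payload)
        # prompt/generated are only present with slots debug enabled
        if "prompt" in slot and "generated" in slot
    ]

for slot_id, prompt, generated in extract_exchanges(slots_json):
    print(f"slot {slot_id}: {generated!r}")
```

This is just a convenience over `jq`; for ad-hoc inspection, piping through `jq` as shown above is usually enough.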
@a4lg It was undocumented because it was meant to be used for testing only. But feel free to add it to a new section at the end of the server docs.
Yes, please, that would be helpful. You can push it with the same PR I suggested above.