
llama-server: Support --verbose-prompt #19752

Closed
a4lg wants to merge 1 commit into ggml-org:master from a4lg:server-verbose-prompt

Conversation


a4lg commented Feb 20, 2026

llama-server accepts the --verbose-prompt option, but the option has no effect.

This commit restores the logger for the full prompt input (gated behind the non-default option --verbose-prompt), which had been commented out.

Contributor

ngxson commented Feb 20, 2026

Use slots debug instead

ngxson closed this Feb 20, 2026
Author

a4lg commented Feb 26, 2026

Okay, undocumented slot debugging feature...?

How to use it?

Invoking with Slots Debug enabled

LLAMA_SERVER_SLOTS_DEBUG=1 llama-server --port 11434 [ARGS...]

How to inspect the results?

After a request has been processed, inspect the slots. Here is an example:

# curl --silent 'http://localhost:11434/slots' | jq
[
  ...,
  {
    "id": 7,
    "n_ctx": 2048,
    "speculative": false,
    "is_processing": false,
    "id_task": 0,
    ...,
    "prompt": "<|im_start|>system\nYou are Qwen, created by Alibaba Cloud. You are a helpful assistant.<|im_end|>\n<|im_start|>user\nHello!<|im_end|>\n<|im_start|>assistant\n",
    "generated": "Hello! How can I assist you today?"
  }
]
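
For scripted inspection, the same response can be reduced to just the debug text. This is a minimal Python sketch, assuming the /slots response is a JSON array shaped like the example above and that the "prompt" and "generated" fields are present only when slots debug is enabled. The helper name `summarize_slots` and the shortened sample payload are hypothetical.

```python
import json


def summarize_slots(slots_json):
    """Return (id, prompt, generated) for each slot that carries debug text."""
    slots = json.loads(slots_json)
    summary = []
    for slot in slots:
        # Assumption: "prompt" appears only when LLAMA_SERVER_SLOTS_DEBUG=1 is set,
        # so slots without it are skipped.
        if "prompt" in slot:
            summary.append((slot["id"], slot["prompt"], slot.get("generated", "")))
    return summary


# Shortened stand-in for a real /slots response (hypothetical sample data).
sample = '[{"id": 7, "n_ctx": 2048, "is_processing": false, "prompt": "Hello!", "generated": "Hi"}]'

for slot_id, prompt, generated in summarize_slots(sample):
    print(f"slot {slot_id}: prompt={prompt!r} generated={generated!r}")
```

In practice the input string would be the body returned by the curl command above rather than the inline sample.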

In router mode, you may need to specify the model name.

curl --silent 'http://localhost:11434/slots?model=Qwen%2FQwen2.5-Coder-1.5B-Instruct' | jq
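
The model name contains a slash, so it has to be percent-encoded in the query string. A small Python sketch of building that URL, assuming the same host and port as above; the helper name `slots_url` is hypothetical.

```python
from typing import Optional
from urllib.parse import urlencode


def slots_url(base_url: str, model: Optional[str] = None) -> str:
    """Build the /slots URL, adding a percent-encoded model parameter if given."""
    url = base_url.rstrip("/") + "/slots"
    if model is not None:
        # urlencode percent-encodes reserved characters such as "/" (as %2F).
        url += "?" + urlencode({"model": model})
    return url


print(slots_url("http://localhost:11434", "Qwen/Qwen2.5-Coder-1.5B-Instruct"))
# prints http://localhost:11434/slots?model=Qwen%2FQwen2.5-Coder-1.5B-Instruct
```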

Alternatives

@ngxson
I'm half convinced. Although undocumented, it's a good feature that I had overlooked.

That raises one question: what is the point of keeping --verbose-prompt, which is completely ineffective in llama-server, in the first place? If this is a valid question, I'll submit a PR to remove --verbose-prompt from llama-server instead.

What do you think?

Contributor

ngxson commented Feb 26, 2026

@a4lg It was undocumented because it was meant to be used for testing only. But feel free to add it to a new section at the end of the server docs.

If this is a valid question, I'll submit a PR to remove --verbose-prompt from llama-server instead.

Yes, please, that would be helpful. You can push it with the same PR I suggested above
