UPSTREAM PR #16603: llama-cli: add support for reasoning by DajanaV · Pull Request #135 · auroralabs-loci/llama.cpp

DajanaV · 2025-11-08T14:34:52Z

This change adds a "partial formatter" that processes partially collected messages (like the server streaming logic) in order to render reasoning logic prior to EOG token arrival.

In addition, the chat_add_and_format lambda has been moved to a functor, and this now calls common_chat_templates_apply directly to allow more robust template-application options.

Logic has been put in place to suppress the system/prompt tags to clean up output.

Example output :

./build/bin/llama-cli.exe -m ./models/gpt-oss-20b-mxfp4.gguf -c 2048 -sys "You are a wizard" -p "please recite me a haiku about llamas" --jinja -co

mtmcp added 18 commits October 15, 2025 22:16

Add partial formatter

d230722

Remove extra call to common_chat_templates_apply

3d94112

Suppress template markup in system & prompt display

a7771c1

Track system/user prompt position

8694fa3

Remove complexity

e403844

Add guards against stripped reasoning

c3768f4

Remove trailing _ for member variables

c381ea5

WIP: colorizing the reasoning content

3087ff7

Add new console::write routine

98b0d26

Rename syntax variable

becf4c5

Use non-template version of write routine

c879317

Write to log when enabled otherwise direct

edb8c0f

Merge branch 'master' into llamacli-reasoning2

e8952fb

Fix pointer formatting

7216448

Only call common_chat_parse with assistant messages

c0ca21d

Add reasoning delimiters

1b1629d

Remove stale data from delta

fc248d1

Merge branch 'master' into llamacli-reasoning2

dc882ee

DajanaV had a problem deploying to PROD__AL_DEMO November 8, 2025 14:34 — with GitHub Actions Failure

Use double-arrow as reasoning delimiter

e42715e

DajanaV had a problem deploying to PROD__AL_DEMO November 8, 2025 15:33 — with GitHub Actions Failure

DajanaV force-pushed the main branch 9 times, most recently from 96c975c to aa2fc28 Compare November 9, 2025 16:08

DajanaV force-pushed the main branch 29 times, most recently from 24733fb to 4b4bb7c Compare November 13, 2025 12:15

DajanaV closed this Nov 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UPSTREAM PR #16603: llama-cli: add support for reasoning#135

UPSTREAM PR #16603: llama-cli: add support for reasoning#135
DajanaV wants to merge 19 commits intomainfrom
upstream-PR16603-branch_bandoti-llamacli-reasoning2

DajanaV commented Nov 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

DajanaV commented Nov 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants