Skip to content

UPSTREAM PR #18227: server: /v1/responses (text generation only)#695

Open
loci-dev wants to merge 21 commits intomainfrom
upstream-PR18227-branch_openingnow-master
Open

UPSTREAM PR #18227: server: /v1/responses (text generation only)#695
loci-dev wants to merge 21 commits intomainfrom
upstream-PR18227-branch_openingnow-master

Conversation

@loci-dev
Copy link
Copy Markdown

Mirrored from ggml-org/llama.cpp#18227

This PR introduces minimally working openAI-compatible /v1/responses API by converting /v1/responses request into /v1/chat/completions request.

Only text generation is supported and several fields such as IDs (of response and messages) are omitted.

If this appears too unfinished for a merge at this stage, please let me know and I'll convert it to a draft.

@loci-dev loci-dev force-pushed the main branch 10 times, most recently from f7cdf84 to 701f648 Compare December 26, 2025 23:08
@loci-dev loci-dev force-pushed the main branch 30 times, most recently from ca0d661 to 594833d Compare January 3, 2026 22:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants