
Conversation

@odrling odrling (Contributor) commented Oct 22, 2025

Fixes #16722
I’ve checked that it works with Firefox’s AI tools

I’m not sure if I have to rebuild the webui bundle in this PR or let a maintainer do it before merging.
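
For context, using the feature described above is just a matter of appending the q parameter to the webui URL. A minimal sketch, assuming the default llama-server address (the question text is only a placeholder):

```ts
// Minimal sketch: open the llama.cpp webui with a question passed via the q URL parameter.
// The address assumes llama-server's default host/port; the question text is a placeholder.
const question = "Summarize the page I am looking at";
const url = `http://localhost:8080/?q=${encodeURIComponent(question)}`;
window.open(url, "_blank"); // the webui picks up q on load, per this PR
```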

@allozaur allozaur (Collaborator) left a comment

This needs proper Svelte 5 syntax as in the code suggestions.
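
For illustration, the Svelte 5 (runes) version of reading the q parameter might look roughly like the sketch below; sendMessage is a hypothetical stand-in for the webui's own submission helper, not the actual code from this PR:

```ts
// Hypothetical sketch, not the actual webui code: handle a ?q=... parameter using
// Svelte 5 rune syntax. This would live inside a component's <script lang="ts"> block.

// Stand-in for whatever helper the webui uses to submit a chat message.
function sendMessage(text: string): void {
  console.log("submitting:", text);
}

// $effect runs in the browser after the component mounts, so window is available here.
$effect(() => {
  const q = new URLSearchParams(window.location.search).get("q");
  if (q) {
    sendMessage(q);
    // Optionally strip the parameter so a page reload does not resend the question.
    const url = new URL(window.location.href);
    url.searchParams.delete("q");
    history.replaceState({}, "", url);
  }
});
```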

@odrling odrling requested a review from allozaur October 23, 2025 01:55
@allozaur allozaur (Collaborator) left a comment

Okay, I've just tested this locally and it's working great. The last thing to do is to include the static build, and then we'll need to pass the CI and we'll be good to go ;)

@odrling odrling (Contributor, Author) commented Oct 23, 2025

OK I’ve updated the static build

@odrling odrling requested a review from allozaur October 23, 2025 15:20
@allozaur allozaur merged commit 69e9ff0 into ggml-org:master Oct 24, 2025
14 checks passed
wqerrewetw added a commit to wqerrewetw/llama.cpp that referenced this pull request Oct 25, 2025
* model-conversion : add trust_remote_code for orig model run [no ci] (ggml-org#16751)

This commit adds the trust_remote_code=True argument when loading models
using AutoConfig, AutoTokenizer, and AutoModelForCausalLM in the
run-original-model script.

The motivation for this is that some models require custom code to be
loaded properly, and setting trust_remote_code=True avoids a prompt
asking for user confirmation:
```console
(venv) $ make causal-run-original-model
The repository /path/to/model contains custom code which must be
executed to correctly load the model. You can inspect the repository
content at /path/to/model.

Do you wish to run the custom code? [y/N] N
```

Having this as the default seems like a safe choice, since we already have
to clone or download the models we convert and would therefore expect to run
any custom code they include.

* webui: support q URL parameter (ggml-org#16728)

* webui: support q URL parameter

Fixes ggml-org#16722
I’ve checked that it works with Firefox’s AI tools

* webui: apply suggestions from code review

Co-authored-by: Aleksander Grygier <[email protected]>

* chore: update webui static build

---------

Co-authored-by: Aleksander Grygier <[email protected]>

* CUDA: use CUB for arbitrary size argsort (ggml-org#16754)

* ggml: fix CUDA grid launch condition for large block_nums.y in binbcast (ggml-org#16742)

* Fix CUDA grid launch condition for large block_nums.y

* add backend ops test

* reduce test repetitions

* convert : avoid dequantizing mxfp4 for GPT-OSS (ggml-org#16756)

* vulkan: Optimize SSM_SCAN (ggml-org#16645)

* vulkan: delete dead code (ggml-org#16732)

ggml_vk_create_buffer_temp is not used anywhere, and it is the only
caller of ggml_vk_pool_malloc.

Signed-off-by: Giuseppe Scrivano <[email protected]>

* model : set res->t_embd in PLaMo2 models (ggml-org#16766)

---------

Signed-off-by: Giuseppe Scrivano <[email protected]>
Co-authored-by: Daniel Bevenius <[email protected]>
Co-authored-by: Florian Badie <[email protected]>
Co-authored-by: Aleksander Grygier <[email protected]>
Co-authored-by: Aman Gupta <[email protected]>
Co-authored-by: leejet <[email protected]>
Co-authored-by: compilade <[email protected]>
Co-authored-by: Jeff Bolz <[email protected]>
Co-authored-by: Giuseppe Scrivano <[email protected]>
Co-authored-by: Shunta Saito <[email protected]>
wqerrewetw added a commit to wqerrewetw/llama.cpp that referenced this pull request Oct 25, 2025
* qwen3-coder tool call parser

* reset template

* Fix grammar, hide tool_call from output

* Fix C++ compilation error in tests/test-chat.cpp

Add a missing closing brace to terminate the test_template_output_parsers() function. This resolves compilation errors that prevented a successful build of the test-chat target.

* Update common/chat.cpp

Co-authored-by: Kashyap Jois <[email protected]>

* Update common/chat.cpp

Co-authored-by: Kashyap Jois <[email protected]>

* Fix for test

* revert

* Update common/chat.cpp

Co-authored-by: Marcel de Vries <[email protected]>

* Update common/chat.cpp

Co-authored-by: Marcel de Vries <[email protected]>

* removed test

* Qwen3-Coder XML: handle union schema types and sanitize unsupported branches; add tests

- chat-parser: support schema.type as array (e.g. ["number","null"]) in convert_qwen3_param_value()
- chat: resolve $refs; allow unions including "string" as freeform; sanitize empty {"not":{}} in anyOf/oneOf before add_schema
- tests: add Qwen3-Coder regression ensuring grammar builds with unions and ignores {"not":{}}
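
To make the union handling described in this entry concrete, here is an illustrative TypeScript sketch of the idea; the real change lives in the C++ chat/grammar code (common/chat.cpp and the chat parser), so all names and structure here are hypothetical:

```ts
// Illustrative sketch (hypothetical TypeScript): the actual change is in the C++ code.
type Schema = { [key: string]: unknown };

// Drop empty {"not": {}} branches from anyOf/oneOf so they don't break grammar generation.
function sanitizeUnion(schema: Schema): Schema {
  const out: Schema = { ...schema };
  for (const key of ["anyOf", "oneOf"] as const) {
    const branches = out[key];
    if (Array.isArray(branches)) {
      out[key] = branches.filter(
        (b) =>
          !(
            b &&
            typeof b === "object" &&
            Object.keys(b).length === 1 &&
            JSON.stringify((b as Schema).not) === "{}"
          )
      );
    }
  }
  return out;
}

// A type given as an array, e.g. ["number", "null"], is a union; if it includes "string",
// treat the parameter value as free-form text rather than forcing a typed conversion.
function isFreeformUnion(schema: Schema): boolean {
  const t = schema.type;
  return Array.isArray(t) && t.includes("string");
}
```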

* Moved common_chat_parse_qwen3_coder_xml

* Fix merge oopsie

* Sync bundled template with upstream

See https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/chat_template.jinja

* Fix crash when tool call doesn't start with <tool_call>

* model-conversion : add trust_remote_code for orig model run [no ci] (ggml-org#16751)

This commit adds the trust_remote_code=True argument when loading models
using AutoConfig, AutoTokenizer, and AutoModelForCausalLM in the
run-original-model script.

The motivation for this is that some models require custom code to be
loaded properly, and setting trust_remote_code=True avoids a prompt
asking for user confirmation:
```console
(venv) $ make causal-run-original-model
The repository /path/to/model contains custom code which must be
executed to correctly load the model. You can inspect the repository
content at /path/to/model.

Do you wish to run the custom code? [y/N] N
```

Having this as the default seems like a safe choice, since we already have
to clone or download the models we convert and would therefore expect to run
any custom code they include.

* webui: support q URL parameter (ggml-org#16728)

* webui: support q URL parameter

Fixes ggml-org#16722
I’ve checked that it works with Firefox’s AI tools

* webui: apply suggestions from code review

Co-authored-by: Aleksander Grygier <[email protected]>

* chore: update webui static build

---------

Co-authored-by: Aleksander Grygier <[email protected]>

---------

Co-authored-by: Benjamin Oldenburg <[email protected]>
Co-authored-by: Marcel de Vries <[email protected]>
Co-authored-by: Kashyap Jois <[email protected]>
Co-authored-by: Daniel Bevenius <[email protected]>
Co-authored-by: Florian Badie <[email protected]>
Co-authored-by: Aleksander Grygier <[email protected]>
Successfully merging this pull request may close these issues.

Feature Request: support q URL parameter in webui