up #5

wqerrewetw · 2025-10-25T16:44:30Z

、

…gml-org#16751) This commit add the trust_remote_code=True argument when loading models using AutoConfig, AutoTokenizer, and AutoModelForCausalLM for the run original model script. The motivation for this is that some models require custom code to be loaded properly, and setting trust_remote_code=True avoids a prompt asking for user confirmation: ```console (venv) $ make causal-run-original-model The repository /path/to/model contains custom code which must be executed to correctly load the model. You can inspect the repository content at /path/to/model. Do you wish to run the custom code? [y/N] N ``` Having this as the default seems like a safe choice as we have to clone or download the models we convert and would be expecting to run any custom code they have.

* webui: support q URL parameter Fixes ggml-org#16722 I’ve checked that it works with Firefox’s AI tools * webui: apply suggestions from code review Co-authored-by: Aleksander Grygier <[email protected]> * chore: update webui static build --------- Co-authored-by: Aleksander Grygier <[email protected]>

…st (ggml-org#16742) * Fix CUDA grid launch condition for large block_nums.y * add backend ops test * reduce test repetitions

ggml_vk_create_buffer_temp is not used anywhere, and it is the only caller for ggml_vk_pool_malloc. Signed-off-by: Giuseppe Scrivano <[email protected]>

danbev and others added 8 commits October 24, 2025 12:02

CUDA: use CUB for arbitary size argsort (ggml-org#16754)

0bcb40b

ggml: fix CUDA grid launch condition for large block_nums.y in binbca…

55945d2

…st (ggml-org#16742) * Fix CUDA grid launch condition for large block_nums.y * add backend ops test * reduce test repetitions

convert : avoid dequantizing mxfp4 for GPT-OSS (ggml-org#16756)

5cca254

vulkan: Optimize SSM_SCAN (ggml-org#16645)

8423d01

vulkan: delete dead code (ggml-org#16732)

f90b4a8

ggml_vk_create_buffer_temp is not used anywhere, and it is the only caller for ggml_vk_pool_malloc. Signed-off-by: Giuseppe Scrivano <[email protected]>

model : set res->t_embd in PLaMo2 models (ggml-org#16766)

226f295

wqerrewetw merged commit 4c67f3d into qw25vl Oct 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

up #5

up #5

Uh oh!

wqerrewetw commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

up #5

up #5

Uh oh!

Conversation

wqerrewetw commented Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants