Skip to content

Address performance regression in Qwen and llama.cpp due to chunking

c77bafd
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Merged

ggml-cpu: handle 3d tensors in repack mat_mul #17241

Address performance regression in Qwen and llama.cpp due to chunking
c77bafd
Select commit
Loading
Failed to load commit list.
labeler
succeeded Nov 13, 2025 in 8s