vulkan: fix mmq out of bounds reads #17108

0cc4m · 2025-11-08T17:32:24Z

I also cleaned up some legacy indexing and buffer size code in the matmul functions.

Draft because I caused a mul_mat_id regression, still trying to find the reason.

…code

mimi89999 · 2025-11-08T19:16:18Z

When building I got multiple warnings about unused variables:

[22/99] Building CXX object ggml/src/ggml-vulkan/CMakeFiles/ggml-vulkan.dir/ggml-vulkan.cpp.o
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp: In function ‘void ggml_vk_mul_mat_vec_q_f16(ggml_backend_vk_context*, vk_context&, const ggml_cgraph*, int)’:
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:6588:20: warning: unused variable ‘ne22’ [-Wunused-variable]
 6588 |     const uint64_t ne22 = dst->ne[2];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:6589:20: warning: unused variable ‘ne23’ [-Wunused-variable]
 6589 |     const uint64_t ne23 = dst->ne[3];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp: In function ‘void ggml_vk_mul_mat_id_q_f16(ggml_backend_vk_context*, vk_context&, const ggml_tensor*, const ggml_tensor*, const ggml_tensor*, ggml_tensor*)’:
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7171:20: warning: unused variable ‘ne03’ [-Wunused-variable]
 7171 |     const uint64_t ne03 = src0->ne[3];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7186:20: warning: unused variable ‘ne22’ [-Wunused-variable]
 7186 |     const uint64_t ne22 = dst->ne[2];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7187:20: warning: unused variable ‘ne23’ [-Wunused-variable]
 7187 |     const uint64_t ne23 = dst->ne[3];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp: In function ‘void ggml_vk_mul_mat_vec_id_q_f16(ggml_backend_vk_context*, vk_context&, const ggml_cgraph*, int)’:
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7441:20: warning: unused variable ‘ne02’ [-Wunused-variable]
 7441 |     const uint64_t ne02 = src0->ne[2];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7442:20: warning: unused variable ‘ne03’ [-Wunused-variable]
 7442 |     const uint64_t ne03 = src0->ne[3];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7446:20: warning: unused variable ‘ne12’ [-Wunused-variable]
 7446 |     const uint64_t ne12 = src1->ne[2];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7447:20: warning: unused variable ‘ne13’ [-Wunused-variable]
 7447 |     const uint64_t ne13 = src1->ne[3];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7458:20: warning: unused variable ‘ne22’ [-Wunused-variable]
 7458 |     const uint64_t ne22 = dst->ne[2];
      |                    ^~~~
/home/michel/llm/llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp:7459:20: warning: unused variable ‘ne23’ [-Wunused-variable]
 7459 |     const uint64_t ne23 = dst->ne[3];
      |                    ^~~~
[96/99] Linking CXX executable bin/llama-export-lora

The change also didn't fix #17056 for me.

mimi89999 · 2025-11-08T19:46:48Z

However, this change did fix [MUL_MAT] NMSE = 0.098834385 > 0.000500000 MUL_MAT(type_a=q4_0,type_b=f32,m=576,n=512,k=576,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1): FAIL for me: MUL_MAT(type_a=q4_0,type_b=f32,m=576,n=512,k=576,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1): OK

0cc4m · 2025-11-08T19:50:27Z

Yeah, that's the problem I found in the issue. I added that test. I guess you have another problem. I'll try to find it.

0cc4m · 2025-11-08T19:58:21Z

I can reproduce another issue with the model you posted in your issue. I'll look into it, thank you for testing the fix.

vulkan: fix mmq out of bounds reads, streamline outdated matmul host …

477738c

…code

github-actions bot added testing Everything test related Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Nov 8, 2025

DajanaV mentioned this pull request Nov 8, 2025

UPSTREAM PR #17108: vulkan: fix mmq out of bounds reads auroralabs-loci/llama.cpp#137

Open

fix mul_mat_id quantization call

551d053

0cc4m marked this pull request as ready for review November 8, 2025 17:46

0cc4m requested a review from slaren as a code owner November 8, 2025 17:46

0cc4m requested a review from jeffbolznv November 8, 2025 17:46

0cc4m mentioned this pull request Nov 8, 2025

Eval bug: Vulkan not working on Intel GPU #17056

Closed

jeffbolznv approved these changes Nov 8, 2025

View reviewed changes

Fix compiler warnings

02a65b3

0cc4m mentioned this pull request Nov 9, 2025

Eval bug: Q4_K_M with vulkan generates garbage/repetitive output #16961

Open

0cc4m merged commit 8a3519b into master Nov 9, 2025
67 of 71 checks passed

0cc4m deleted the 0cc4m/vulkan-mmq-bk-step-fix branch November 9, 2025 08:53

louiswpf mentioned this pull request Nov 12, 2025

Eval bug: '????????' output with Vulkan backend #17055

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

vulkan: fix mmq out of bounds reads #17108

vulkan: fix mmq out of bounds reads #17108

0cc4m commented Nov 8, 2025

Uh oh!

mimi89999 commented Nov 8, 2025 •

edited

Loading

Uh oh!

mimi89999 commented Nov 8, 2025

Uh oh!

0cc4m commented Nov 8, 2025

Uh oh!

0cc4m commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vulkan: fix mmq out of bounds reads #17108

vulkan: fix mmq out of bounds reads #17108

Conversation

0cc4m commented Nov 8, 2025

Uh oh!

mimi89999 commented Nov 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mimi89999 commented Nov 8, 2025

Uh oh!

0cc4m commented Nov 8, 2025

Uh oh!

0cc4m commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mimi89999 commented Nov 8, 2025 •

edited

Loading