ggml: fixed Arm SVE usage bug in vec.h, vec.cpp by martin-klacer-arm · Pull Request #22841 · ggml-org/llama.cpp

martin-klacer-arm · 2026-05-08T14:36:37Z

Overview

This pull request fixes Arm SVE code in GGML vec.h and vec.cpp files. Previously, the F16 multiply accumulate functions used F16 as the accumulation data type as well, even though the output type is F32. This lead to overflows in some larger models, causing random ASCII output. This PR changes the accumulation to be F32 data type which solves the overflow.

Additional information

This PR fixes the bug: #21548

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES - used AI guidance to help in understanding SVE intrinsics details.

* Updated vec.h/vec.cpp code to accumulate to F32 rather than F16 Signed-off-by: Martin Klacer <martin.klacer@arm.com> Co-authored-by: Milos Puzovic <Milos.Puzovic@arm.com> Change-Id: I0cb789347f2bf60ffaf9047319f727e788c825f8

martin-klacer-arm · 2026-05-18T08:52:56Z

Hello @ggerganov, I wanted to follow up on this PR and check if you've had a chance to take a look? If you have any questions about the PR or code changes I'm happy to provide more detail. Thank you!

chaxu01 · 2026-05-28T07:00:12Z

Hi @ggerganov — just wanted to highlight this PR again when you have a chance.

This fixes an Arm SVE accumulation issue where FP16 accumulation was being used in F16 MAC paths even though the output type is FP32. On some larger models, this could lead to overflow and random ASCII output generation.

We’ve reviewed and validated the fix internally on our side as well. Thanks!

ggerganov · 2026-05-28T07:04:28Z

Thanks for the reminder!

* Updated vec.h/vec.cpp code to accumulate to F32 rather than F16 Change-Id: I0cb789347f2bf60ffaf9047319f727e788c825f8 Signed-off-by: Martin Klacer <martin.klacer@arm.com> Co-authored-by: Milos Puzovic <Milos.Puzovic@arm.com>

ggml: fixed Arm SVE usage bug in vec.h, vec.cpp

acca752

* Updated vec.h/vec.cpp code to accumulate to F32 rather than F16 Signed-off-by: Martin Klacer <martin.klacer@arm.com> Co-authored-by: Milos Puzovic <Milos.Puzovic@arm.com> Change-Id: I0cb789347f2bf60ffaf9047319f727e788c825f8

martin-klacer-arm requested a review from ggerganov as a code owner May 8, 2026 14:36

github-actions Bot added the ggml changes relating to the ggml tensor library for machine learning label May 8, 2026

martin-klacer-arm mentioned this pull request May 15, 2026

Eval bug: Qwen2-VL 2B and Qwen2 1.5B Instruct producing random characters output on Aarch64 with SVE #21548

Closed

ggerganov merged commit e31cdaa into ggml-org:master May 28, 2026
43 of 46 checks passed

chaxu01 mentioned this pull request Jun 2, 2026

ci: remove redundant or duplicate jobs #23927

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml: fixed Arm SVE usage bug in vec.h, vec.cpp#22841

ggml: fixed Arm SVE usage bug in vec.h, vec.cpp#22841
ggerganov merged 1 commit into
ggml-org:masterfrom
martin-klacer-arm:feature/fix_arm_sve_code

martin-klacer-arm commented May 8, 2026

Uh oh!

martin-klacer-arm commented May 18, 2026

Uh oh!

chaxu01 commented May 28, 2026

Uh oh!

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

martin-klacer-arm commented May 8, 2026

Overview

Additional information

Requirements

Uh oh!

martin-klacer-arm commented May 18, 2026

Uh oh!

chaxu01 commented May 28, 2026

Uh oh!

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants