tests : add GQA=20 FA test by ggerganov · Pull Request #19095 · ggml-org/llama.cpp

ggerganov · 2026-01-25T20:40:18Z

Might be a good idea to have a test that exercises GQA=20 in order to catch any potential regressions.

JohannesGaessler · 2026-01-26T01:29:25Z

The test failure here seems to be due to attention sinks in a configuration that does not appear in actual models. But there also seems to be a more serious issue.

ggerganov · 2026-01-26T09:09:11Z

Let me know if you prefer to adjust the tests in some way in this PR, or feel free to push directly here.

JohannesGaessler · 2026-01-26T16:21:04Z

In terms of correctness the new functionality should be covered with the test case added in #19115 . The kernel selection logic specific for GLM 4.7 Flash should only affect performance.

ggerganov · 2026-01-28T07:14:49Z

The test failure here seems to be due to attention sinks in a configuration that does not appear in actual models.

@JohannesGaessler How would you like to handle the sinks for this shape? Probably it's fine to add a check in support_op and not accept sinks until an actual model appears.

ggerganov requested a review from JohannesGaessler January 25, 2026 20:40

github-actions bot added the testing Everything test related label Jan 25, 2026

JohannesGaessler approved these changes Jan 25, 2026

View reviewed changes

loci-dev mentioned this pull request Jan 25, 2026

UPSTREAM PR #19095: tests : add GQA=20 FA test auroralabs-loci/llama.cpp#1032

Open

ggerganov force-pushed the gg/tests-add-gqa-20 branch from f0079c9 to 5e288b8 Compare January 27, 2026 08:43

tests : add GQA=20 FA test

2f83f9e

ggerganov force-pushed the gg/tests-add-gqa-20 branch from 5e288b8 to 2f83f9e Compare January 30, 2026 08:48

ggerganov merged commit c3b87ce into master Jan 30, 2026
81 of 82 checks passed

ggerganov deleted the gg/tests-add-gqa-20 branch January 30, 2026 11:53

4b1tQu4ntN3k0 pushed a commit to 4b1tQu4ntN3k0/llama.cpp that referenced this pull request Feb 2, 2026

tests : add GQA=20 FA test (ggml-org#19095)

c53262d

shaofeiqi pushed a commit to qualcomm/llama.cpp that referenced this pull request Feb 6, 2026

tests : add GQA=20 FA test (ggml-org#19095)

2bd45ec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests : add GQA=20 FA test#19095

tests : add GQA=20 FA test#19095
ggerganov merged 1 commit intomasterfrom
gg/tests-add-gqa-20

ggerganov commented Jan 25, 2026

Uh oh!

JohannesGaessler commented Jan 26, 2026

Uh oh!

ggerganov commented Jan 26, 2026

Uh oh!

JohannesGaessler commented Jan 26, 2026

Uh oh!

ggerganov commented Jan 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ggerganov commented Jan 25, 2026

Uh oh!

JohannesGaessler commented Jan 26, 2026

Uh oh!

ggerganov commented Jan 26, 2026

Uh oh!

JohannesGaessler commented Jan 26, 2026

Uh oh!

ggerganov commented Jan 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants