Skip to content

tests : add GQA=20 FA test#19095

Merged
ggerganov merged 1 commit intomasterfrom
gg/tests-add-gqa-20
Jan 30, 2026
Merged

tests : add GQA=20 FA test#19095
ggerganov merged 1 commit intomasterfrom
gg/tests-add-gqa-20

Conversation

@ggerganov
Copy link
Member

Might be a good idea to have a test that exercises GQA=20 in order to catch any potential regressions.

@github-actions github-actions bot added the testing Everything test related label Jan 25, 2026
@JohannesGaessler
Copy link
Contributor

The test failure here seems to be due to attention sinks in a configuration that does not appear in actual models. But there also seems to be a more serious issue.

@ggerganov
Copy link
Member Author

Let me know if you prefer to adjust the tests in some way in this PR, or feel free to push directly here.

@JohannesGaessler
Copy link
Contributor

In terms of correctness the new functionality should be covered with the test case added in #19115 . The kernel selection logic specific for GLM 4.7 Flash should only affect performance.

@ggerganov ggerganov force-pushed the gg/tests-add-gqa-20 branch from f0079c9 to 5e288b8 Compare January 27, 2026 08:43
@ggerganov
Copy link
Member Author

The test failure here seems to be due to attention sinks in a configuration that does not appear in actual models.

@JohannesGaessler How would you like to handle the sinks for this shape? Probably it's fine to add a check in support_op and not accept sinks until an actual model appears.

@ggerganov ggerganov force-pushed the gg/tests-add-gqa-20 branch from 5e288b8 to 2f83f9e Compare January 30, 2026 08:48
@ggerganov ggerganov merged commit c3b87ce into master Jan 30, 2026
81 of 82 checks passed
@ggerganov ggerganov deleted the gg/tests-add-gqa-20 branch January 30, 2026 11:53
4b1tQu4ntN3k0 pushed a commit to 4b1tQu4ntN3k0/llama.cpp that referenced this pull request Feb 2, 2026
shaofeiqi pushed a commit to qualcomm/llama.cpp that referenced this pull request Feb 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

testing Everything test related

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants