Fix division by zero bug #349

ikawrakow · 2025-04-26T07:17:16Z

The bug was in the calculation of number of work items to use when computing FA on the CPU. In my case (maximum of 32 threads) it triggered with the GLM-4 model that has an unusually small number of KV heads (just 2). But I guess it can also trigger with a larger number of threads for more common numbers of KV heads.

Fixed by just using max(1, nk). This will result in a far from optimal number of compute chunks, but at least it works.

I'm working on a better strategy for dividing the work between the threads on this branch, but not quite ready for a PR yet.

Fix division by zero bug

957308c

ikawrakow mentioned this pull request Apr 26, 2025

Add GLM-4-0414 Model Support #344

Merged

ikawrakow merged commit 9e846f0 into main Apr 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix division by zero bug #349

Fix division by zero bug #349

Uh oh!

ikawrakow commented Apr 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix division by zero bug #349

Fix division by zero bug #349

Uh oh!

Conversation

ikawrakow commented Apr 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants