Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
llama : use Q4_K for attn_v for Q2_K_S when n_gqa >= 4 (#4996)
Co-authored-by: Iwan Kawrakow <[email protected]>
- Loading branch information