- 
                Notifications
    You must be signed in to change notification settings 
- Fork 13.5k
Closed
Labels
Description
Name and Version
b4896
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
llama-quantize
Command line
./llama-quantize --imatrix /models/OLMo-2-0325-32B-Instruct-GGUF/OLMo-2-0325-32B-Instruct.imatrix /models/OLMo-2-0325-32B-Instruct-GGUF/OLMo-2-0325-32B-Instruct-bf16.gguf /models/OLMo-2-0325-32B-Instruct-GGUF/OLMo-2-0325-32B-Instruct-Q4_K_M.gguf Q4_K_MProblem description & steps to reproduce
Affects Q2_K, Q4_K, and Q5_K on the same blocks
Does not seem to affect Q6_K, IQ4_NL, IQ4_XS, Q4_0, Q3_K, or any IQ3/2_N quant levels
Imatrix file uploaded here for testing:
BF16 conversion also uploaded for testing convenience:
First Bad Commit
No response
Relevant log output
[ 466/ 707]                 blk.42.attn_k.weight - [ 5120,  1024,     1,     1], type =   bf16, converting to q4_K .. ggml_validate_row_data: found nan value at block 40
ggml_validate_row_data: found nan value at block 20
ggml_validate_row_data: found nan value at block 40
ggml_validate_row_data: found nan value at block 20
ggml_validate_row_data: found nan value at block 20
ggml_validate_row_data: found nan value at block 40