
Conversation

@ikawrakow
Owner

This PR "sanitizes" the provided imatrix before using it for quantization, so that hopefully we will no longer find NaNs in quantized models.

For now we don't cover quantization types where the quantization is done via a function in ggml-quants.c. These are basically all legacy quants, k-quants, and i-quants (but repacked variants of these are covered).
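(To make the idea concrete, here is a minimal sketch of what such sanitization could look like. The function name and the fallback strategy, replacing bad entries with the mean of the valid ones, are assumptions for illustration, not the PR's actual implementation.)

```cpp
// Minimal sketch (not the PR's actual code): replace non-finite or
// non-positive importance weights with the mean of the valid ones,
// falling back to uniform importance if no entry is usable.
#include <cmath>
#include <vector>

static void sanitize_imatrix_row(std::vector<float> & weights) {
    double sum = 0.0;
    int n_valid = 0;
    for (float w : weights) {
        if (std::isfinite(w) && w > 0.0f) {
            sum += w;
            ++n_valid;
        }
    }
    const float fallback = n_valid > 0 ? float(sum / n_valid) : 1.0f;
    for (float & w : weights) {
        if (!(std::isfinite(w) && w > 0.0f)) {
            w = fallback;
        }
    }
}
```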

@ubergarm Hopefully this PR prevents NaNs in your models where we have observed them.

@ubergarm
Contributor

ubergarm commented Aug 27, 2025

Thanks, this looks like a great feature, given there is no need to re-make the imatrix. I've tested it by re-making two previously broken/NaN quants:

  • Qwen3-Coder-30B-A3B-Instruct-IQ3_K

Unfortunately, it didn't help here: the NaNs were in blk.1.ffn_(gate|up)_exps.weight, which are set to IQ3_K. That type doesn't look covered in the list of quants mentioned in the commit message, even though the first WIP commit patch looks like it covers both IQ3_K and IQ3_KS.

I tried this a second time back at my desk, but it's still not passing --validate-quants.

  • Qwen3-Coder-480B-A35B-Instruct-IQ2_KL.gguf

Yes, it does seem to fix this one. I re-quantized using the existing imatrix with this PR, and the resulting GGUF passes the new --validate-quants test; so far it is running clean on my usual llama-perplexity test against wiki.test.raw on the CPU-only backend, 128 chunks in.

I've now released this model here: https://huggingface.co/ubergarm/Qwen3-Coder-480B-A35B-Instruct-GGUF#iq2_kl-169597-gib-3034-bpw

@ikawrakow
Owner Author

I added even more checks specifically for iq3_ks and iq3_k. Hopefully this finally fixes the NaNs.

@ubergarm
Contributor

@ikawrakow

Hopefully this finally fixes the NaNs.

Great! Yes! I just rebuilt this PR (735) at aa34097 and quantized the Qwen3-Coder-30B-A3B-Instruct-IQ3_K again. Then I went back to the tip of main; it passes --validate-quants and also runs clean on perplexity, so I released it here: https://huggingface.co/ubergarm/Qwen3-Coder-30B-A3B-Instruct-GGUF#iq3_k-14509-gib-4082-bpw

I was not able to rebase #624 onto it cleanly, so I didn't use the quantization tweaks, despite this being IQ3_K, which would likely benefit from that PR.

I do not have any broken NaN quants left now, so I can't test ik/sanitize_importance_kt_quants easily.

Thanks for adding this feature!

@ikawrakow merged commit 46968d4 into main on Aug 29, 2025
@Thireus
Contributor

Thireus commented Aug 29, 2025

How can I identify if some of the GGUFs I've produced contain NaNs?

@ikawrakow
Owner Author

How can I identify if some of the GGUFs I've produced contain NaNs?

Just add -vq to your normal command (llama-server, llama-cli, etc.). If there are NaNs in the model, you will get info about the tensors containing NaNs, and the command will terminate.
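(For context, the validation pass conceptually amounts to scanning each tensor's dequantized values for non-finite entries and stopping with the tensor name on the first hit. The sketch below uses hypothetical names and is only a guess at the shape of that logic, not the actual ik_llama.cpp code.)

```cpp
// Conceptual sketch with hypothetical names, not the actual ik_llama.cpp code:
// walk a tensor's dequantized float data and bail out on the first
// NaN/Inf, reporting which tensor is affected.
#include <cmath>
#include <cstdio>
#include <cstdlib>

static void validate_tensor(const char * name, const float * data, size_t n) {
    for (size_t i = 0; i < n; ++i) {
        if (!std::isfinite(data[i])) {
            fprintf(stderr, "found invalid value in tensor %s at index %zu\n", name, i);
            exit(1);
        }
    }
}
```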
