Conversation

@ikawrakow
Owner

Just reuse the IQ1_BN implementation. The only twist is that the row scale is now stored at the beginning of the row, so the dot product template needs a small modification: a pointer to the row start is passed down to the dot product implementation.
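
A minimal sketch of that change, written as plain C++ rather than the actual CUDA kernels; every type and function name here (`ternary_row_header`, `block_ternary`, `vec_dot_ternary_q8`, `row_dot`) is a hypothetical stand-in, and the 2-bit block layout is only an assumption used to keep the example short. The point it illustrates is the template change: the row-level driver forwards a pointer to the start of the row, so the per-block dot product can read the row scale stored there.

```cpp
#include <cstdint>
#include <cstddef>

// Assumed row layout for this sketch: one float row scale, then the packed blocks.
struct ternary_row_header { float d; };

// Hypothetical block: 64 ternary values, 2 bits each (not the real IQ1_BN layout).
struct block_ternary { uint8_t qs[16]; };

// Per-block dot product: now also receives a pointer to the beginning of the row
// so it can pick up the row scale stored there.
static float vec_dot_ternary_q8(const block_ternary * b,
                                const void * row_begin,   // new parameter
                                const int8_t * q8) {
    const float d = ((const ternary_row_header *)row_begin)->d;  // row scale
    int sumi = 0;
    for (int i = 0; i < 16; ++i) {
        for (int j = 0; j < 4; ++j) {
            // 2-bit codes assumed to be in {0,1,2}, mapping to {-1,0,+1}
            const int t = ((b->qs[i] >> (2*j)) & 3) - 1;
            sumi += t * q8[4*i + j];
        }
    }
    return d * (float)sumi;
}

// Row-level driver: iterates over the blocks of one row and forwards row_begin.
static float row_dot(const void * row_begin, const int8_t * q8, int nblocks) {
    const block_ternary * blocks =
        (const block_ternary *)((const char *)row_begin + sizeof(ternary_row_header));
    float sum = 0.0f;
    for (int ib = 0; ib < nblocks; ++ib) {
        sum += vec_dot_ternary_q8(blocks + ib, row_begin, q8 + 64*ib);
    }
    return sum;
}
```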

It is slightly slower than IQ2_TN (305 t/s vs. 320 t/s for the 4B TriLM model on an RTX-4080), but this is to be expected given the bit twiddling needed to unpack the ternary values.
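
For context on where that overhead comes from: IQ2_TN (as the name suggests) gives each ternary value its own 2-bit code, so extraction is just a shift and a mask, whereas packing closer to the information-theoretic ~1.58 bits per ternary value means storing several values per byte in base 3 and digging them back out arithmetically. The snippet below is only a generic illustration of that kind of decode (five base-3 digits per byte), not the actual IQ1_BN packing; an optimized kernel would replace the division with multiply/shift tricks and SIMD, but the extra work relative to a plain 2-bit code remains, which is the likely source of the ~5% throughput gap quoted above.

```cpp
#include <cstdint>

// Generic illustration only: decode five ternary digits ("trits") from one byte
// that stores a base-3 number in [0, 242]. This is NOT the actual IQ1_BN layout.
static inline void unpack5_trits(uint8_t v, int8_t out[5]) {
    for (int i = 0; i < 5; ++i) {
        out[i] = (int8_t)(v % 3) - 1;  // map digit {0,1,2} -> weight {-1,0,+1}
        v /= 3;
    }
}

// Compare with a plain 2-bit code (IQ2_TN-style): extraction is just shift + mask.
static inline int8_t unpack_2bit(uint8_t byte, int idx) {
    return (int8_t)((byte >> (2 * idx)) & 3) - 1;  // codes assumed to be {0,1,2}
}
```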
