Skip to content

download : prefer q8_0 when q4_k not available#22428

Merged
ngxson merged 1 commit into
masterfrom
gg/download-prefer-q8
Apr 27, 2026
Merged

download : prefer q8_0 when q4_k not available#22428
ngxson merged 1 commit into
masterfrom
gg/download-prefer-q8

Conversation

@ggerganov
Copy link
Copy Markdown
Member

Overview

By default, the llama tools download Q4_K quantizations. When not available, they will now look for Q8_0.

Requirements

@ggerganov ggerganov requested a review from a team as a code owner April 27, 2026 09:09
@ngxson ngxson merged commit e940b3d into master Apr 27, 2026
45 of 46 checks passed
IntelNav pushed a commit to IntelNav/llama.cpp that referenced this pull request Apr 29, 2026
IntelNav pushed a commit to IntelNav/llama.cpp that referenced this pull request Apr 29, 2026
rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 1, 2026
samuraieng pushed a commit to samuraieng/llama.cpp that referenced this pull request May 6, 2026
ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026
meh pushed a commit to meh/llama.cpp that referenced this pull request May 10, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants