Skip to content

Conversation

@firecoperana
Copy link
Collaborator

@firecoperana firecoperana commented Sep 17, 2025

Add support for grok-2.

Test with unsloth/grok-2-UD-Q2_K_XL.gguf and looks ok.
Use --jinja for loading the model with llama-server

ggml-org/llama.cpp#15539

@firecoperana firecoperana self-assigned this Sep 17, 2025
float yarn_attn_factor = 1.0f; // YaRN magnitude scaling factor
float yarn_beta_fast = 32.0f; // YaRN low correction dim
float yarn_beta_slow = 1.0f; // YaRN high correction dim
float yarn_attn_factor = -1.0f; // YaRN magnitude scaling factor
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is this change required?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This copies from mainline.

LLAMA_VOCAB_PRE_TYPE_HUNYUAN = 36,
LLAMA_VOCAB_PRE_TYPE_KIMI_K2 = 37,
LLAMA_VOCAB_PRE_TYPE_HUNYUAN_DENSE = 38,
LLAMA_VOCAB_PRE_TYPE_GROK_2 = 39,
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Align '=' as the other entries

@ikawrakow ikawrakow merged commit 8cd2d7c into main Sep 23, 2025
@firecoperana firecoperana deleted the fcp/grok2 branch October 26, 2025 16:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants