[staging] e2e Studio q2_k_l GGUF export validation (Linux / macOS / Windows) by danielhanchen · Pull Request #127 · danielhanchen/unsloth-staging-2

danielhanchen · 2026-05-18T10:55:34Z

Throwaway staging PR validating the Studio q2_k_l GGUF export fix end-to-end on real Linux / macOS / Windows runners. Do not merge — close after CI confirms green.

Pairs with unsloth-zoo-staging-1#18 (already green on all 3 OS for the unit-test layer); this PR validates the binary-toolchain layer:

Build llama.cpp from source on each OS via the patched install_llama_cpp().
Confirm the freshly-built llama-quantize --help advertises q2_k AND the two preset flags (--output-tensor-type, --token-embedding-type) — proving the CLI surface still matches what unsloth_zoo emits.
Import the patched unsloth_zoo from the staging-1 branch, capture the command quantize_gguf(quant_type='q2_k_l') would emit, assert the post-expansion command is correct.

Diagnostic regression check for: main: invalid ftype 'q2_k_l' on Apple Silicon.

Changes on this branch

+ .github/workflows/studio-export-q2kl-e2e.yml — 3-OS matrix, max-parallel: 3 to stay under the 5-Windows-runner cap, concurrency.cancel-in-progress: true, paths: filter so unrelated commits don't re-fire it.
− 22 unrelated workflow files (this branch only) — keeps push fan-out bounded. Branch retains lint-ci.yml + wheel-smoke.yml + the new e2e workflow.

Test plan

ubuntu-latest job green
macos-14 job green
windows-latest job green

Adds .github/workflows/studio-export-q2kl-e2e.yml that: 1. Builds llama.cpp from source on each OS via install_llama_cpp() (apt-installed build deps on Linux; cmake + system toolchain on macOS / Windows). CUDA spoof preamble matches the existing consolidated-tests-ci.yml llama-cpp-smoke job. 2. Locates the freshly-built llama-quantize binary (build/bin/Release/ on Windows, top-level on macOS / Linux). 3. Asserts llama-quantize --help advertises q2_k AND the two preset flags (--output-tensor-type, --token-embedding-type) so the CLI surface still matches what unsloth_zoo emits on this OS. 4. Imports the patched unsloth_zoo from the staging fork branch (danielhanchen/unsloth-zoo-staging-1@studio-export-q2kl-fix), captures the command quantize_gguf emits for quant_type='q2_k_l', and asserts the post-expansion tokens are present and the literal preset name does NOT leak through. This is the diagnostic regression check for the user-reported error main: invalid ftype 'q2_k_l' on Apple Silicon Studio export. Trims 22 unrelated workflow files on this branch only (leaving lint-ci.yml + wheel-smoke.yml + the new e2e workflow). Drop is staging- only hygiene -- not intended for upstream unslothai/unsloth.

gemini-code-assist · 2026-05-18T10:55:40Z

Note

Gemini is unable to generate a review for this pull request due to the file types involved not being currently supported.

… keep pip-install + command-capture e2e

danielhanchen · 2026-05-19T08:00:54Z

Throwaway dry-run, validated end-to-end on Linux / macOS / Windows runners. Real fix is at unslothai/unsloth-zoo#667. Closing.

Daniel Han added 2 commits May 18, 2026 10:57

ci: install transformers + huggingface_hub for unsloth_zoo eager imports

d5ac261

ci: drop install_llama_cpp build (macOS pkg-mgr detection unrelated);…

417cd1d

… keep pip-install + command-capture e2e

danielhanchen mentioned this pull request May 18, 2026

Fix Studio q2_k_l GGUF export and new llama.cpp converter package layout unslothai/unsloth-zoo#667

Merged

5 tasks

danielhanchen closed this May 19, 2026

danielhanchen deleted the studio-export-q2kl-fix branch May 19, 2026 08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[staging] e2e Studio q2_k_l GGUF export validation (Linux / macOS / Windows)#127

[staging] e2e Studio q2_k_l GGUF export validation (Linux / macOS / Windows)#127
danielhanchen wants to merge 3 commits into
mainfrom
studio-export-q2kl-fix

danielhanchen commented May 18, 2026

Uh oh!

gemini-code-assist Bot commented May 18, 2026

Uh oh!

danielhanchen commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

danielhanchen commented May 18, 2026

Changes on this branch

Test plan

Uh oh!

gemini-code-assist Bot commented May 18, 2026

Uh oh!

danielhanchen commented May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant