PR #697 cross-OS validation (do not merge) by danielhanchen · Pull Request #156 · danielhanchen/unsloth-staging-2

danielhanchen · 2026-05-27T13:05:06Z

Staging CI runs for unslothai/unsloth-zoo#697 (`Lyxot:fix/mlx-save-gguf-export-parity` -- 11 MLX save / GGUF export parity fixes + 23 regressions in `tests/test_mlx_save_export_regressions.py`).

No Apple Silicon hardware locally, so this branch validates the PR on real `macos-14` GitHub Actions runners (the highest-signal target) plus `ubuntu-latest` and `windows-latest` via the `tests/mlx_simulation/` torch-spoof shim.

Branches

Base: `pr697-base` -- snapshot of `unslothai/unsloth-zoo:main` so this PR's diff isolates PR docker images unslothai/unsloth#697's contents plus the three workflow files.
Head: `pr697-cross-os` -- `pr697-base` + the PR docker images unslothai/unsloth#697 head + swept `.github/workflows/*` + the three new jobs below.

Jobs

Workflow	Runner	What it verifies
`pr697-macos.yml`	`macos-14` (M1)	`pip install -e .[mlx]` (real mlx / mlx-lm / mlx-vlm wheels), subpackage import smoke (`unsloth_zoo.mlx.{loader,compile,utils,trainer,cce,runtime}`), 23 save/export regressions, `test_mlx_torch_shim_smoke.py`
`pr697-ubuntu.yml`	`ubuntu-latest`	CPU torch + `.[core]` + `pip install --no-deps unsloth` to satisfy the `find_spec("unsloth")` guard, `pytest --collect-only`, then 23 regressions + shim smoke via `mlx_simulation`
`pr697-windows.yml`	`windows-latest`	Same shim path as Ubuntu; Windows-native `pathlib` coverage for the new `_copy_source_sidecars` / `Path.iterdir` / `os.environ` helpers

Sweep rationale

`.github/workflows/mlx-ci.yml` on PR head still imports the pre-migration flat module path (`unsloth_zoo.mlx_loader`); PR unslothai#697 lives in the subpackage `unsloth_zoo.mlx.loader`. Left as-is the workflow would hard-fail unrelated to PR unslothai#697's diff, so the staging branch sweeps every existing workflow and substitutes the three jobs above. Precedent: PR #154 on this fork.

Lifecycle

Throwaway. Close (don't merge) once `macos-14` + `ubuntu-latest` are green and PR unslothai#697 upstream lands.

Prefer HF vision aliases when inverting mlx-vlm sanitizers, while keeping same-name tensor rewrites as a fallback. This preserves Gemma3 patch embedding layout fixes and avoids stopping early on Qwen-family MLX vision_tower names.

Use loaded VLM model instances when replaying mlx-vlm sanitizers for GGUF tensor rewrites, while keeping the config-derived class pipeline as a fallback. This covers models whose top-level sanitizer delegates through submodules, such as GLM-OCR.

This reverts commit d8cdf89. Also removes the related regression coverage and later helper hardening now that bug-4 is out of scope.

…aging branch

…n jobs Sweep .github/workflows/* (mlx-ci.yml is broken on PR head -- imports old flat unsloth_zoo.mlx_loader path; subpackage migration moved it to unsloth_zoo.mlx.loader). Add three focused per-OS jobs covering the tests/test_mlx_save_export_regressions.py suite that ships with PR unslothai#697: - pr697-macos.yml -- macos-14 Apple Silicon, .[mlx] extras + subpackage import smoke + 23 save/export regressions + shim smoke (highest-signal job). - pr697-ubuntu.yml -- ubuntu-latest, CPU torch + [core] + mlx_simulation shim, full pytest plus --collect-only. - pr697-windows.yml -- windows-latest, shim path, Windows-native pathlib coverage for the new save/export helpers. Staging branch only. Do not merge.

gemini-code-assist

Code Review

This pull request introduces comprehensive regression tests and several improvements to the MLX save and GGUF export pipelines. Key changes include fixing path resolution for package-layout converters in llama_cpp.py, adding robust VLM processor repair mechanisms in loader.py, and implementing MLX VLM tensor rewriting for GGUF compatibility in utils.py. The reviewer feedback focuses on ensuring cross-platform encoding safety by explicitly specifying encoding="utf-8" when reading and writing JSON configuration and index files in utils.py.

gemini-code-assist · 2026-05-27T13:07:06Z

+    config_path = path / "config.json"
+    if not config_path.exists():
+        return 0
+    with open(config_path, "r") as f:


On Windows, opening files without specifying encoding="utf-8" can lead to UnicodeDecodeError if the file contains non-ASCII characters (which is common in model configs). Please specify encoding="utf-8" when reading the config file.

Suggested change

with open(config_path, "r") as f:

with open(config_path, "r", encoding="utf-8") as f:

gemini-code-assist · 2026-05-27T13:07:06Z

+    sanitize_steps = _build_mlx_vlm_sanitize_pipelines(config, model=model)
+    if not sanitize_steps:
+        if config_changed:
+            with open(config_path, "w") as f:


On Windows, writing files without specifying encoding="utf-8" can lead to encoding mismatches or UnicodeEncodeError if the config contains non-ASCII characters. Please specify encoding="utf-8" when writing the config file.

Suggested change

with open(config_path, "w") as f:

with open(config_path, "w", encoding="utf-8") as f:

gemini-code-assist · 2026-05-27T13:07:06Z

+
+    index_path = path / "model.safetensors.index.json"
+    if rewritten and index_path.exists():
+        with open(index_path, "r") as f:


On Windows, opening files without specifying encoding="utf-8" can lead to UnicodeDecodeError if the index file contains non-ASCII characters. Please specify encoding="utf-8" when reading the index file.

Suggested change

with open(index_path, "r") as f:

with open(index_path, "r", encoding="utf-8") as f:

gemini-code-assist · 2026-05-27T13:07:06Z

+                )
+            weight_map[new_name] = shard
+        index_data["weight_map"] = dict(sorted(weight_map.items()))
+        with open(index_path, "w") as f:


On Windows, writing files without specifying encoding="utf-8" can lead to encoding mismatches or UnicodeEncodeError if the index contains non-ASCII characters. Please specify encoding="utf-8" when writing the index file.

Suggested change

with open(index_path, "w") as f:

with open(index_path, "w", encoding="utf-8") as f:

gemini-code-assist · 2026-05-27T13:07:06Z

+            json.dump(index_data, f, indent=4)
+
+    if config_changed:
+        with open(config_path, "w") as f:


On Windows, writing files without specifying encoding="utf-8" can lead to encoding mismatches or UnicodeEncodeError if the config contains non-ASCII characters. Please specify encoding="utf-8" when writing the config file.

Suggested change

with open(config_path, "w") as f:

with open(config_path, "w", encoding="utf-8") as f:

pyproject gates torch off on darwin+arm64 (mlx is the native path), but tests/mlx_simulation/__init__.py unconditionally imports torch. The autouse _install_mlx_torch_shim fixture in test_mlx_save_export_regressions.py runs on every test in the file, so torch is required on macos-14 too.

danielhanchen · 2026-05-27T13:14:47Z

All three OS jobs green on commit c9fc1960:

macos-14 (Apple Silicon, real mlx/mlx-lm/mlx-vlm wheels): 23/23 save/export regressions PASSED + 56 shim smoke PASSED -> https://github.com/danielhanchen/unsloth-staging-2/actions/runs/26513116397
ubuntu-latest (mlx_simulation shim): https://github.com/danielhanchen/unsloth-staging-2/actions/runs/26513116634
windows-latest (mlx_simulation shim): https://github.com/danielhanchen/unsloth-staging-2/actions/runs/26513116392

Closing per staging-fork convention; never merged.

Lyxot and others added 18 commits May 27, 2026 15:26

fix(mlx): preserve vlm merged config saves

615aff8

fix(mlx): dequantize qlora merged saves

55fa173

fix(mlx): materialize tied lm head exports

d8cdf89

fix(mlx): forward gguf export options

0029c54

fix(mlx): normalize vlm mmproj export tensors

6fcaa19

fix(mlx): rewrite VLM GGUF tensor candidates

97c980f

Prefer HF vision aliases when inverting mlx-vlm sanitizers, while keeping same-name tensor rewrites as a fallback. This preserves Gemma3 patch embedding layout fixes and avoids stopping early on Qwen-family MLX vision_tower names.

fix(mlx): extract vlm config objects for saves

962dec8

fix(gguf): run package converters beside conversion

4546225

fix(mlx): align gguf nextn metadata with tensors

d6d5d39

fix(mlx): repair degraded VLM processors

398cb9c

fix(mlx): preserve source save sidecars

df5deb9

test(mlx): cover save export regressions

e530537

fix(mlx): harden save export review cases

0336fc3

fix(mlx): address save export review cases

16ca0d1

revert(mlx): drop tied lm head materialization

f73f432

This reverts commit d8cdf89. Also removes the related regression coverage and later helper hardening now that bug-4 is out of scope.

Merge PR unslothai#697 head (fix/mlx-save-gguf-export-parity) into st…

30e70b4

…aging branch

gemini-code-assist Bot reviewed May 27, 2026

View reviewed changes

danielhanchen closed this May 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PR #697 cross-OS validation (do not merge)#156

PR #697 cross-OS validation (do not merge)#156
danielhanchen wants to merge 19 commits into
pr697-basefrom
pr697-cross-os

danielhanchen commented May 27, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 27, 2026

Uh oh!

gemini-code-assist Bot May 27, 2026

Uh oh!

gemini-code-assist Bot May 27, 2026

Uh oh!

gemini-code-assist Bot May 27, 2026

Uh oh!

gemini-code-assist Bot May 27, 2026

Uh oh!

danielhanchen commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	with open(config_path, "r") as f:
	with open(config_path, "r", encoding="utf-8") as f:

	with open(config_path, "w") as f:
	with open(config_path, "w", encoding="utf-8") as f:

	with open(index_path, "r") as f:
	with open(index_path, "r", encoding="utf-8") as f:

	with open(index_path, "w") as f:
	with open(index_path, "w", encoding="utf-8") as f:

Conversation

danielhanchen commented May 27, 2026

Branches

Jobs

Sweep rationale

Lifecycle

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 27, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 27, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 27, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 27, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 27, 2026

Choose a reason for hiding this comment

Uh oh!

danielhanchen commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants