Fix MLX response masking and eval parity issues by Lyxot · Pull Request #2 · mmathew23/unsloth-zoo

Lyxot · 2026-05-28T15:19:51Z

This PR fixes several MLXTrainer/SFTTrainer parity issues around response masking, tokenization, and eval dataset handling.

MLX response-only batches kept fully masked samples
MLX only checked all-masked batches after batching, while the CUDA/SFT path filters samples whose post-mask labels are entirely -100.
Commit: 0fc7f14
Plain HF fast tokenizers failed in train_on_responses_only
MLX unwrapped any object with _tokenizer, but HF fast tokenizers expose _tokenizer as the low-level Rust tokenizer, which is not callable like PreTrainedTokenizerBase.
Commit: ae977ba
MLX text batching could double-add BOS
MLX used tokenizer.encode(text) directly, while CUDA/SFT disables special-token insertion when rendered text already starts with BOS or the chat template owns BOS.
Commit: 5dd5c00
Dict eval_dataset was treated as iterable rows
MLX passed dict eval datasets directly into batch creation/evaluation, so split names were iterated instead of preparing each eval split separately.
Commit: 4f70b5f

fix(mlx): filter fully masked response samples

0fc7f14

Copilot AI review requested due to automatic review settings May 28, 2026 15:19

Lyxot added 3 commits May 29, 2026 18:07

fix(mlx): preserve hf tokenizers for response masking

ae977ba

fix(mlx): avoid double bos in text batching

5dd5c00

fix(mlx): handle dict eval datasets

4f70b5f

Lyxot changed the title ~~fix(mlx): filter fully masked response samples~~ Fix MLX response masking and eval parity issues May 29, 2026

Provide feedback