[tests] Review tests for PR #615 by danielhanchen · Pull Request #15 · shimmyshimmer/unsloth-zoo-staging-2

danielhanchen · 2026-05-04T01:51:44Z

Automated test files from review process

…ackend Add device_empty_cache() helper in device_type.py alongside the existing device_synchronize(), and route every torch.cuda.empty_cache() / .synchronize() call in saving_utils.py through these helpers so XPU and HIP builds no longer crash or silently no-op during GGUF export. Concretely, this fixes: - Unguarded torch.cuda.empty_cache() in the outer shard loop of merge_and_overwrite_lora and inside _merge_and_overwrite_lora_mxfp4, both of which raise "Torch not compiled with CUDA enabled" on XPU after the first shard / mxfp4 tensor is processed. - Six guarded torch.cuda.empty_cache() / .synchronize() sites inside _merge_and_overwrite_lora and _merge_and_overwrite_lora_mxfp4 that silently no-op on XPU, leaving XPU VRAM unflushed mid-export. Add a private _active_merge_device(W) helper that returns W.device when W is already on the active backend, otherwise constructs torch.device( DEVICE_TYPE_TORCH[, index]). Route _merge_lora and the five MoE expert merge helpers (_merge_moe_gate_expert, _merge_moe_up_expert, _merge_moe_down_proj_expert, _merge_moe_fused_gate_up_expert, _merge_moe_fused_down_proj_expert) through it so MoE LoRA merges run on the active accelerator instead of silently falling back to CPU on XPU. CUDA/HIP behavior is unchanged because DEVICE_TYPE_TORCH equals "cuda" for both backends and device_empty_cache() preserves the existing torch.cuda.is_available() guard.

Add device_is_bf16_supported() to device_type.py alongside the existing device_synchronize() and device_empty_cache() helpers, and route the three torch.cuda.is_bf16_supported() callsites in llama_cpp.py's convert_to_gguf mmproj/outtype branches through it. On XPU torch builds these calls would otherwise raise "Torch not compiled with CUDA enabled" during VLM GGUF export, mirroring the same crash class fixed in saving_utils.py. CUDA and HIP behavior is unchanged (DEVICE_TYPE in ("cuda","hip") -> the helper returns torch.cuda.is_bf16_supported() exactly as before).

Mirror the defensive hasattr pattern from device_is_bf16_supported in device_empty_cache so that a torch.xpu module that exposes is_available but not empty_cache (custom or partial XPU build) does not raise AttributeError when the active backend cache is flushed.

Mirror the defensive hasattr pattern already applied to device_empty_cache and device_is_bf16_supported so a torch.xpu module that exposes is_available but not synchronize (custom or partial XPU build) does not raise AttributeError when device_synchronize is invoked from the GGUF merge path.

Rename test_device_synchronize_partial_build.py to test_backend_device_helpers.py so the file name reflects the actual scope (dispatch and partial-build safety across all three backend helpers: device_synchronize, device_empty_cache, device_is_bf16_supported).

danielhanchen · 2026-05-04T04:06:56Z

Fixes pushed to unslothai#615.

andomeder and others added 5 commits April 28, 2026 21:31

fix: use backend device type in GGUF merge path

91cde98

fix: preserve active device index in GGUF merge path

1b90f79

Merge remote-tracking branch 'origin/main' into staging branch

93f1a8c

danielhanchen force-pushed the pr-615-tests branch from 2a87149 to 895ecb0 Compare May 4, 2026 02:33

danielhanchen force-pushed the pr-615-tests branch from 895ecb0 to 2fd55a4 Compare May 4, 2026 02:47

danielhanchen added 2 commits May 4, 2026 02:58

Add backend device helper tests

89c706c

danielhanchen force-pushed the pr-615-tests branch from 2fd55a4 to 9123710 Compare May 4, 2026 02:59

danielhanchen force-pushed the pr-615-tests branch from ab7b4a8 to b42a801 Compare May 4, 2026 03:21

danielhanchen closed this May 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[tests] Review tests for PR #615#15

[tests] Review tests for PR #615#15
danielhanchen wants to merge 9 commits into
mainfrom
pr-615-tests

danielhanchen commented May 4, 2026

Uh oh!

danielhanchen commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

danielhanchen commented May 4, 2026

Uh oh!

danielhanchen commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants