Skip to content

mixtral: drop training-branching hack for SFT segfault & add ZeRO-3 leaf utility#2185

Merged
regisss merged 23 commits into
huggingface:mainfrom
yafshar:mixtral/remove-sft-segfault-hack
Aug 21, 2025
Merged

mixtral: drop training-branching hack for SFT segfault & add ZeRO-3 leaf utility#2185
regisss merged 23 commits into
huggingface:mainfrom
yafshar:mixtral/remove-sft-segfault-hack

Commits

Commits on Jul 23, 2025

Commits on Jul 30, 2025

Commits on Aug 4, 2025

Commits on Aug 8, 2025

Commits on Aug 12, 2025

Commits on Aug 13, 2025

Commits on Aug 14, 2025

Commits on Aug 20, 2025