Skip to content

Fix: adding magistral fsdp config, fixing not eval with test_datasets, handle mllama attention#2789

Merged
NanoCode012 merged 4 commits into
mainfrom
fix/prerelease-fix
Jun 14, 2025
Merged

Fix: adding magistral fsdp config, fixing not eval with test_datasets, handle mllama attention#2789
NanoCode012 merged 4 commits into
mainfrom
fix/prerelease-fix