Skip to content

Update Qwen3 235B A22B MXFP8 GB200/300 recipe and resolve NaN grad norm#2209

Merged
erhoo82 merged 1 commit intoNVIDIA-NeMo:mainfrom
dingqingy-nv:qwen3_mxfp8_recipe_update
Feb 4, 2026
Merged

Update Qwen3 235B A22B MXFP8 GB200/300 recipe and resolve NaN grad norm#2209
erhoo82 merged 1 commit intoNVIDIA-NeMo:mainfrom
dingqingy-nv:qwen3_mxfp8_recipe_update

Commits

Commits on Feb 4, 2026