Fix completion_mask alignment and temperature scaling in Megatron GRPO trainer#8427
Merged
hjh0119 merged 2 commits intomodelscope:mainfrom Mar 26, 2026
Merged
Fix completion_mask alignment and temperature scaling in Megatron GRPO trainer#8427hjh0119 merged 2 commits intomodelscope:mainfrom
hjh0119 merged 2 commits intomodelscope:mainfrom