cp: feat: add Megatron support for on-policy distillation (1324) into r0.4.0 #1398
Merged