Fix Gemma-4 GRPO catastrophic KL divergence with TRL 1.0.0+#21
Closed
danielhanchen wants to merge 2 commits into
Closed
Fix Gemma-4 GRPO catastrophic KL divergence with TRL 1.0.0+#21danielhanchen wants to merge 2 commits into
danielhanchen wants to merge 2 commits into
GitHub Advanced Security / CodeQL
succeeded
Apr 9, 2026 in 53s
No new alerts in code changed by this pull request
Loading