Skip to content

Fix Gemma-4 GRPO catastrophic KL divergence with TRL 1.0.0+#21

Closed
danielhanchen wants to merge 2 commits into
mainfrom
pr-4934-head
Closed

Fix Gemma-4 GRPO catastrophic KL divergence with TRL 1.0.0+#21
danielhanchen wants to merge 2 commits into
mainfrom
pr-4934-head

[pre-commit.ci] auto fixes from pre-commit.com hooks

1f27883
Select commit
Loading
Failed to load commit list.
GitHub Advanced Security / CodeQL succeeded Apr 9, 2026 in 53s

No new alerts in code changed by this pull request