Skip to content

Fix Gemma-4 GRPO catastrophic KL divergence with TRL 1.0.0+#21

Closed
danielhanchen wants to merge 2 commits into
mainfrom
pr-4934-head
Closed

Fix Gemma-4 GRPO catastrophic KL divergence with TRL 1.0.0+#21
danielhanchen wants to merge 2 commits into
mainfrom
pr-4934-head

Commits

Commits on Apr 9, 2026