[BREAKING][skyrl-train] Implement loss reduction via advantage normalization and fix token_mean reduction strategy#1296
Merged
erictang000 merged 26 commits intoNovaSky-AI:mainfrom Mar 31, 2026
Commits
Commits on Mar 9, 2026
Commits on Mar 10, 2026
Commits on Mar 20, 2026
Commits on Mar 25, 2026
Commits on Mar 27, 2026
- andcommitted
- andcommitted
- committed
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Mar 30, 2026
- committed
- committed
- committed
- committed
- committed
- committed