[ppo] feat: add critic valuehead model support for multi-modal PPO#1839
Merged
hiyouga merged 6 commits intoverl-project:mainfrom Jun 9, 2025
Merged
[ppo] feat: add critic valuehead model support for multi-modal PPO#1839hiyouga merged 6 commits intoverl-project:mainfrom
hiyouga merged 6 commits intoverl-project:mainfrom
Commits
Commits on Jun 4, 2025
Commits on Jun 5, 2025
- committed
- committed
Commits on Jun 6, 2025
- committed
Commits on Jun 7, 2025
- authored
- authored