Skip to content

feat(experimental): Divergence Proximal Policy Optimization#5117

Merged
LeonEricsson merged 28 commits into
huggingface:mainfrom
LeonEricsson:feature/dppo
Mar 19, 2026
Merged

feat(experimental): Divergence Proximal Policy Optimization#5117
LeonEricsson merged 28 commits into
huggingface:mainfrom
LeonEricsson:feature/dppo

Commits

Commits on Feb 13, 2026

Commits on Feb 15, 2026

Commits on Feb 18, 2026

Commits on Feb 22, 2026

Commits on Feb 25, 2026

Commits on Feb 26, 2026

Commits on Feb 27, 2026

Commits on Mar 1, 2026

Commits on Mar 18, 2026