-
I think this is a mistake: the code is computing the KL from the "ref_model", not the behaviour cloning model (the initial policy). The WebGPT paper says "The KL here is measured from the BC model and summed over the episode." It would be more informative (and easier to compare against the papers) if trlx logged the KL from the BC model/original policy/fine-tuned model.
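For concreteness, here is a rough sketch (plain PyTorch, not trlx's actual implementation) of the quantity the WebGPT paper describes: the KL between the current policy and a frozen copy of the initial/BC model, masked to the response tokens and summed over the episode. The tensor names (`policy_logits`, `ref_logits`, `mask`) are placeholders for whatever the training loop already has on hand.

```python
import torch
import torch.nn.functional as F

def summed_kl_from_bc_model(policy_logits: torch.Tensor,
                            ref_logits: torch.Tensor,
                            mask: torch.Tensor) -> torch.Tensor:
    """KL(policy || BC model) per token, masked and summed over the episode.

    policy_logits, ref_logits: (batch, seq_len, vocab) logits from the current
        policy and a frozen copy of the initial (BC) model.
    mask: (batch, seq_len), 1 on generated (response) tokens, 0 elsewhere.
    Returns a (batch,) tensor of per-episode KL in nats.
    """
    policy_logprobs = F.log_softmax(policy_logits, dim=-1)
    ref_logprobs = F.log_softmax(ref_logits, dim=-1)
    # Full-distribution KL at each position, summed over the vocabulary.
    per_token_kl = (policy_logprobs.exp() * (policy_logprobs - ref_logprobs)).sum(dim=-1)
    # Sum over the episode (response tokens only), as in the WebGPT description.
    return (per_token_kl * mask).sum(dim=-1)
```

Keeping the reference model frozen at the initial policy is what makes this comparable to the numbers in the papers; if `ref_model` is ever updated during training, the logged KL measures something else.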
-
In the OpenAI papers they stop training when the KL hits about 10 nats. How do I know when this is hit using trlx's W&B logs? It feels like `approx_kl` should be the thing, but clearly that's not it.
(Sorry if this is the wrong place to ask; I never know where to put these questions: Discord, issues, or discussions?)
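To be concrete about the number I mean, here is a rough sketch (plain PyTorch, hypothetical names rather than trlx's actual metrics) of the quantity the papers report: a Monte-Carlo estimate of KL(policy || initial policy) from the log-probs of the sampled tokens, summed over each response and averaged over the batch, with training stopped once it reaches roughly 10 nats. This is generally a different quantity from PPO's `approx_kl`, which measures how far one PPO update moved the policy from the sampling policy.

```python
import torch

KL_BUDGET_NATS = 10.0  # approximate stopping point used in the OpenAI papers

def mean_episode_kl_nats(policy_logprobs: torch.Tensor,
                         ref_logprobs: torch.Tensor,
                         mask: torch.Tensor) -> float:
    """Monte-Carlo estimate of KL(policy || initial policy), summed per episode.

    policy_logprobs, ref_logprobs: (batch, seq_len) log-probs of the *sampled*
        tokens under the current policy and the frozen initial (BC) model.
    mask: (batch, seq_len), 1 on response tokens, 0 elsewhere.
    """
    per_token = (policy_logprobs - ref_logprobs) * mask
    return per_token.sum(dim=-1).mean().item()

def should_stop(kl_nats: float) -> bool:
    # Stop (or tighten the KL coefficient) once the policy has drifted
    # ~10 nats from the initial model.
    return kl_nats >= KL_BUDGET_NATS
```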