
Conversation

@garrett361 garrett361 (Owner) commented Jun 27, 2025

Alters the SFT linear learning rate scheduler so that the final lr is configurable, rather than always decaying to zero. The final learning rate is learning_rate * final_lr_ratio.

NOTE: this changes the default behavior: the default is now --final_lr_ratio 0.1, whereas previously finetune.py effectively ran with --final_lr_ratio 0.0. The 0.1 default is inspired by Qwen.

NOTE: only implemented for the linear scheduler. This PR disables other scheduler options.
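
For intuition, here is a minimal sketch of the horizon-stretching arithmetic a linear scheduler needs in order to bottom out at a nonzero lr (names here are illustrative, not necessarily those used in finetune.py):

```python
# Hugging Face's linear schedule decays the lr multiplier from 1 at the end
# of warmup (step W) to 0 at the scheduler horizon (step N):
#     multiplier(t) = (N - t) / (N - W)    for W <= t <= N
# If training actually stops at step T, requiring multiplier(T) == r
# (r = final_lr_ratio) and solving for N gives N = (T - r * W) / (1 - r).

def stretched_horizon(num_training_steps: int, num_warmup_steps: int, final_lr_ratio: float) -> int:
    """Scheduler horizon that leaves the lr at lr * final_lr_ratio at the last real step."""
    return int(
        (num_training_steps - final_lr_ratio * num_warmup_steps)
        / (1 - final_lr_ratio)
    )

# Example: 1000 training steps, 30 warmup steps, final_lr_ratio = 0.1
# -> horizon of int((1000 - 3) / 0.9) = 1107, and at step 1000 the
#    multiplier is (1107 - 1000) / (1107 - 30) ≈ 0.099, i.e. ~0.1.
```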

| # | PR | Title |
| --- | --- | --- |
| 1 | #15 | padding-free |
| 2 | #16 | clean_checkpoints_at_end |
| 3 | #17 | final_lr_ratio (this PR) |
| 4 | #18 | add_seed_and_date_to_run_name |
| 5 | #19 | additional_model_arguments |
| 6 | #20 | sync_each_batch=True grad acc |
| 7 | #21 | no grad acc averaging for sum losses |
| 8 | #22 | extra reporting |
| 9 | #23 | local_main_process_first when building dataset |

@garrett361 garrett361 changed the title from "[3/9] WIP: final_lr_ratio" to "[3/9] final_lr_ratio" Jun 27, 2025
@garrett361 garrett361 force-pushed the padding-free-squashing-2 branch from eb9e294 to 8a3148c on June 27, 2025 19:23
@garrett361 garrett361 force-pushed the padding-free-squashing-3 branch from d1a5006 to e231062 on June 27, 2025 19:23
@garrett361 garrett361 force-pushed the padding-free-squashing-2 branch from 8a3148c to 3b77ec7 on June 27, 2025 20:21
@garrett361 garrett361 force-pushed the padding-free-squashing-3 branch from e231062 to a2546c2 on June 27, 2025 20:21
@garrett361 garrett361 force-pushed the padding-free-squashing-2 branch from 3b77ec7 to 01e3cfd on June 27, 2025 20:48
@garrett361 garrett361 force-pushed the padding-free-squashing-3 branch from a2546c2 to bdc2c43 on June 27, 2025 20:48
@garrett361 garrett361 force-pushed the padding-free-squashing-2 branch from 01e3cfd to da5dbb1 on June 27, 2025 20:54
@garrett361 garrett361 force-pushed the padding-free-squashing-3 branch 3 times, most recently from 0548cfd to b6d7b83 on June 27, 2025 21:19
@fabianlim fabianlim (Collaborator) commented

Seems like there is some rebasing trouble?

@fabianlim fabianlim changed the base branch from padding-free-squashing-2 to padding-free-squashing June 28, 2025 00:26
@fabianlim fabianlim changed the base branch from padding-free-squashing to main June 28, 2025 00:28
@fabianlim fabianlim (Collaborator) left a comment

final_lr_ratio should not be a required setting; if the scheduler is not linear, it should not have any effect.

@garrett361 garrett361 force-pushed the padding-free-squashing-3 branch from b6d7b83 to 7a0671a Compare June 28, 2025 01:39
@garrett361 garrett361 (Owner Author) commented

@fabianlim agree with everything, updated

```python
num_warmup_steps = int(num_training_steps_for_scheduler * args.warmup_ratio)
if args.final_lr_ratio is not None and args.lr_scheduler_type == "linear":
    # Correct num_training_steps_for_scheduler to respect final_lr_ratio for a linear scheduler
    num_training_steps_for_scheduler = (
```
@fabianlim fabianlim (Collaborator) commented on the diff above

num_training_steps_for_scheduler is not used anywhere else except in get_scheduler, right?

@garrett361 garrett361 (Owner Author) replied

Correct, it's just defined here, and maybe updated if the user specifies final_lr_ratio and is using a linear scheduler.
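
For completeness, a hedged sketch of how the corrected value plausibly flows into get_scheduler. The completion of the truncated assignment above is inferred from the discussion, not copied from the PR, and optimizer and args are assumed to be in scope:

```python
from transformers import get_scheduler

num_warmup_steps = int(num_training_steps_for_scheduler * args.warmup_ratio)
if args.final_lr_ratio is not None and args.lr_scheduler_type == "linear":
    # Stretch the horizon so the linear schedule bottoms out at
    # learning_rate * final_lr_ratio instead of zero (see the derivation
    # after the PR description above).
    num_training_steps_for_scheduler = int(
        (num_training_steps_for_scheduler - args.final_lr_ratio * num_warmup_steps)
        / (1 - args.final_lr_ratio)
    )

# num_training_steps_for_scheduler is consumed only here, as noted above.
lr_scheduler = get_scheduler(
    name=args.lr_scheduler_type,
    optimizer=optimizer,
    num_warmup_steps=num_warmup_steps,
    num_training_steps=num_training_steps_for_scheduler,
)
```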

@fabianlim fabianlim mentioned this pull request Jun 28, 2025
@fabianlim fabianlim merged commit 8f21b76 into main Jun 30, 2025
2 checks passed