final_lr_ratio #745

garrett361 · 2025-07-01T15:23:11Z

Currently, finetune.py decays the SFT learning rate all the way to zero by default. This PR adds a --final_lr_ratio flag so that the LR instead decays only to learning_rate * final_lr_ratio over the course of SFT. Qwen, for instance, follows this strategy of decaying the LR by only a fixed fraction.

Currently implemented only for the linear scheduler.
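For illustration, here is a minimal sketch of the intended behavior for a linear schedule. It is not the code from this PR: the function name `linear_lr_multiplier` and its parameters are hypothetical, and it assumes optional linear warmup followed by a linear decay from the peak LR down to learning_rate * final_lr_ratio.

```python
def linear_lr_multiplier(
    step: int,
    num_training_steps: int,
    num_warmup_steps: int = 0,
    final_lr_ratio: float = 0.0,
) -> float:
    """Return the factor by which the peak learning rate is scaled at `step`.

    Hypothetical helper for illustration only; the names are assumptions,
    not the identifiers used in finetune.py.
    """
    # Linear warmup from 0 up to the peak learning rate.
    if num_warmup_steps > 0 and step < num_warmup_steps:
        return step / num_warmup_steps

    # Fraction of the decay phase completed, clamped to [0, 1].
    decay_steps = max(1, num_training_steps - num_warmup_steps)
    progress = min(1.0, (step - num_warmup_steps) / decay_steps)

    # Interpolate linearly from 1.0 down to final_lr_ratio.
    # With final_lr_ratio = 0.0 this reduces to the usual decay-to-zero schedule.
    return 1.0 - (1.0 - final_lr_ratio) * progress
```

A function of this shape could be wrapped with `functools.partial` and passed as `lr_lambda` to `torch.optim.lr_scheduler.LambdaLR`, which multiplies the optimizer's base LR by the returned factor at each step.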

final_lr_ratio

842ee07

hamishivi approved these changes Jul 2, 2025

hamishivi merged commit 8cbf07b into allenai:main Jul 2, 2025
3 checks passed

finbarrtimbers pushed a commit that referenced this pull request Jul 2, 2025

final_lr_ratio (#745)

cd0e77a

finbarrtimbers pushed a commit that referenced this pull request Jul 2, 2025

final_lr_ratio (#745)

c510ce8

fabianlim mentioned this pull request Jul 10, 2025

Port Padding-Free, #772

Merged
