Fix squared optimization steps bug by twerkmeister · Pull Request #280 · huggingface/setfit

twerkmeister · 2023-01-18T15:34:49Z

Currently, when increasing the number of epochs, the number of steps per epoch is also increased, leading to a number of optimization steps thats roughly (examples / batch size) * epochs * epochs instead of (examples / batch size) * epochs. The proposed change should fix this.

tomaarsen · 2023-01-18T15:54:49Z

Hello!

This bug was discovered #268 (comment), and a fix was proposed. It involves simply removing this line:

setfit/src/setfit/trainer.py

Line 378 in f777c2c

steps_per_epoch=train_steps,

When steps_per_epoch is not specified, the number of steps per epoch will be equivalent to the number of datapoints in the dataloader. See the docs on the fit method for more info.

The remainder of the code would not need any changes.
Would you agree that this is a preferable solution over the changes that you have proposed? I think it's best if we make a new PR for this.

Tom Aarsen

twerkmeister · 2023-01-19T09:30:25Z

Hey @tomaarsen, I removed the argument as you mentioned. Beyond that, this version also fixes the fact that for the triplet losses self.num_epochs was used to calculate total number of optimization steps for reporting when in fact num_epochs should be used because self.num_epochs can be shadowed by an argument to the train method

tomaarsen · 2023-01-19T09:37:22Z

You're very right! Thanks for catching that. I'll merge this once the tests go green 🎉

tomaarsen · 2023-01-19T09:49:11Z

src/setfit/trainer.py

+            logger.info(f"  Total optimization steps = {len(train_dataloader) * num_epochs}")
            logger.info(f"  Total train batch size = {batch_size}")

            warmup_steps = math.ceil(train_steps * self.warmup_proportion)


Suggested change

warmup_steps = math.ceil(train_steps * self.warmup_proportion)

warmup_steps = math.ceil(train_steps * self.warmup_proportion)

This still uses the train_steps variable that you removed.

tomaarsen · 2023-01-23T10:49:52Z

Wonderful! Thank you for this fix, and for the edits on the PR @twerkmeister!

Based on the bug fix from huggingface#280

Based on the bug fix from #280

Fix squared optimization steps bug

a67f3d9

tomaarsen added the bug Something isn't working label Jan 18, 2023

removed steps_per_epoch body.fit argument

e603995

tomaarsen reviewed Jan 19, 2023

View reviewed changes

twerkmeister and others added 2 commits January 23, 2023 09:50

reinstated total_train_step variable

721b384

Reformat trainer.py

39627c4

tomaarsen merged commit 29c0348 into huggingface:main Jan 23, 2023

tomaarsen added a commit to tomaarsen/setfit that referenced this pull request Jan 23, 2023

Fix squared optimization steps bug in distillation trainer

197dcd6

Based on the bug fix from huggingface#280

tomaarsen mentioned this pull request Jan 23, 2023

Fix squared optimization steps bug in distillation trainer #284

Merged

tomaarsen added a commit that referenced this pull request Jan 23, 2023

Fix squared optimization steps bug in distillation trainer (#284)

0cb8ffd

Based on the bug fix from #280

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix squared optimization steps bug#280

Fix squared optimization steps bug#280
tomaarsen merged 4 commits intohuggingface:mainfrom
twerkmeister:squared-steps-bug

twerkmeister commented Jan 18, 2023

Uh oh!

tomaarsen commented Jan 18, 2023

Uh oh!

twerkmeister commented Jan 19, 2023

Uh oh!

tomaarsen commented Jan 19, 2023

Uh oh!

tomaarsen Jan 19, 2023 •

edited

Loading

Uh oh!

tomaarsen commented Jan 23, 2023 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	warmup_steps = math.ceil(train_steps * self.warmup_proportion)
	warmup_steps = math.ceil(train_steps * self.warmup_proportion)

Conversation

twerkmeister commented Jan 18, 2023

Uh oh!

tomaarsen commented Jan 18, 2023

Uh oh!

twerkmeister commented Jan 19, 2023

Uh oh!

tomaarsen commented Jan 19, 2023

Uh oh!

tomaarsen Jan 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tomaarsen commented Jan 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tomaarsen Jan 19, 2023 •

edited

Loading

tomaarsen commented Jan 23, 2023 •

edited

Loading