Need check_val_every_n_steps in Trainer #5565

del2z · 2021-01-19T03:24:55Z

🚀 Feature

Add an argument check_val_every_n_steps in Trainer.__init__ function to check metrics of validation set for certain steps.

Motivation

For many tasks, large models are trained in steps not complete epochs, especially pretrained models in CV and NLP. As a consequence, step-based arguments like max_steps, log_every_n_steps may be more convenient than epoch-based ones. However, the Trainer API only has a check_val_every_n_epoch argument for computing metrics of validation data. It's very helpful to have an additional argument like check_val_every_n_steps in Trainer constructor.

Pitch

Trainer.init(..., check_val_every_n_epoch=1, check_val_every_n_steps=100, ...)

The text was updated successfully, but these errors were encountered:

del2z · 2021-01-19T07:51:31Z

Another confusing concept is batch_idx in training_step, validation_step and test_step. A detailed example or illustration may be helpful to understand this concept. From my experience, batch_idx may not be widely used for developing models.

rohitgr7 · 2021-01-19T09:08:38Z

there is val_check_interval for that.

yuvalkirstain · 2022-02-01T13:09:14Z

@rohitgr7 val_check_interval can't exceed a single epoch. So it does not support evaluation every n steps where n is larger than the amount of batches in the dataloader.

rohitgr7 · 2022-02-01T13:35:37Z

hey @yuvalkirstain not yet.
here is the tracking issue: #8135

del2z added feature Is an improvement or enhancement help wanted Open to be worked on labels Jan 19, 2021

del2z closed this as completed Feb 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need check_val_every_n_steps in Trainer #5565

Need check_val_every_n_steps in Trainer #5565

del2z commented Jan 19, 2021 •

edited

Loading

del2z commented Jan 19, 2021 •

edited

Loading

rohitgr7 commented Jan 19, 2021

yuvalkirstain commented Feb 1, 2022

rohitgr7 commented Feb 1, 2022

Need check_val_every_n_steps in Trainer #5565

Need check_val_every_n_steps in Trainer #5565

Comments

del2z commented Jan 19, 2021 • edited Loading

🚀 Feature

Motivation

Pitch

del2z commented Jan 19, 2021 • edited Loading

rohitgr7 commented Jan 19, 2021

yuvalkirstain commented Feb 1, 2022

rohitgr7 commented Feb 1, 2022

del2z commented Jan 19, 2021 •

edited

Loading

del2z commented Jan 19, 2021 •

edited

Loading