Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change to save checkpoint and run validation on N steps instead of epochs #204

Closed
roedoejet opened this issue Dec 18, 2023 · 2 comments · Fixed by #461
Closed

Change to save checkpoint and run validation on N steps instead of epochs #204

roedoejet opened this issue Dec 18, 2023 · 2 comments · Fixed by #461
Assignees
Labels
enhancement New feature or request help wanted Extra attention is needed
Milestone

Comments

@roedoejet
Copy link
Member

Since validation on every epoch can cause performance problems on small datasets.

@roedoejet roedoejet self-assigned this Dec 18, 2023
@roedoejet roedoejet added enhancement New feature or request beta labels Jan 16, 2024
@roedoejet roedoejet added this to the beta milestone Jan 16, 2024
@roedoejet roedoejet removed the beta label Jan 16, 2024
@roedoejet
Copy link
Member Author

@roedoejet roedoejet modified the milestones: beta, alpha May 1, 2024
@roedoejet roedoejet removed their assignment Jun 10, 2024
@roedoejet roedoejet added the help wanted Extra attention is needed label Jun 10, 2024
@SamuelLarkin
Copy link
Collaborator

SamuelLarkin commented Jun 11, 2024

Notes:
TRAINER

  • limit_val_batches (Union[int, float, None]) – How much of validation dataset to check (float = fraction, int = num_batches). Default: 1.0.
  • val_check_interval (Union[int, float, None]) – How often to check the validation set. Pass a float in the range [0.0, 1.0] to check after a fraction of the training epoch. Pass an int to check after a fixed number of training batches. An int value can only be higher than the number of training batches when check_val_every_n_epoch=None, which validates after every N training batches across epochs or during iteration-based training. Default: 1.0.
  • check_val_every_n_epoch (Optional[int]) – Perform a validation loop every after every N training epochs. If None, validation will be done solely based on the number of training batches, requiring val_check_interval to be an integer value. Default: 1.

val_check_interval was added in Support val_check_interval values higher than number of training batches and was merged on Apr 21, 2022.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants