You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I copied the resnet.py for constructing ResNet18 and start training.
The hyperparameters I used to train the network are the same as defined in the main.py, but I eventually end up with overfitting on the testing set.
I got almost 98% accuracy on training set while only 89% on testing set.
Did I mis-configured something or how can I handle it to avoid overfitting? Thanks
The text was updated successfully, but these errors were encountered:
Hi, I have figured out that the learning rate scheduler "CosineAnnealingLR" has an impact on both training and testing accuracy, which can make the model more converged at the end of the training.
After I added it back to my code, I can achieve about 93% on ResNet18, so maybe you try that.
I copied the resnet.py for constructing ResNet18 and start training.
The hyperparameters I used to train the network are the same as defined in the main.py, but I eventually end up with overfitting on the testing set.
I got almost 98% accuracy on training set while only 89% on testing set.
Did I mis-configured something or how can I handle it to avoid overfitting? Thanks
The text was updated successfully, but these errors were encountered: