Overfitting on ResNet18 #161

MichaelLee-ceo · 2023-03-24T09:39:22Z

I copied resnet.py to construct ResNet18 and started training.
The hyperparameters I used are the same as those defined in main.py, but the model ends up overfitting.
I get almost 98% accuracy on the training set but only 89% on the test set.
Did I misconfigure something, or how can I avoid the overfitting? Thanks

KWang1998 · 2023-04-15T18:45:13Z

Same here. I also got about 85% on the test set with ResNet18. I haven't figured out the reason yet.
Have you resolved this issue? Thanks

MichaelLee-ceo · 2023-04-16T06:33:56Z

Hi, I figured out that the learning rate scheduler "CosineAnnealingLR" affects both training and testing accuracy; it helps the model converge more fully by the end of training.
After I added it back to my code, I reached about 93% with ResNet18, so you may want to try that.
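
For anyone wondering what the scheduler actually does to the learning rate, here is a minimal pure-Python sketch of the closed-form cosine annealing curve. The function name `cosine_annealing_lr` and the values `base_lr=0.1`, `t_max=200`, and `eta_min=0.0` are assumptions chosen for illustration, not values confirmed from main.py; substitute your own settings.

```python
import math


def cosine_annealing_lr(epoch, base_lr=0.1, t_max=200, eta_min=0.0):
    """Closed-form cosine annealing: decay base_lr to eta_min over t_max epochs.

    Hypothetical helper for illustration; the defaults are assumed values.
    """
    cosine = math.cos(math.pi * epoch / t_max)
    return eta_min + 0.5 * (base_lr - eta_min) * (1.0 + cosine)


# Print the learning rate at a few points along the (assumed) 200-epoch run.
for epoch in range(0, 201, 50):
    print(f"epoch {epoch:3d}: lr = {cosine_annealing_lr(epoch):.5f}")
```

The rate starts at `base_lr`, falls slowly at first, and approaches `eta_min` in the final epochs, which is probably why the last part of training converges better than with a constant rate. In PyTorch the same curve should come from `torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=...)`, with `scheduler.step()` called once per epoch after the optimizer updates; leaving out that `step()` call keeps the learning rate fixed at its initial value.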
