ResNet-18 parameters are much much higher #136

abdulsam · 2021-06-01T11:38:58Z

I used pytorch-model-summary library to look at the summary of ResNet-18 model.

from pytorch_model_summary import summary

def ResNet18():
    print(summary(ResNet(BasicBlock, [2, 2, 2, 2]), torch.zeros((1, 3, 32, 32)), show_input=True))
    return ResNet(BasicBlock, [2, 2, 2, 2])

I observed that the number of parameters are much higher than the number of parameters mentioned in the paper Deep Residual Learning for Image Recognition for CIFAR-10 ResNet-18 model.
Have a look at the model summary:

Now look at the table mentioned in the paper:

Why the parameters are so high in this implemented model?

The text was updated successfully, but these errors were encountered:

vamshichowdary · 2021-06-16T18:49:12Z

This is because the Resnet implemented in this repo is not exactly the same as original author's implementation. Original author's implementation is more suited for imagenet dataset. I think the closer implementation to the one in paper is in pytorch's repo: https://github.com/pytorch/vision/blob/master/torchvision/models/resnet.py

Main differences I observe:

First conv layer is of 7x7 kernel size with stride=2 and padding=3 in the original resnet. In the repo its 3x3 with stride=1 and padding=1
There is no max pooling layer in this implementation (although this directly doesn't influence the number of parameters, I think it affects them in deeper layers)

bamert · 2021-11-08T20:32:32Z

Both this and the repo in https://github.com/pytorch/vision/blob/main/torchvision/models/resnet.py do not implement Resnet-20 for Cifar10 in the same way as described in Deep Residual Learning for Image Recognition. In addition to 1, 2 mentioned by vamshichowdary, the paper mentions

"The numbers of filters are {16, 32, 64} respectively". (Here they are 64, 128, 256, 512)

vgthengane · 2021-11-11T03:53:33Z

Have a look at this https://pytorch-tutorial.readthedocs.io/en/latest/tutorial/chapter03_intermediate/3_2_2_cnn_resnet_cifar10/. It uses the same configuration as mentioned in the Deep Residual Learning for Image Recognition.

DhDeepLIT · 2021-11-29T16:35:32Z

What about best accuracies when training from scratch ?

skrbnv mentioned this issue Aug 18, 2022

ResNet18 performs much better than expected! #149

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ResNet-18 parameters are much much higher #136

ResNet-18 parameters are much much higher #136

abdulsam commented Jun 1, 2021

vamshichowdary commented Jun 16, 2021

bamert commented Nov 8, 2021

vgthengane commented Nov 11, 2021

DhDeepLIT commented Nov 29, 2021

ResNet-18 parameters are much much higher #136

ResNet-18 parameters are much much higher #136

Comments

abdulsam commented Jun 1, 2021

vamshichowdary commented Jun 16, 2021

bamert commented Nov 8, 2021

vgthengane commented Nov 11, 2021

DhDeepLIT commented Nov 29, 2021