Adding EfficientNetV2 architecture #5450
Conversation
💊 CI failures summary: as of commit 364da8f, 💚 looks good so far; there are no failures yet. (This comment was automatically generated by Dr. CI.)
Looks good to me so far. Just left a small nit
Force-pushed from 8f20299 to 9310325
Have you checked the accuracy of the converted TF weights? In TF, they used a different BN parameter:
@xiaohu2015 Yes. You take a hit of about 0.3 on the Small model. The reason I haven't overwritten the BN configuration is that we plan to train the model from scratch rather than use the TF weights. A few techniques used in the paper (like progressive learning) are not supported by our reference scripts, so depending on how close our training gets, we might try to close the gap with the proposed patch.
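For context, a minimal sketch of what the "BN patch" discussed above might look like. This is an illustration, not the PR's actual code; the builder name assumes the API added by this PR. The eps values are standard: TF's BatchNormalization defaults to eps=1e-3 while PyTorch's `nn.BatchNorm2d` defaults to eps=1e-5, which matters when evaluating TF-ported weights.

```python
import torch.nn as nn
from torchvision.models import efficientnet_v2_s  # builder assumed from this PR

model = efficientnet_v2_s()  # TF-ported weights would be loaded separately

# Align BatchNorm eps with the TF training configuration (1e-3)
# instead of PyTorch's default (1e-5) before evaluation.
for m in model.modules():
    if isinstance(m, nn.BatchNorm2d):
        m.eps = 1e-3
```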
LGTM, just have a small nit
I have a question: you got 83.1 top-1 accuracy using the TF weights; is this result tested on …?
@xiaohu2015 Yes, correct. The accuracy you see in the source code (83.1) is with the TF weights and without the BN patch. After applying the BN patch, we can reach 83.6. The reason the BN patch is not in the source code is that I'm currently training the model from scratch and I want to see whether it's necessary. Note that the ported TF weights were added just for my convenience, to run some of the tests and to check that nothing is fundamentally broken in the implementation.
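For reference, a hedged sketch of how such a top-1 accuracy check might be run. The 384×384 eval resolution matches the EfficientNetV2-S config; the `EfficientNet_V2_S_Weights` enum name follows torchvision's later multi-weight API, and the dataset path is a placeholder.

```python
import torch
from torchvision import datasets, transforms
from torchvision.models import efficientnet_v2_s, EfficientNet_V2_S_Weights

preprocess = transforms.Compose([
    transforms.Resize(384),
    transforms.CenterCrop(384),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
val = datasets.ImageNet("/path/to/imagenet", split="val", transform=preprocess)
loader = torch.utils.data.DataLoader(val, batch_size=64, num_workers=8)

model = efficientnet_v2_s(weights=EfficientNet_V2_S_Weights.IMAGENET1K_V1).eval()
correct = 0
with torch.no_grad():
    for images, targets in loader:
        correct += (model(images).argmax(dim=1) == targets).sum().item()
print(f"top-1: {100.0 * correct / len(val):.2f}%")
```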
Thanks. I ask because I used the timm weights (converted from TF), but I cannot get that accuracy even with the BN patch (about 83.0%); maybe I missed something.
@xiaohu2015 I've just replaced the weights for the Small variant with weights trained from scratch using TorchVision's recipe. We can do better than the paper by ~0.3 points:

The above means that we don't have to implement TF-specific tricks to reproduce the paper, which massively simplifies our code. Here are the results from training Medium from scratch:

And here are the Large weights ported from the paper:
Force-pushed from 9057045 to abeac10
LGTM, thank you!
Summary:
* Extend the EfficientNet class to support v1 and v2.
* Refactor config/builder methods and add prototype builders.
* Refactor weight info.
* Update dropouts based on the TF config ref.
* Update BN eps on the TF base_config.
* Use Conv2dNormActivation.
* Add pre-trained weights for EfficientNetV2-S.
* Add Medium and Large weights.
* Update stats with a single batch run.
* Add accuracies to the docs.

Reviewed By: vmoens
Differential Revision: D34878984
fbshipit-source-id: 1f771dc1173dcdcf21391fb01dfa79d7c3608c5f
Related to #2707.

Adds an EfficientNetV2 implementation on top of the existing EfficientNet class.

This PR is influenced by earlier work done by @xiaohu2015 in #4910.
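For readers unfamiliar with the architecture change, here is a simplified sketch of the FusedMBConv block that v2 adds alongside v1's MBConv: the 1x1 expansion conv and the 3x3 depthwise conv are fused into a single regular 3x3 conv. This is an illustration of the idea rather than the PR's exact code (the real block also handles stochastic depth), using the `Conv2dNormActivation` op mentioned in the summary.

```python
import torch
from torch import nn
from torchvision.ops import Conv2dNormActivation

class FusedMBConv(nn.Module):
    def __init__(self, in_ch: int, out_ch: int, expand_ratio: int, stride: int):
        super().__init__()
        self.use_residual = stride == 1 and in_ch == out_ch
        expanded = in_ch * expand_ratio
        layers = []
        if expand_ratio != 1:
            # Fused expand: one plain 3x3 conv replaces expand-1x1 + depthwise-3x3.
            layers.append(Conv2dNormActivation(in_ch, expanded, kernel_size=3,
                                               stride=stride,
                                               activation_layer=nn.SiLU))
            # Project back down, with no activation after the projection.
            layers.append(Conv2dNormActivation(expanded, out_ch, kernel_size=1,
                                               activation_layer=None))
        else:
            # No expansion: a single 3x3 conv with activation.
            layers.append(Conv2dNormActivation(in_ch, out_ch, kernel_size=3,
                                               stride=stride,
                                               activation_layer=nn.SiLU))
        self.block = nn.Sequential(*layers)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        result = self.block(x)
        if self.use_residual:
            result = result + x  # skip connection when shapes match
        return result
```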