Add UperNet #20648

NielsRogge · 2022-12-07T12:38:04Z

What does this PR do?

This PR adds the classic UperNet framework to Transformers.

Many papers that introduce a new vision backbone, such as BEiT, ConvNeXt, Swin,... benchmark their model on downstream tasks such as semantic segmentation and object detection. All of these papers use the UperNet framework (introduced in 2018) when evaluating their backbone on semantic segmentation.

Hence, this PR implements this framework, making use of the new AutoBackbone API to make the following possible:

from transformers import SwinConfig, UperNetConfig, UperNetForSemanticSegmentation

backbone_config = SwinConfig(out_features=["stage1", "stage2", "stage3", "stage4"])

config = UperNetConfig(backbone_config=backbone_config)
model = UperNetForSemanticSegmentation(config)

In the code above, we're instantiating the UperNet framework with Swin Transformer as backbone. The code looks equivalent for another backbone, like ConvNeXt:

from transformers import ConvNextBackbone, UperNetConfig, UperNetForSemanticSegmentation

backbone_config = ConvNextBackbone(out_features=["stage1", "stage2", "stage3", "stage4"])

config = UperNetConfig(backbone_config=backbone_config)
model = UperNetForSemanticSegmentation(config)

To do:

looking into supporting from_pretrained of backbones => will be done in a follow-up PR
make sure UperNetImageProcessor does exact same preprocessing
make UperNetImageProcessor also take segmentation_maps as optional input
add image processor tests
convert all checkpoints + update organization
fix integration tests

HuggingFaceDocBuilderDev · 2022-12-07T13:50:14Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Thanks for working on this new model. Left a couple of comments.

README.md

docs/source/en/model_doc/upernet.mdx

src/transformers/models/auto/configuration_auto.py

src/transformers/models/donut/modeling_donut_swin.py

src/transformers/models/swin/modeling_swin.py

src/transformers/models/upernet/modeling_upernet.py

docs/source/en/model_doc/upernet.mdx

src/transformers/models/swin/modeling_swin.py

sgugger · 2022-12-16T11:12:23Z

src/transformers/models/upernet/configuration_upernet.py

+            )
+            if isinstance(backbone_config, dict):
+                config_class = CONFIG_MAPPING[backbone_model_type]
+                backbone_config = config_class.from_dict(backbone_config)


Maybe raise an error if then the type is not a PretrainedConfig?

src/transformers/models/upernet/modeling_upernet.py

utils/check_repo.py

alaradirik

Thanks for adding this!

Left a few minor comments but everything looks good apart from the issues/comments related to configuration and model parameter initialisation (+ organization name update) and works as expected.

src/transformers/models/upernet/__init__.py

src/transformers/models/upernet/modeling_upernet.py

src/transformers/models/upernet/convert_convnext_upernet_to_pytorch.py

src/transformers/models/upernet/convert_swin_upernet_to_pytorch.py

sgugger

LGTM on the model initialization. One last comment on the image processor: it needs to be added in the auto mapping, and since it seems to be a full copy of the SegformerImageProcessor, you should re-use it in the auto-mapping and not introduce a new image processor here.

src/transformers/models/upernet/configuration_upernet.py

src/transformers/models/upernet/image_processing_upernet.py

sgugger

Thanks for all your work on this model!

NielsRogge · 2022-12-23T08:51:30Z

Thanks for the review, I'm waiting for the authors to respond regarding the creation of an organization on the hub.

NielsRogge force-pushed the add_upernet_swin_encoder branch from 41fe6b6 to f899eea Compare December 7, 2022 12:54

NielsRogge force-pushed the add_upernet_swin_encoder branch from 7e880ea to 11db99f Compare December 9, 2022 08:49

NielsRogge mentioned this pull request Dec 9, 2022

Add OneFormer Model #20577

Merged

5 tasks

NielsRogge force-pushed the add_upernet_swin_encoder branch from 70736a5 to 963dc11 Compare December 14, 2022 10:17

NielsRogge marked this pull request as ready for review December 14, 2022 15:48

NielsRogge requested review from alaradirik and sgugger December 14, 2022 15:48

sgugger reviewed Dec 14, 2022

View reviewed changes

NielsRogge mentioned this pull request Dec 14, 2022

Add Swin backbone #20769

Merged

NielsRogge force-pushed the add_upernet_swin_encoder branch from d657041 to f415981 Compare December 15, 2022 21:35

NielsRogge requested a review from sgugger December 16, 2022 08:23

NielsRogge mentioned this pull request Dec 16, 2022

Add Mask2Former #20792

Merged

5 tasks

sgugger reviewed Dec 16, 2022

View reviewed changes

alaradirik approved these changes Dec 19, 2022

View reviewed changes

NielsRogge mentioned this pull request Dec 20, 2022

HuggingFace integration open-mmlab/mmsegmentation#2424

Open

sgugger reviewed Dec 21, 2022

View reviewed changes

src/transformers/models/upernet/configuration_upernet.py Outdated Show resolved Hide resolved

src/transformers/models/upernet/image_processing_upernet.py Outdated Show resolved Hide resolved

sgugger approved these changes Dec 23, 2022

View reviewed changes

Niels Rogge added 11 commits January 13, 2023 12:14

First draft

d623a9b

More improvements

b41886c

Add convnext backbone

0958720

Add conversion script

d88a250

Add more improvements

275c084

Comment out to_dict

86e4f8f

Add to_dict method

76da127

Add default config

09c76be

Fix config

4ba8f4c

Fix backbone

6aa5c65

Fix backbone some more

5f1dda2

Niels Rogge and others added 24 commits January 13, 2023 13:26

Add integration test

4ed9394

Add convnext integration test

c1e7c2f

Update docstring

feb0852

Fix README

7efe1d4

Simplify config

37da574

Apply suggestions

dae5567

Improve docs

9447dbe

Rename class

95ff604

Fix test_initialization

75dd26e

Fix import

4437fb3

Address review

3c3ae6f

Fix confg

ea9c71d

Convert all checkpoints

3a4dd36

Fix default backbone

dbd47c4

Usage same processor as segformer

77cfe29

Apply suggestions

f7cc21e

Fix init_weights, update conversion scripts

774f940

Improve config

5f93106

Use Auto API instead of creating a new image processor

55a1321

Fix docs

9ce6c57

Add doctests

d4533d2

Remove ResNetConfig dependency

de32d46

Add always_partition argument

ca576e3

Fix rebaseé

4369af8

NielsRogge force-pushed the add_upernet_swin_encoder branch from 586eebc to 4369af8 Compare January 13, 2023 12:45

Niels Rogge added 2 commits January 13, 2023 14:27

Improve docs

b8b604d

Convert checkpoints

8e427a2

NielsRogge merged commit 4ed89d4 into huggingface:main Jan 16, 2023

Add UperNet #20648

Add UperNet #20648

Uh oh!

Conversation

NielsRogge commented Dec 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Dec 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sgugger Dec 16, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alaradirik left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sgugger left a comment

Choose a reason for hiding this comment

Uh oh!

NielsRogge commented Dec 23, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

NielsRogge commented Dec 7, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Dec 7, 2022 •

edited

Loading