#26566 swin2 sr allow in out channels #26568

marvingabler · 2023-10-03T16:27:03Z

What does this PR do?

This PR adds the feature of accepting arbitary number of input and output channels when using the Swin2SR model. This allows to perform super resolution from greyscale (1 channel) to color (rgb), or from low resolution multi band satellite to high resolution rgb satellite.

All examples and pretrained models are running as expected based on my tests. No new dependencies have been added.

Just use it like

from transformers import Swin2SRForImageSuperResolution, Swin2SRConfig
import torch

Swin2SRConfig = (
     num_channels_in=1,
     num_channels_out=3
)
model = Swin2SRForImageSuperResolution(Swin2SRConfig)

with torch.no_grad():
    # or use the image preprocessor per default
    out = model({"pixel_values":torch.randn((1,1,264,264))})

Fixes #26566.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Yes here
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests? No, test where there already.

Tagging the reviewers

text models: @ArthurZucker and @younesbelkada
vision models: @amyeroberts

… arbitary in and out channels

younesbelkada

Hi @marvingabler
Thanks for your contribution! as you are removing the num_channels attribute I think that this is a breaking change, what about keeping num_channels and make it behave as num_channels_in and use num_channels_out as an optional argument that is initialized as the same value as num_channels in case it is set to None. That way I believe changes will be backward compatible. What do you think?

marvingabler · 2023-10-04T11:14:57Z

Good point, yes lets do that! Let me update the PR soon :)

younesbelkada

Looking great to me! thanks!

HuggingFaceDocBuilderDev · 2023-10-04T12:16:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

ArthurZucker

Thanks! Can you add a small test as well? Making sure that a dummy model with a this can still perform as expected! 😉

src/transformers/models/swin2sr/configuration_swin2sr.py

src/transformers/models/swin2sr/convert_swin2sr_original_to_pytorch.py

Co-authored-by: Arthur <[email protected]>

marvingabler · 2023-10-04T14:51:03Z

Just realized that there are a couple of more changes required, as the Swin2SRForImageSuperResolution denormalizes based on the input images, while for the case of mapping from multiband images to single band, the mean&stds of inputs and outputs differ. Will add the changes soon.

…nnels!=#output_channels

ArthurZucker

Looks good to me thanks for adding this.

LysandreJik

Great, thanks!

deepweather added 5 commits October 3, 2023 16:14

feat: close huggingface#26566, changed model & config files to accept…

7ff83b9

… arbitary in and out channels

updated docstrings

9528e85

fix: linter error

e7daa5d

fix: update Copy docstrings

f48c722

fix: linter update

9f7e36d

younesbelkada reviewed Oct 4, 2023

View reviewed changes

deepweather added 2 commits October 4, 2023 11:22

fix: rename num_channels_in to num_channels to prevent breaking changes

080f794

fix: make num_channels_out None per default

c11373b

younesbelkada approved these changes Oct 4, 2023

View reviewed changes

younesbelkada requested review from ArthurZucker and amyeroberts October 4, 2023 12:00

ArthurZucker reviewed Oct 4, 2023

View reviewed changes

src/transformers/models/swin2sr/configuration_swin2sr.py Outdated Show resolved Hide resolved

src/transformers/models/swin2sr/convert_swin2sr_original_to_pytorch.py Outdated Show resolved Hide resolved

marvingabler and others added 3 commits October 4, 2023 14:34

Update src/transformers/models/swin2sr/configuration_swin2sr.py

e4da462

Co-authored-by: Arthur <[email protected]>

fix: update tests to include num_channels_out

4579be2

fix:linter

b628aeb

fix: remove normalization with precomputed rgb values when #input_cha…

6e8f008

…nnels!=#output_channels

marvingabler requested a review from ArthurZucker October 5, 2023 09:43

ArthurZucker approved these changes Oct 5, 2023

View reviewed changes

LysandreJik approved these changes Oct 5, 2023

View reviewed changes

LysandreJik merged commit 0a3b9d0 into huggingface:main Oct 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

#26566 swin2 sr allow in out channels #26568

#26566 swin2 sr allow in out channels #26568

Uh oh!

marvingabler commented Oct 3, 2023

Uh oh!

younesbelkada left a comment

Uh oh!

marvingabler commented Oct 4, 2023

Uh oh!

younesbelkada left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Oct 4, 2023

Uh oh!

ArthurZucker left a comment

Uh oh!

Uh oh!

Uh oh!

marvingabler commented Oct 4, 2023

Uh oh!

ArthurZucker left a comment

Uh oh!

LysandreJik left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

#26566 swin2 sr allow in out channels #26568

#26566 swin2 sr allow in out channels #26568

Uh oh!

Conversation

marvingabler commented Oct 3, 2023

What does this PR do?

Before submitting

Tagging the reviewers

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

marvingabler commented Oct 4, 2023

Uh oh!

younesbelkada left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Oct 4, 2023

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

marvingabler commented Oct 4, 2023

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

LysandreJik left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants