Bug fix for drop path decay rate in swin transformer #34291
ArthurZucker merged 19 commits into huggingface:main
Conversation
cc @molbap
molbap
left a comment
Hi and thanks for the PR! I think this is a work in progress? If so, feel free to put the PR in draft mode or name it [WIP], and ping me again when you want another review! I think you correctly identified this bug :) When you're done, you can run make fixup to run the linter and make the code-quality check happy in the CI.
Hi, thanks for the reply. I was waiting for confirmation on whether this was indeed a bug or intended behaviour. I will fix the lint and other errors.
In #33974, @ArthurZucker mentioned the messy initialisation of the SwinLayer class; I would rather not touch it in this PR. Personally I think the initialisation looks good, but if you think we should make it simpler, I'm happy to tackle it in another PR.
This comment was marked as outdated.
The CI is green, yay! I guess the PR is now ready for review @molbap. By the way, I really loved the CI/CD infra: tests run fast and the linting tools work amazingly well! I also recently watched @ArthurZucker's PyTorch conference talk, and now I really understand the pain points mentioned in the video. Thank you for maintaining such a high-impact library!
molbap
left a comment
Thanks, it's cleaner! Left a couple of comments, let me know what you think.
input_resolution=input_resolution,
num_heads=num_heads,
shift_size=0 if (i % 2 == 0) else config.window_size // 2,
drop_path_rate=drop_path[i],
Nice fix - and aligned with Hiera & FocalNet, which also have a varying drop_path per layer, IIRC.
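To illustrate the per-layer indexing discussed here, a minimal sketch (illustrative names only, not the exact SwinStage code): each block in a stage reads its own rate from a per-block `drop_path` schedule, and every odd block uses a shifted window.

```python
# Illustrative sketch (not the exact SwinStage code): each block in a stage
# reads its own rate from a per-block drop_path schedule, and every odd
# block uses a shifted window.
window_size = 7                       # hypothetical config.window_size
drop_path = [0.0, 0.033, 0.066, 0.1]  # hypothetical per-block schedule

blocks = []
for i, rate in enumerate(drop_path):
    blocks.append(
        {
            "shift_size": 0 if (i % 2 == 0) else window_size // 2,
            "drop_path_rate": rate,   # varies per block, not constant
        }
    )
```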
molbap
left a comment
LGTM - revamp of layers init to depend on (config, layer_idx) TBD in a follow-up PR! cc @ArthurZucker for final review
ArthurZucker
left a comment
Nice and simple! Thanks 🤗
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
* potential bug fix for drop path
* variable name change
* forgot to rename the variables
* back to original
* modify dpr properly
* check_copies auto fix
* corresponding swin2 changes
* auto fix
* linting
* default value for drop_path_rate as 0.0
* Update src/transformers/models/glm/modeling_glm.py
* maskformer fix
* ruff format
* changes made to tf code as well
* lint

---------

Co-authored-by: abhijit deo <167164474+deo-abhijit@users.noreply.github.com>
What does this PR do?
This PR fixes #33974 .
As I mentioned in the issue, I feel that the Swin Transformer implementation of stochastic depth decay is incorrect.
According to the official implementation, drop_prob is different for every SwinLayer:
https://github.com/microsoft/Swin-Transformer/blob/f82860bfb5225915aca09c3227159ee9e1df874d/models/swin_transformer.py#L544
https://github.com/microsoft/Swin-Transformer/blob/f82860bfb5225915aca09c3227159ee9e1df874d/models/swin_transformer.py#L558
https://github.com/microsoft/Swin-Transformer/blob/main/models/swin_transformer.py#L397-L408
But in transformers, we were using a constant value picked from the config file. I feel the implementations in transformers should stay closer to the official ones. This also applies to the SwinV2 model (and maybe swin2sr as well).
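For reference, a minimal sketch of the stochastic depth decay rule applied in the official code (function and variable names here are illustrative; the official implementation computes the same schedule with torch.linspace(0, drop_path_rate, sum(depths))):

```python
def stochastic_depth_schedule(drop_path_rate, depths):
    """Linearly increase the drop-path rate from 0 to drop_path_rate across
    all blocks, then split the flat schedule per stage (illustrative sketch)."""
    total = sum(depths)
    # Equivalent to torch.linspace(0, drop_path_rate, total)
    rates = [drop_path_rate * i / max(total - 1, 1) for i in range(total)]
    per_stage, start = [], 0
    for depth in depths:
        per_stage.append(rates[start : start + depth])
        start += depth
    return per_stage

# Swin-T-style config: depths (2, 2, 6, 2), drop_path_rate 0.1
schedule = stochastic_depth_schedule(0.1, [2, 2, 6, 2])
```

The key point of the fix: each block gets its own rate from this schedule instead of every layer reusing the single constant from the config.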
Please do look into this and let me know. I have also changed the variable names. I am very bad at naming, so any suggestions for the argument names are welcome 😄
Fixes: `drop_path` argument for SwinStage class is unused. #33974
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@amyeroberts, @qubvel, @ArthurZucker