Add BridgeTower model #20775

abhiwand · 2022-12-15T03:52:04Z

What does this PR do?

This PR implements a HuggingFace Transformers version of BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning from the paper https://arxiv.org/abs/2206.08657.pdf

This paper has been accepted to https://aaai.org/Conferences/AAAI-23/

The model's pre-trained checkpoints and configurations have been released here:
https://huggingface.co/BridgeTower under:

The following heads have been implemented:

BridgeTowerForMaskedLM
BridgeTowerForImageAndTextRetrieval

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

@amyeroberts @NielsRogge @ArthurZucker could you please assist with review and feedback.

@philschmid

…r-huggingface into btmodel

…to btmodel

…r-huggingface into btmodel

Add bridgetower.pr review8

Fixes for BridgeTowerVisionEmbeddings

src/transformers/models/bridgetower/modeling_bridgetower.py

NielsRogge

Thanks a lot for working on this and addressing all comments!

There are still 2 comments which seem to be unaddressed, after that good for me to merge.

tileintel · 2023-01-25T10:58:31Z

@NielsRogge Our PR keeps failing at tests/pipelines/test_pipelines_automatic_speech_recognition.py::AutomaticSpeechRecognitionPipelineTests::test_return_timestamps_in_preprocess. Would you please help to see if it is because of BridgeTower or because of something else?
Thanks a lot

Synchronize

…add_bridgetower_model

Synchronize with HF

…add_bridgetower_model

amyeroberts · 2023-01-25T14:57:56Z

@abhiwand @tileintel Thanks for address all of the comments! On Monday there were two PRs merged into main which added test_image_processing_common.py (#20785) and updated the feature extractor references in the test_image_processing_xxx.py files (#20768). Could you update test_image_processing_bridgetower.py to reflect these please?

…cessing_common.py

tileintel · 2023-01-25T18:02:00Z

@amyeroberts We have updated test_image_processing_bridgetower.py as you suggested. Thanks for the suggestion.
@NielsRogge @amyeroberts @sgugger We have addressed all of the comments. Thanks a lot for helping us to review and approve this. We are very looking forward to having this PR merged into main soon.

sgugger · 2023-01-25T19:04:42Z

Thanks again for your contribution!

tileintel · 2023-01-25T19:15:13Z

@sgugger Thank you for merging this PR. May I ask when BridgeTower model will go to HuggingFace's production and what release is that?
Thanks

sgugger · 2023-01-25T19:44:57Z

The next release will be in a month roughly (given the fast last release was yesterday).

tileintel · 2023-01-25T19:47:04Z

Thank @sgugger for letting us know.

abhiwand and others added 30 commits November 23, 2022 12:42

Commit with BTModel and latest HF code

3918f1c

Placeholder classes for BTForMLM and BTForITR

46c3869

Importing Bert classes from transformers

50b54fa

Removed objectives.py and dist_utils.py

3615f70

Removed swin_transformer.py

7470286

Add image normalization, BridgeTowerForImageAndTextRetrieval

b261834

Add center_crop

be7e37f

Removing bert tokenizer and LCI references

5eb8853

Tested config loading from HF transformers hub

6891d8c

Removed state_dict updates and added path to hub

4d0c5df

Enable center crop

8ae887b

Getting image_size from config, renaming num_heads and num_layers

037dfa4

Handling max_length in BridgeTowerProcessor

bf298bd

Add BridgeTowerForMaskedLM

83c551e

Merge branch 'btmodel' of https://github.com/intel-sandbox/bridgetowe…

42c9431

…r-huggingface into btmodel

Add doc string for BridgeTowerConfig

a0271f4

Merge branch 'btmodel' of https://github.com/intel-sandbox/bridgetowe…

3cefae0

…r-huggingface into btmodel

Add doc strings for BT config, processor, image processor

f15a6ee

Adding docs, removed swin

c392282

Merge branch 'main' of https://github.com/huggingface/transformers in…

fa46a84

…to btmodel

Removed convert_bridgetower_original_to_pytorch.py

ceaa586

Added doc files for bridgetower, removed is_vision

383aa0f

Add support attention_mask=None and BridgeTowerModelOutput

d934318

Fix formatting

ffaa351

Fixes with 'make style', 'make quality', 'make fixup'

41e29f3

Remove downstream tasks from BridgeTowerModel

9ffdb5e

Merge branch 'btmodel' of https://github.com/intel-sandbox/bridgetowe…

2249cbe

…r-huggingface into btmodel

Merge branch 'btmodel' of https://github.com/intel-sandbox/bridgetowe…

31f47ce

…r-huggingface into btmodel

Formatting fixes, add return_dict to BT models

e9386af

Clean up after doc_test

0e427e2

abhiwand and others added 8 commits January 24, 2023 17:11

Code cleanup

09457d4

Merge pull request #17 from abhiwand/add_bridgetower.pr_review8

1b12f96

Add bridgetower.pr review8

Fixes for BridgeTowerVisionEmbeddings

74c09e7

Merge pull request #18 from abhiwand/add_bridgetower.pr_review8

628a9cd

Fixes for BridgeTowerVisionEmbeddings

style checks

f6b7ecc

Merge pull request #18 from abhiwand/add_bridgetower.pr_review8

81a4e45

Fixes for BridgeTowerVisionEmbeddings

re-tests

a530218

fix embedding

2030a9a

NielsRogge reviewed Jan 25, 2023

View reviewed changes

src/transformers/models/bridgetower/modeling_bridgetower.py Show resolved Hide resolved

NielsRogge approved these changes Jan 25, 2023

View reviewed changes

tileintel added 2 commits January 25, 2023 02:19

address comment on init file

34c90ec

retrigger tests

a4b91ab

tileintel and others added 5 commits January 25, 2023 03:22

Merge pull request #19 from huggingface/main

4324b24

Synchronize

Merge branch 'main' of https://github.com/abhiwand/transformers into …

22db8a6

…add_bridgetower_model

update import prepare_image_inputs

5ca4e77

Merge pull request #20 from huggingface/main

7031f84

Synchronize with HF

Merge branch 'main' of https://github.com/abhiwand/transformers into …

1a234d1

…add_bridgetower_model

update test_image_processing_bridgetower.py to reflect test_image_pro…

59c3ecb

…cessing_common.py

retrigger tests

885e456

sgugger merged commit 3a6e4a2 into huggingface:main Jan 25, 2023

ydshieh mentioned this pull request Mar 10, 2023

Add a new script to check model testers' config #22063

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add BridgeTower model #20775

Add BridgeTower model #20775

Uh oh!

abhiwand commented Dec 15, 2022 •

edited

Loading

Uh oh!

Uh oh!

NielsRogge left a comment

Uh oh!

tileintel commented Jan 25, 2023

Uh oh!

amyeroberts commented Jan 25, 2023

Uh oh!

tileintel commented Jan 25, 2023

Uh oh!

sgugger commented Jan 25, 2023

Uh oh!

tileintel commented Jan 25, 2023 •

edited

Loading

Uh oh!

sgugger commented Jan 25, 2023

Uh oh!

tileintel commented Jan 25, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

Add BridgeTower model #20775

Add BridgeTower model #20775

Uh oh!

Conversation

abhiwand commented Dec 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Uh oh!

Uh oh!

NielsRogge left a comment

Choose a reason for hiding this comment

Uh oh!

tileintel commented Jan 25, 2023

Uh oh!

amyeroberts commented Jan 25, 2023

Uh oh!

tileintel commented Jan 25, 2023

Uh oh!

sgugger commented Jan 25, 2023

Uh oh!

tileintel commented Jan 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sgugger commented Jan 25, 2023

Uh oh!

tileintel commented Jan 25, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

abhiwand commented Dec 15, 2022 •

edited

Loading

tileintel commented Jan 25, 2023 •

edited

Loading