Conversation

@NielsRogge (Contributor) commented Feb 3, 2023

What does this PR do?

This PR adds BLIP-2 to the library.

To do:

  • make sure generation works exactly as in the original implementation (maybe @gante can have a look here, based on the original code here). Edit: seems to be solved by properly setting the eos_token_id! (see the sketch after this list)
  • add more tests for BLIP-2 with AutoModelForSeq2SeqLM once the design gets approved
  • transfer checkpoints, update integration tests
  • make it possible to instantiate Blip2Config with config objects, rather than dicts (also check default text config) - will be done in a separate PR
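
For the first item, a minimal sketch of what "properly setting the eos_token_id" means at generation time. This is illustrative only, not the PR's actual fix: the checkpoint name and image URL are assumptions, and the PR may set the token id in the generation config rather than per call.

```python
# Hedged sketch: making generate() stop where the original BLIP-2
# implementation does by passing the tokenizer's EOS token id explicitly.
import requests
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

checkpoint = "Salesforce/blip2-opt-2.7b"  # illustrative checkpoint name
processor = Blip2Processor.from_pretrained(checkpoint)
model = Blip2ForConditionalGeneration.from_pretrained(checkpoint)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(images=image, return_tensors="pt")

# Without the right eos_token_id, generation can run past the point where
# the original implementation stops.
out = model.generate(**inputs, eos_token_id=processor.tokenizer.eos_token_id)
print(processor.batch_decode(out, skip_special_tokens=True))
```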

cc @younesbelkada

@HuggingFaceDocBuilderDev commented Feb 3, 2023

The documentation is not available anymore as the PR was closed or merged.

NielsRogge requested a review from sgugger February 6, 2023 13:53
NielsRogge marked this pull request as ready for review February 6, 2023 13:59
@sgugger (Collaborator) left a comment

Thanks for working on this. The general design with the auto-model for the LM works for me, since BLIP-2 supports multiple LMs.

@NielsRogge (Contributor, Author)

@sgugger all comments are addressed, feel free to approve :)

@sgugger (Collaborator) left a comment

Thanks again for all your work on this!

@younesbelkada (Contributor) left a comment

Thanks a lot for this great addition!
accelerate support has been added in NielsRogge#54. generate + BLIP-2 with accelerate currently has some issues in the multi-GPU setting; let's address those in a follow-up PR and merge this one to at least unlock int8 loading of BLIP-2 models (for instance, nielsr/blip2-flan-t5-xl can be loaded in a Google Colab; I managed to run it on an NVIDIA T4 16GB).
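
To make the int8 path concrete, a minimal sketch, assuming bitsandbytes and accelerate are installed; the checkpoint was under nielsr/ at the time of this comment and later moved to the Salesforce org, and the prompt and image URL are illustrative:

```python
# Hedged sketch of int8 loading with bitsandbytes (load_in_8bit) and
# accelerate (device_map="auto"); this is what fits on a single 16GB T4.
import requests
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

checkpoint = "Salesforce/blip2-flan-t5-xl"
processor = Blip2Processor.from_pretrained(checkpoint)
model = Blip2ForConditionalGeneration.from_pretrained(
    checkpoint,
    load_in_8bit=True,   # quantize linear layers to int8 via bitsandbytes
    device_map="auto",   # let accelerate place the weights
)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
prompt = "Question: how many cats are there? Answer:"
inputs = processor(images=image, text=prompt, return_tensors="pt").to("cuda:0")

out = model.generate(**inputs)
print(processor.batch_decode(out, skip_special_tokens=True))
```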

NielsRogge merged commit d7f1e7c into huggingface:main Feb 9, 2023
@vmkp commented Feb 12, 2023

@NielsRogge Curious, what is the timeline for this to make it into a stable release version?

@NielsRogge (Contributor, Author)

Usually there's a Transformers release once every 1 to 2 months, so it should make it into a stable release in March at the latest.

@sachit-menon commented Feb 17, 2023

Hi, thanks for the great work! I'm running into problems trying to use this in the multi-GPU setting, which @younesbelkada mentioned earlier -- is there an issue to follow for that? Specifically, at line 2765 of transformers/generation/utils.py the devices don't match ("Expected all tensors to be on the same device, but found at least two devices, cuda:3 and cuda:0!"), because beam_scores is on cuda:0 while next_token_scores and next_token_scores_processed are on cuda:3 after loading with device_map="auto".

I'm also getting a weirder error: a CUDA illegal memory access for any model used downstream of it on GPU 0, even when BLIP-2 is given no GPU memory on GPU 0 in max_memory. (This doesn't occur with the original BLIP-2 implementation, which I'm trying to migrate from.)
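
For reference, a hedged sketch of the kind of setup that triggers the device mismatch described above (assumes a multi-GPU machine; the checkpoint name and image URL are illustrative):

```python
# Hedged sketch: sharding BLIP-2 across several GPUs with device_map="auto"
# and running beam search, which is where the beam_scores device mismatch
# reported above shows up.
import requests
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

checkpoint = "Salesforce/blip2-opt-2.7b"  # illustrative
processor = Blip2Processor.from_pretrained(checkpoint)
model = Blip2ForConditionalGeneration.from_pretrained(
    checkpoint,
    device_map="auto",      # shards the model across all visible GPUs
    torch_dtype=torch.float16,
)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(images=image, return_tensors="pt").to("cuda:0", torch.float16)

# num_beams > 1 exercises the beam-search path in generation/utils.py where
# beam_scores lives on cuda:0 while the sharded LM's scores may not.
out = model.generate(**inputs, num_beams=5)
print(processor.batch_decode(out, skip_special_tokens=True))
```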

@xszheng2020

Same problem here @sachit-menon:
"Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0!"
bitsandbytes-foundation/bitsandbytes#153

@younesbelkada (Contributor) commented Feb 20, 2023

Hi @sachit-menon @xszheng2020
This is a known issue on my end; I can confirm it should at least be fixed for blip2-opt in #21707.
Can you try checking out that branch and let us know on the PR whether the fix works? Thanks! (One standard way to check out a PR branch is sketched below.)
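
For anyone unfamiliar with testing a PR branch, a sketch of the usual git workflow for fetching a GitHub pull request locally (the local branch name here is arbitrary, not anything from the PR):

```bash
# Fetch the PR's head ref from GitHub into a local branch and install from it.
git clone https://github.com/huggingface/transformers.git
cd transformers
git fetch origin pull/21707/head:blip2-multigpu-fix   # local name is arbitrary
git checkout blip2-multigpu-fix
pip install -e .
```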

@xszheng2020

Hi @younesbelkada, thanks! I will test it on blip2-opt to see whether it works, and hope blip2-flan-t5 can be fixed soon too.
