[`InstructBlip`] Add accelerate support for instructblip by younesbelkada · Pull Request #24488 · huggingface/transformers

younesbelkada · 2023-06-26T09:50:41Z

What does this PR do?

As per the title, this lets users benefit from 8-bit / 4-bit loading of InstructBLIP models.

All accelerate tests pass for this model.

As a side note, since InstructBLIP relies on flan-t5 as the backbone for some models, it is important to add

_keep_in_fp32_modules = ["wo"]

To ensure inference stability in fp16 / int8 / fp4
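For readers who want to try the quantized loading described above, here is a minimal sketch. The helper name `load_instructblip_quantized` is hypothetical, and the default checkpoint `Salesforce/instructblip-flan-t5-xl` is an assumption chosen for illustration; it also assumes `bitsandbytes` and `accelerate` are installed and a CUDA GPU is available.

```python
def load_instructblip_quantized(
    checkpoint: str = "Salesforce/instructblip-flan-t5-xl",  # assumed checkpoint
    four_bit: bool = False,
):
    """Load an InstructBLIP checkpoint in 8-bit (default) or 4-bit precision."""
    # Imported lazily so that defining this helper does not require the
    # heavy dependencies to be installed.
    from transformers import (
        BitsAndBytesConfig,
        InstructBlipForConditionalGeneration,
        InstructBlipProcessor,
    )

    if four_bit:
        quantization_config = BitsAndBytesConfig(load_in_4bit=True)
    else:
        quantization_config = BitsAndBytesConfig(load_in_8bit=True)

    # device_map="auto" lets accelerate place the submodules on the
    # available devices.
    model = InstructBlipForConditionalGeneration.from_pretrained(
        checkpoint,
        quantization_config=quantization_config,
        device_map="auto",
    )
    processor = InstructBlipProcessor.from_pretrained(checkpoint)
    return model, processor
```

Calling `load_instructblip_quantized(four_bit=True)` would request 4-bit weights instead; nothing is downloaded until the function is actually called.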

HuggingFaceDocBuilderDev · 2023-06-26T10:08:00Z

The documentation is not available anymore as the PR was closed or merged.

NielsRogge · 2023-06-26T11:19:33Z

Thank you :) could you add an integration test?

younesbelkada · 2023-06-26T12:13:21Z

Hey @NielsRogge !
Let's maybe add it together with: https://github.com/huggingface/transformers/pull/24490/files#r1242095062 so we should probably merge this first :D

sgugger · 2023-06-26T12:14:39Z

+    _no_split_modules = [
+        "InstructBlipAttention",
+        "T5Block",
+        "OPTDecoderLayer",
+        "InstructBlipQFormerMultiHeadAttention",
+    ]


I think this should be adapted dynamically depending on the models that are actually loaded, to be more future proof.

Sure, makes sense

Done. As a side note, from what I can see InstructBlip relies on T5 or Llama only, so I actually made a mistake by including OPTDecoderLayer. This is also fixed.

sgugger · 2023-06-26T12:31:36Z

+
        if config.use_decoder_only_language_model:
            language_model = AutoModelForCausalLM.from_config(config.text_config)
+            self._no_split_modules.append("LlamaDecoderLayer")


No, this should use language_model._no_split_modules here.

I think language_model._no_split_modules should already contain LlamaDecoderLayer. The purpose of adding it to self._no_split_modules is that from_pretrained never looks at the child modules that define that attribute: https://github.com/huggingface/transformers/blob/195a9e5bdb1faa58cd58b47a23a47734d2b90d8c/src/transformers/modeling_utils.py#L2807C13-L2807C29. Only the top-level list is read, and it then gets passed to accelerate this way:

transformers/src/transformers/modeling_utils.py

Line 2814 in 195a9e5

kwargs = {"no_split_module_classes": no_split_modules}

right? So not sure we should add it to language_model. What do you think?

I don't understand your comment. The code loads any model from the Hub (granted, it should be Llama models in the actual checkpoints), but you add the Llama-specific no-split blocks. This will stop working if someone adds an instructblip-2 model that loads a different language model.

Maybe a cleaner implementation would be to add a utility method, called in post_init, that looks at all child modules that define _no_split_modules and dynamically appends them to self._no_split_modules.

If you think that's better, happy to change the PR to add these changes

Ah I see yes, sorry I misunderstood your comment, will update that now
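The idea discussed in this thread, collecting `_no_split_modules` from child modules rather than hard-coding a block name, can be sketched without any real model classes. Everything below is a toy stand-in: `ToyModule`, `ToyLlama`, `ToyT5`, `ToyInstructBlip`, and `collect_no_split_modules` are hypothetical names used for illustration and are not the transformers implementation.

```python
from typing import Iterator, List, Optional


class ToyModule:
    """Minimal stand-in for a model class (not a real transformers class)."""

    # Class-level declaration, mirroring how models declare the attribute.
    _no_split_modules: Optional[List[str]] = None

    def __init__(self, **children: "ToyModule") -> None:
        self._children = dict(children)

    def modules(self) -> Iterator["ToyModule"]:
        """Yield this module and all of its descendants, depth first."""
        yield self
        for child in self._children.values():
            yield from child.modules()


def collect_no_split_modules(model: ToyModule) -> List[str]:
    """Gather `_no_split_modules` from the model and every child, deduplicated."""
    collected: List[str] = []
    for module in model.modules():
        # Treat a missing or None attribute as an empty list.
        for name in getattr(module, "_no_split_modules", None) or []:
            if name not in collected:
                collected.append(name)
    return collected


class ToyLlama(ToyModule):
    _no_split_modules = ["LlamaDecoderLayer"]


class ToyT5(ToyModule):
    _no_split_modules = ["T5Block"]


class ToyInstructBlip(ToyModule):
    _no_split_modules = [
        "InstructBlipAttention",
        "InstructBlipQFormerMultiHeadAttention",
    ]


# The wrapper never names the language model's block class itself.
model = ToyInstructBlip(language_model=ToyLlama())
no_split = collect_no_split_modules(model)
```

Because the wrapper only reads what each child declares, swapping `ToyLlama()` for `ToyT5()` changes the result without touching the wrapper, which is the future-proofing asked for in the review.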

sgugger · 2023-06-26T12:31:40Z

+            self._no_split_modules.append("LlamaDecoderLayer")
        else:
            language_model = AutoModelForSeq2SeqLM.from_config(config.text_config)
+            self._no_split_modules.append("T5Block")


Same there.

sgugger · 2023-06-26T13:57:01Z

        r"language_model.decoder.embed_tokens.weight",
        r"language_model.lm_head.weight",
    ]
+    _keep_in_fp32_modules = ["wo"]


Last comment: this is the same issue here (this comes from T5, if I'm not mistaken). This key should also be built dynamically.

Makes sense, just updated it!
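As a rough illustration of building `_keep_in_fp32_modules` dynamically, the sketch below extends a per-instance list from whatever the loaded language model declares. The classes are toy stand-ins with hypothetical names, not the code merged in this PR. One assumption worth flagging: the list is copied before being extended, since extending a class-level list in place would leak entries into every other instance.

```python
from typing import List, Optional


class ToyT5:
    # T5-style models ask for the "wo" projection to stay in fp32.
    _keep_in_fp32_modules: Optional[List[str]] = ["wo"]


class ToyLlama:
    # Some language models declare nothing to keep in fp32.
    _keep_in_fp32_modules: Optional[List[str]] = None


class ToyInstructBlip:
    # The wrapper has no fp32 requirement of its own.
    _keep_in_fp32_modules: List[str] = []

    def __init__(self, language_model) -> None:
        # Copy first so the class-level list is never mutated.
        self._keep_in_fp32_modules = list(type(self)._keep_in_fp32_modules)
        inherited = getattr(language_model, "_keep_in_fp32_modules", None)
        if inherited is not None:
            self._keep_in_fp32_modules.extend(inherited)
        self.language_model = language_model


with_t5 = ToyInstructBlip(ToyT5())
with_llama = ToyInstructBlip(ToyLlama())
```

With this shape, a T5-backed instance inherits `"wo"` while a Llama-backed one inherits nothing, and neither affects the other.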

younesbelkada added 2 commits June 26, 2023 09:49

add accelerate support for instructblip

7ee03df

add _keep_in_fp32_modules

838db9b

younesbelkada mentioned this pull request Jun 26, 2023

Add InstructBLIP #23460

Merged

5 tasks

younesbelkada requested review from amyeroberts and sgugger and removed request for amyeroberts June 26, 2023 09:54

sgugger reviewed Jun 26, 2023


dynamically adapt _no_split_modules

88ccd18

younesbelkada requested a review from sgugger June 26, 2023 12:25

sgugger reviewed Jun 26, 2023


better fix

802b536

younesbelkada requested a review from sgugger June 26, 2023 12:42

sgugger reviewed Jun 26, 2023


same logic for _keep_in_fp32_modules

1f4fd81

younesbelkada requested a review from sgugger June 26, 2023 14:22

sgugger approved these changes Jun 26, 2023


younesbelkada merged commit 9895670 into huggingface:main Jun 26, 2023

younesbelkada deleted the add-instruct-blip-4bit branch June 26, 2023 16:36
