Disable SDPA Attention for gpt-bigcode model by jiminha · Pull Request #78 · HabanaAI/optimum-habana-fork

jiminha · 2024-02-27T02:55:05Z

What does this PR do?

Disable SDPA attention until gpt-bigcode model support FusedSDPA attention.

vivekgoe

@jiminha what issue is this PR fixing?

jiminha · 2024-02-27T16:50:59Z

@jiminha what issue is this PR fixing?
This is fixing the issue SW-176426(TypeError: GPTBigCodeSdpaAttention.forward() got an unexpected keyword argument 'token_idx'). Transformer4.37.2/PT2.2 upgrade enabled sdpd attention for the gpt-bigcode model. This is temporary fix so the model can run with original attention layer without an error until FusedSDPA is enabled for this model.

vivekgoe · 2024-02-28T03:34:42Z

@jiminha It is ok to disable SDPA attention if it is causing problems. But I see a problem with using "_use_sdpa" flag here

optimum-habana-fork/optimum/habana/transformers/models/gpt_bigcode/modeling_gpt_bigcode.py

Line 238 in 04b0709

if self._use_sdpa and head_mask is None and not output_attentions:

This creates confusion because "_use_sdpa" is used in transformers to refer to torch SDPA which we should differentiate from habana SDPA. We already fixed 1 problem related to this for Llama in this PR #73.

vivekgoe

Discussed with @libinta LGTM

vivekgoe · 2024-03-02T03:29:28Z

Looks good to me. @libinta please go ahead and merge.

* Disable SDPA Attention for gpt-bigcode model * Update argument to take "attn_implementation"

astachowiczhabana · 2024-06-07T14:22:23Z

huggingface#771

Disable SDPA Attention for gpt-bigcode model

ef9abea

jiminha requested a review from libinta February 27, 2024 02:55

vivekgoe reviewed Feb 27, 2024

View reviewed changes

vivekgoe approved these changes Feb 28, 2024

View reviewed changes

Update argument to take "attn_implementation"

9ac19b0

libinta approved these changes Mar 5, 2024

View reviewed changes

libinta merged commit 1cd773d into habana-main Mar 5, 2024

astachowiczhabana pushed a commit that referenced this pull request Apr 5, 2024

Disable SDPA Attention for gpt-bigcode model (#78)

8671767

* Disable SDPA Attention for gpt-bigcode model * Update argument to take "attn_implementation"

astachowiczhabana pushed a commit that referenced this pull request Apr 5, 2024

Disable SDPA Attention for gpt-bigcode model (#78)

13de449

* Disable SDPA Attention for gpt-bigcode model * Update argument to take "attn_implementation"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disable SDPA Attention for gpt-bigcode model#78

Disable SDPA Attention for gpt-bigcode model#78
libinta merged 2 commits into
habana-mainfrom
jha/bigcodegpt

jiminha commented Feb 27, 2024

Uh oh!

vivekgoe left a comment

Uh oh!

jiminha commented Feb 27, 2024 •

edited

Loading

Uh oh!

vivekgoe commented Feb 28, 2024

Uh oh!

vivekgoe left a comment •

edited

Loading

Uh oh!

vivekgoe commented Mar 2, 2024

Uh oh!

astachowiczhabana commented Jun 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

jiminha commented Feb 27, 2024

What does this PR do?

Uh oh!

vivekgoe left a comment

Choose a reason for hiding this comment

Uh oh!

jiminha commented Feb 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vivekgoe commented Feb 28, 2024

Uh oh!

vivekgoe left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vivekgoe commented Mar 2, 2024

Uh oh!

astachowiczhabana commented Jun 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jiminha commented Feb 27, 2024 •

edited

Loading

vivekgoe left a comment •

edited

Loading