move img_mask@get_attn_mask() to hpu by hsubramony · Pull Request #102 · HabanaAI/optimum-habana-fork

hsubramony · 2024-03-13T03:20:23Z

In certain 1x systems , we notice a drop in performance for swin models as get_attn_mask gets executed in cpu instead of hpu. This was seen after pytorch upgrade to 2.2
This fix allows img_mask in get_attn_mask() to be moved back hpu.

libinta · 2024-03-13T04:54:37Z

    "t5",
    "mistral",
    "mixtral",
+    "swin",


no need to add this

libinta · 2024-03-13T04:55:29Z

@@ -0,0 +1,50 @@
+# coding=utf-8
+# Copyright 2022 The HuggingFace Inc. team.
+# Copyright (c) 2022, NVIDIA CORPORATION.  All rights reserved.


should we have Hugginface only?

libinta · 2024-03-13T04:56:04Z

+def gaudi_swin_get_attn_mask(self, height, width, dtype):
+    if self.shift_size > 0:
+        # calculate attention mask for SW-MSA
+        img_mask = torch.zeros((1, height, width, 1), dtype=dtype, device='hpu')


can you check if there is self.device?

didnt find self.device

astachowiczhabana · 2024-06-07T14:16:10Z

huggingface#795

move img_mask@get_attn_mask() to hpu

ff25e01

hsubramony requested review from bhargaveede, ssarkar2 and vivekgoe as code owners March 13, 2024 03:20

hsubramony requested a review from libinta March 13, 2024 03:20

libinta reviewed Mar 13, 2024

View reviewed changes

review updates

2c0bbb0

libinta approved these changes Mar 14, 2024

View reviewed changes

libinta merged commit ae7fc93 into habana-main Mar 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

move img_mask@get_attn_mask() to hpu#102

move img_mask@get_attn_mask() to hpu#102
libinta merged 2 commits into
habana-mainfrom
swin_attn_mask

hsubramony commented Mar 13, 2024 •

edited

Loading

Uh oh!

libinta Mar 13, 2024

Uh oh!

libinta Mar 13, 2024

Uh oh!

libinta Mar 13, 2024

Uh oh!

hsubramony Mar 13, 2024

Uh oh!

astachowiczhabana commented Jun 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hsubramony commented Mar 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

libinta Mar 13, 2024

Choose a reason for hiding this comment

Uh oh!

libinta Mar 13, 2024

Choose a reason for hiding this comment

Uh oh!

libinta Mar 13, 2024

Choose a reason for hiding this comment

Uh oh!

hsubramony Mar 13, 2024

Choose a reason for hiding this comment

Uh oh!

astachowiczhabana commented Jun 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hsubramony commented Mar 13, 2024 •

edited

Loading