
ms_deform_attn_forward_cuda" not implemented for 'BFloat16 #38

Open
ChinChyi opened this issue Aug 29, 2024 · 7 comments
Labels
bug (Something isn't working) · documentation (Improvements or additions to documentation)

Comments

@ChinChyi

Hello!

This is the error I hit when using grounded_sam2_local_demo.py for image inference.

@rentainhe
Collaborator

Hi @ChinChyi

This is caused by the Deformable Attention operator, whose CUDA kernel does not support BFloat16 inference. We will fix this bug later.
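Until the kernel gains a BFloat16 variant, a common generic workaround (a sketch, not the repo's code; `run_in_fp32` is a hypothetical helper) is to cast the op's floating-point inputs up to float32, run it with autocast disabled, and cast the result back:

```python
import torch

def run_in_fp32(op, *tensors):
    # Hypothetical guard for ops (like the deformable-attention kernel)
    # whose CUDA implementation lacks a BFloat16 variant: upcast
    # floating-point inputs to float32, run outside autocast, then
    # cast the result back to the original dtype.
    casted = [t.float() if t.is_floating_point() else t for t in tensors]
    with torch.autocast(device_type="cuda", enabled=False):
        out = op(*casted)
    return out.to(tensors[0].dtype)
```

The upcast costs some memory and bandwidth but keeps the rest of the model in bfloat16.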

@rentainhe
Collaborator

rentainhe commented Aug 30, 2024

Hi @ChinChyi

Have you changed any code in your local environment? We have fixed this bug in our original implementation by removing the global autocast call from the top of the demo:

# FIXME: figure how does this influence the G-DINO model
torch.autocast(device_type="cuda", dtype=torch.bfloat16).__enter__()

if torch.cuda.get_device_properties(0).major >= 8:
    # turn on tfloat32 for Ampere GPUs (https://pytorch.org/docs/stable/notes/cuda.html#tensorfloat-32-tf32-on-ampere-devices)
    torch.backends.cuda.matmul.allow_tf32 = True
    torch.backends.cudnn.allow_tf32 = True

and entering the bfloat16 autocast context only after running Grounding DINO.

@Khlann

Khlann commented Sep 2, 2024

So, what is the solution? I have also encountered this problem. Thank you very much!

@Khlann

Khlann commented Sep 2, 2024

This error happened when I called self.sam2_predictor.predict twice.

@rentainhe
Collaborator

> This error happened when I called self.sam2_predictor.predict twice.

Would you mind sharing your code with us? That would make it easier for us to debug this issue.

@ChinChyi
Author

ChinChyi commented Sep 3, 2024

> Hi @ChinChyi
>
> Have you changed any code in your local environment? We have fixed this bug in our original implementation by removing the global autocast call from the top of the demo:
>
>     # FIXME: figure how does this influence the G-DINO model
>     torch.autocast(device_type="cuda", dtype=torch.bfloat16).__enter__()
>
>     if torch.cuda.get_device_properties(0).major >= 8:
>         # turn on tfloat32 for Ampere GPUs (https://pytorch.org/docs/stable/notes/cuda.html#tensorfloat-32-tf32-on-ampere-devices)
>         torch.backends.cuda.matmul.allow_tf32 = True
>         torch.backends.cudnn.allow_tf32 = True
>
> and entering the bfloat16 autocast context only after running Grounding DINO.

Thanks

@rentainhe rentainhe added the bug Something isn't working label Sep 3, 2024
@MagdalenaKotynia

@ChinChyi Changing bfloat16 to float16 (i.e. torch.autocast(device_type="cuda", dtype=torch.bfloat16).__enter__() to torch.autocast(device_type="cuda", dtype=torch.float16).__enter__()) worked for me.
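That reported workaround is a one-line substitution in the demo (a sketch of the change only; the float16 path is reported above to work, but whether half precision is numerically adequate for your inputs is worth checking):

```python
import torch

# Before (fails: the deformable-attention kernel has no BFloat16 variant):
# torch.autocast(device_type="cuda", dtype=torch.bfloat16).__enter__()

# After (reported workaround: use float16 instead):
autocast_ctx = torch.autocast(device_type="cuda", dtype=torch.float16)
autocast_ctx.__enter__()  # mirrors the demo's global-enter style
# ... run the demo's inference here ...
autocast_ctx.__exit__(None, None, None)
```

Note that float16 has a narrower exponent range than bfloat16, so overflows to inf are possible where bfloat16 would not overflow.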

@rentainhe rentainhe added the documentation Improvements or additions to documentation label Sep 8, 2024