Skip to content

Conversation

@fxmarty
Copy link
Contributor

@fxmarty fxmarty commented Apr 8, 2024

As per title, fixes #29946

There was a (minor) bug in #26572, where if an user uses a Falcon model with alibi and SDPA, but not passing attention_mask input, the attn_mask argument passed to SDPA would not be causal (as it use to be (see here and here). I likely misunderstood something at the time.

I think this bug was not catched in the CI because Falcon tests do not test Alibi thoroughly.

Copy link
Contributor

@amyeroberts amyeroberts left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for fixing!

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@fxmarty fxmarty merged commit 1897874 into main Apr 8, 2024
@fxmarty fxmarty deleted the fix-falcon-no-mask-sdpa branch April 8, 2024 14:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Falcon output with alibi bias is different output_attentions=True

4 participants