Skip to content

Falcon optimization:#229

Merged
ssarkar2 merged 2 commits into
habana-mainfrom
falcon_opt
May 29, 2024
Merged

Falcon optimization:#229
ssarkar2 merged 2 commits into
habana-mainfrom
falcon_opt

Conversation

@libinta
Copy link
Copy Markdown
Collaborator

@libinta libinta commented May 29, 2024

add use_flash_attentiong, flash_attention_recompute, flash_attention_causal_mask add mark step per decoder
add fusedsdpa fp8
fix memory issue

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

add use_flash_attentiong, flash_attention_recompute, flash_attention_causal_mask
add mark step per decoder
add fusedsdpa fp8
fix memory issue
@libinta libinta requested a review from mandy-li as a code owner May 29, 2024 15:06
@libinta libinta requested a review from a user May 29, 2024 15:06
@ssarkar2 ssarkar2 merged commit 874129a into habana-main May 29, 2024
@astachowiczhabana
Copy link
Copy Markdown

huggingface#974

astachowiczhabana pushed a commit that referenced this pull request Jun 13, 2024
astachowiczhabana added a commit that referenced this pull request Apr 15, 2025
DeepSpeed module doesn't handle keyword arguments and throws
a RuntimeError when OH calls it with kwargs. This is a
workaround for this issue in Llama.

Signed-off-by: Urszula Golowicz <urszula.golowicz@intel.com>
Co-authored-by: Adam Stachowicz <105052242+astachowiczhabana@users.noreply.github.com>
gplutop7 pushed a commit that referenced this pull request Apr 22, 2025
DeepSpeed module doesn't handle keyword arguments and throws
a RuntimeError when OH calls it with kwargs. This is a
workaround for this issue in Llama.

Signed-off-by: Urszula Golowicz <urszula.golowicz@intel.com>
Co-authored-by: Adam Stachowicz <105052242+astachowiczhabana@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants