Encapsulate FSDPA in GaudiLlamaAttention by dudilester · Pull Request #129 · HabanaAI/optimum-habana-fork

dudilester · 2024-03-21T09:43:52Z

Done to allow quantization using HQT
Added use_flash_attention and flash_attention_recompute to run_lm_eval
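
For readers unfamiliar with the pattern, here is a minimal sketch of what "encapsulating" the attention kernel in a module can look like. It assumes, as one plausible reading of the description, that a quantization tool discovers and patches `torch.nn.Module` instances, so a kernel called as a bare function inside `forward` would be invisible to it. `ModuleSDPA`, `ToyAttention`, and `build_parser` are hypothetical names, not the code from this PR. The stock `torch.nn.functional.scaled_dot_product_attention` stands in for the Gaudi fused kernel so the sketch runs on CPU. The two flag names come from the description; treating them as boolean `store_true` switches is an assumption.

```python
import argparse

import torch
import torch.nn.functional as F
from torch import nn


class ModuleSDPA(nn.Module):
    """Hypothetical wrapper that exposes the attention kernel as a submodule."""

    def __init__(self, kernel=F.scaled_dot_product_attention):
        super().__init__()
        self._kernel = kernel  # assumption: CPU stand-in for the Gaudi fused kernel

    def forward(self, query, key, value, attn_mask=None, dropout_p=0.0, is_causal=False):
        return self._kernel(
            query, key, value, attn_mask=attn_mask, dropout_p=dropout_p, is_causal=is_causal
        )


class ToyAttention(nn.Module):
    """Illustrative attention layer; not GaudiLlamaAttention."""

    def __init__(self, hidden_size, num_heads):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = hidden_size // num_heads
        self.qkv_proj = nn.Linear(hidden_size, 3 * hidden_size, bias=False)
        self.o_proj = nn.Linear(hidden_size, hidden_size, bias=False)
        # Registered as a child module, so module-walking tools can find it.
        self.fused_sdpa = ModuleSDPA()

    def forward(self, hidden_states, use_flash_attention=False):
        bsz, seq_len, _ = hidden_states.shape
        qkv = self.qkv_proj(hidden_states).view(bsz, seq_len, 3, self.num_heads, self.head_dim)
        # Each of q, k, v has shape (bsz, num_heads, seq_len, head_dim).
        q, k, v = (t.transpose(1, 2) for t in qkv.unbind(dim=2))
        if use_flash_attention:
            out = self.fused_sdpa(q, k, v, is_causal=True)
        else:
            # Eager causal attention, used as the reference path.
            scores = q @ k.transpose(-2, -1) / self.head_dim**0.5
            causal = torch.ones(seq_len, seq_len, dtype=torch.bool).tril()
            out = scores.masked_fill(~causal, float("-inf")).softmax(dim=-1) @ v
        return self.o_proj(out.transpose(1, 2).reshape(bsz, seq_len, -1))


def build_parser():
    """Hypothetical parser carrying the two flags named in the description."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--use_flash_attention", action="store_true")
    # Parsed only; the source does not describe its effect, so the sketch leaves it unused.
    parser.add_argument("--flash_attention_recompute", action="store_true")
    return parser


args = build_parser().parse_args(["--use_flash_attention"])
attn = ToyAttention(hidden_size=16, num_heads=2)
names = [name for name, _ in attn.named_modules()]  # includes "fused_sdpa"
x = torch.randn(1, 4, 16)
y = attn(x, use_flash_attention=args.use_flash_attention)
```

Because the wrapper is registered as a child module, it appears in `named_modules()` and can be replaced by assigning a different module to `attn.fused_sdpa`, without touching the rest of `forward`.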

Issues were addressed.

astachowiczhabana · 2024-06-12T10:52:46Z

huggingface#972

dudilester · 2024-06-13T11:40:19Z

upstream URL
huggingface#976

dudilester requested review from libinta and mandy-li as code owners March 21, 2024 09:43

dudilester requested review from a user, HolyFalafel, MrGeva, Yantom1, bgoldberg-habana and ulivne and removed request for a user, libinta and mandy-li March 21, 2024 09:43

ulivne previously requested changes Mar 21, 2024

Comment thread optimum/habana/transformers/models/llama/modeling_llama.py Outdated

Comment thread optimum/habana/transformers/models/llama/modeling_llama.py Outdated

Encapsulate FSDPA in GaudiLlamaAttention

d584181

* Done to allow quantization using HQT
* Added use_flash_attention and flash_attention_recompute to run_lm_eval

dudilester force-pushed the dev/dlester/moduleFSDPA branch from 0a09511 to d584181 Compare March 21, 2024 13:12

MrGeva approved these changes Mar 24, 2024

MrGeva merged commit b7e74c1 into habana-main Mar 24, 2024

dudilester added a commit that referenced this pull request Mar 31, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

bae47af

astachowiczhabana pushed a commit that referenced this pull request Apr 5, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

9cf4dff

astachowiczhabana pushed a commit that referenced this pull request Apr 5, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

f5736fa

astachowiczhabana pushed a commit that referenced this pull request Apr 19, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

b79036e

astachowiczhabana pushed a commit that referenced this pull request Apr 22, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

21948d8

astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

1fcf130

astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

08012fc

dudilester added a commit that referenced this pull request May 7, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

ffa9081

dudilester added a commit that referenced this pull request May 8, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

e231aa5

dudilester added a commit that referenced this pull request May 13, 2024

Encapsulate FSDPA in GaudiLlamaAttention (#129)

eb8b435

MrGeva merged 1 commit into habana-main from dev/dlester/moduleFSDPA
