Split the graphs to run with flash_attention on 1x #75

Merged
3 commits merged into HabanaAI:habana-main from kalyanjk:decoder_mark_step
Mar 4, 2024

Commits

Commits on Feb 26, 2024