Skip to content

Fixed Gemma FP8 flash_attention lower throughput issue#1510

Merged
regisss merged 1 commit into
huggingface:mainfrom
kplau1128:fix_gemma_fp8_flash_attn_low_tp
Nov 26, 2024
Merged

Fixed Gemma FP8 flash_attention lower throughput issue#1510
regisss merged 1 commit into
huggingface:mainfrom
kplau1128:fix_gemma_fp8_flash_attn_low_tp

Commits

Commits on Nov 26, 2024