Refactor FP16 softmax buffer size into testable helper; replace expensive regression test#27829
Closed
Copilot wants to merge 3 commits intoedgchen1/fix_attention_softmax_overflowfrom
Closed
Refactor FP16 softmax buffer size into testable helper; replace expensive regression test#27829Copilot wants to merge 3 commits intoedgchen1/fix_attention_softmax_overflowfrom
Copilot wants to merge 3 commits intoedgchen1/fix_attention_softmax_overflowfrom