Commit 14d8717
[fix] xqa precision for fp16/bf16 kv cache (NVIDIA#6573)
Signed-off-by: Bruce-Lee-LY <[email protected]>
Co-authored-by: Bruce-Lee-LY <[email protected]>
Signed-off-by: Lanyu Liao <[email protected]>
1 parent c3fd9a0 commit 14d8717
1 file changed: +1 -1 lines changed

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 2734 | 2734 | |
| 2735 | 2735 | |
| 2736 | 2736 | |
| 2737 | | - |
| | 2737 | + |
| 2738 | 2738 | |
| 2739 | 2739 | |
| 2740 | 2740 | |