Skip to content

[ROCm][DSv3.2] Fix FP8 cast in indexer_k_quant_and_cache_triton

0e46a66
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Closed

[ROCm][DSv3.2] Fix FP8 cast in indexer_k_quant_and_cache_triton (top-K accuracy regression for ctx>2048) #2

[ROCm][DSv3.2] Fix FP8 cast in indexer_k_quant_and_cache_triton
0e46a66
Select commit
Loading
Failed to load commit list.