Skip to content

[ROCm][DSv3.2] Fix FP8 cast in indexer_k_quant_and_cache_triton

8319524
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Closed

[ROCm][DSv3.2] Fix FP8 cast in indexer_k_quant_and_cache_triton (top-K accuracy regression for ctx>2048) #1

[ROCm][DSv3.2] Fix FP8 cast in indexer_k_quant_and_cache_triton
8319524
Select commit
Loading
Failed to load commit list.