Skip to content

[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger#17331

Merged
gshtras merged 15 commits into
vllm-project:mainfrom
rasmith:rasmith_add_vllm_use_rocm_fp8_scales
Jun 11, 2025
Merged

[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger#17331
gshtras merged 15 commits into
vllm-project:mainfrom
rasmith:rasmith_add_vllm_use_rocm_fp8_scales

check if kv cache is fp8

85ccf7c
Select commit
Loading
Failed to load commit list.
DCO / DCO succeeded Jun 3, 2025 in 1s

DCO

All commits are signed off!