Skip to content

Use torch.fp8_e5m2 as fp8 kvcache dtype

5afb543
Select commit
Loading
Failed to load commit list.
Closed

Prefix Caching with FP8 KV cache support #3234

Use torch.fp8_e5m2 as fp8 kvcache dtype
5afb543
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs