Skip to content

tile_size: set IntraWGOverlap=false for SWA to match vLLM (9.50ms TPOT)

79c956a
Select commit
Loading
Failed to load commit list.
Closed

[WIP] feat: Add n_offset optimization for sliding window attention #36

tile_size: set IntraWGOverlap=false for SWA to match vLLM (9.50ms TPOT)
79c956a
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs