
Commit ac3ed5a

Support block size of 256 used by Intel HPU (#26883)
Signed-off-by: mandy-li <[email protected]>
1 parent e6ba200 commit ac3ed5a

1 file changed: +1 -1 lines changed

vllm/config/cache.py

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@
 
 logger = init_logger(__name__)
 
-BlockSize = Literal[1, 8, 16, 32, 64, 128]
+BlockSize = Literal[1, 8, 16, 32, 64, 128, 256]
 CacheDType = Literal["auto", "bfloat16", "fp8", "fp8_e4m3", "fp8_e5m2", "fp8_inc"]
 MambaDType = Literal["auto", "float32"]
 PrefixCachingHashAlgo = Literal["sha256", "sha256_cbor"]
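
The change only widens a `Literal` type alias, so its effect is easiest to see by checking a value against the allowed literals. The sketch below is illustrative and is not taken from vLLM: the alias is copied from the diff so the snippet is self-contained, and `is_supported_block_size` is a hypothetical helper. How vLLM itself validates `BlockSize` is not shown in this commit.

```python
from typing import Literal, get_args

# Copied from the diff above (post-commit value) so the sketch runs on its own.
BlockSize = Literal[1, 8, 16, 32, 64, 128, 256]


def is_supported_block_size(value: int) -> bool:
    """Hypothetical helper: True if `value` is one of the BlockSize literals."""
    # get_args() returns the literal values of the alias as a tuple.
    return value in get_args(BlockSize)


print(is_supported_block_size(256))  # True (would be False before this commit)
print(is_supported_block_size(512))  # False
```

Because the allowed values live in the type alias, a runtime check written this way picks up 256 without any further change.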
