Commit 6e2c176

[Bugfix] Restrict Machete to only run on Hopper (#20830)
Signed-off-by: mgoin <[email protected]>
1 parent a86754a commit 6e2c176

1 file changed, +3 -0 lines changed
vllm/model_executor/layers/quantization/kernels/mixed_precision/machete.py

Lines changed: 3 additions & 0 deletions
@@ -32,6 +32,9 @@ def can_implement(cls,
         if not current_platform.is_cuda():
             return False, "Machete only supported on CUDA"
 
+        if not current_platform.is_device_capability(90):
+            return False, "Machete requires compute capability of 90 (Hopper)"
+
         if c.has_g_idx and\
                 c.partition_weight_shape[0] != c.full_weight_shape[0]:
             return False, "Act reordering currently not supported by Machete, "\
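
The effect of the added guard can be illustrated with a minimal, self-contained sketch. `StubPlatform` and the free function `can_implement` below are hypothetical stand-ins, not vLLM classes; only the method names `is_cuda` and `is_device_capability` and the two error strings come from the diff. The sketch assumes `is_device_capability(90)` is an exact-match check, which is what the commit title ("only run on Hopper") suggests.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class StubPlatform:
    """Hypothetical stand-in for vLLM's current_platform (not the real class)."""
    cuda: bool
    capability: Optional[int]  # e.g. 80 = Ampere, 90 = Hopper, 100 = Blackwell

    def is_cuda(self) -> bool:
        return self.cuda

    def is_device_capability(self, capability: int) -> bool:
        # Assumed semantics: exact match, not "at least this capability".
        return self.capability == capability


def can_implement(platform: StubPlatform) -> Tuple[bool, Optional[str]]:
    """Mirror only the two platform guards shown in the diff."""
    if not platform.is_cuda():
        return False, "Machete only supported on CUDA"
    if not platform.is_device_capability(90):
        return False, "Machete requires compute capability of 90 (Hopper)"
    return True, None
```

Under these assumptions, a compute capability 90 device passes both guards, while both older (80) and newer (100) devices are rejected with the new message, so kernel selection can fall through to another implementation instead of attempting to run Machete.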
