Skip to content

Commit 3727a1d

Browse files
fahadh4ilyassumitd2
authored andcommitted
[Bugfix] Fix lora loading for Compressed Tensors in vllm-project#9120 (vllm-project#9179)
Signed-off-by: Sumit Dubey <[email protected]>
1 parent 35398dd commit 3727a1d

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

vllm/lora/layers.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,9 @@ def _get_lora_device(base_layer: nn.Module) -> torch.device:
3939
# unquantizedLinear
4040
if hasattr(base_layer, "weight"):
4141
return base_layer.weight.device
42+
# Compressed Tensor
43+
elif hasattr(base_layer, "weight_packed"):
44+
return base_layer.weight_packed.device
4245
# GPTQ/AWQ
4346
elif hasattr(base_layer, "qweight"):
4447
return base_layer.qweight.device

0 commit comments

Comments
 (0)