Skip to content

Commit a5464dc

Browse files
kylesayrsmgoin
andauthored
[Compressed Tensors] Always clone output for compile robustness (#26849)
Signed-off-by: Kyle Sayers <[email protected]> Co-authored-by: Michael Goin <[email protected]>
1 parent ac3ed5a commit a5464dc

File tree

1 file changed

+1
-1
lines changed
  • vllm/model_executor/layers/quantization/compressed_tensors/transform

1 file changed

+1
-1
lines changed

vllm/model_executor/layers/quantization/compressed_tensors/transform/linear.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -163,7 +163,7 @@ def apply(
163163
if self.output_transform is not None:
164164
for part_id, (start, length) in enumerate(self.partition_ranges):
165165
x[:, start : start + length] = self.output_transform(
166-
x[:, start : start + length].contiguous(), part_id=part_id
166+
x[:, start : start + length].clone(), part_id=part_id
167167
)
168168

169169
return x

0 commit comments

Comments
 (0)