Address compatibility issues arising from the removal of the XLA dependency #1423
Address compatibility issues arising from the removal of the XLA dependency #1423
Conversation
DescriptionStart with a short description of what the PR does and how this is a change from The rest of the description includes relevant details and context, examples:
If the change fixes a Github issue, please include a link, e.g.,: TestsPlease describe how you tested this change, and include any instructions and/or ChecklistBefore submitting this PR, please make sure:
|
QiliangCui
left a comment
There was a problem hiding this comment.
can you explain more about why
Add create_weights in compressed_tensors_w8a8_int8.py because vllm's implementation would introduce another error.
...nference/layers/vllm/quantization/compressed_tensors/schemes/compressed_tensors_w8a8_int8.py
Show resolved
Hide resolved
54c01b2 to
38f8531
Compare
5102888 to
b3c04e9
Compare
|
seems like ci is failing due to upstream change: vllm-project/vllm#30519 I think this should be a simple fix. @weiyu0824 can you add a fix commit to this branch as well? |
b3c04e9 to
bce6db3
Compare
8ff7761 to
5977a14
Compare
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
5977a14 to
ec15832
Compare
|
hmmm we have another error due to upstream change 1 hour ago: vllm-project/vllm#32003 yeah, feel free to ignore all my other comments and let's just try to get this pr merged asap. |
Signed-off-by: Wei-Yu Lin <weiyulin@google.com>
4d84445 to
e5deac5
Compare
This reverts commit fece762. Signed-off-by: Kyuyeun Kim <kyuyeunk@google.com>
77c71d5 to
90480a1
Compare
Signed-off-by: Kyuyeun Kim <kyuyeunk@google.com>
Signed-off-by: Kyuyeun Kim <kyuyeunk@google.com>
317a769 to
4719933
Compare
Signed-off-by: Kyuyeun Kim <kyuyeunk@google.com>
4719933 to
9fbda58
Compare
Description
This PR addresses compatibility issues arising from the removal of the XLA dependency in upstream vLLM (reference.
Changes:
Tests
Checklist
Before submitting this PR, please make sure: