[Misc] Revert compressed-tensors code reuse#7521
Conversation
This reverts commit 373538f.
|
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge). To run full CI, you can do one of these:
🚀 |
|
/ready |
|
Tests are failing because of |
|
There are also seemingly tokenizer issues now https://buildkite.com/vllm/ci-aws/builds/6845#01915175-06f3-4ca9-b597-637d6e4dd026/193-731 |
|
These are failing on main as well as of commit d3d9cb6 |
Signed-off-by: Alvant <alvasian@yandex.ru>
Signed-off-by: LeiWang1999 <leiwang1999@outlook.com>
This PR reverts #7277, specifically commit 373538f.
This addresses an incompatibility between the nvcr.io/nvidia/pytorch:24.05-py3 docker container and
compressed-tensors'sacceleratedependency. #7277 will be readded onceaccelerateis moved to an optional dependency in a futurecompressed-tensorsrelease.FIX #7516