-
-
Notifications
You must be signed in to change notification settings - Fork 11.7k
[V0 Deprecation] Remove unused classes in attention #25541
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Woosuk Kwon <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request is a great cleanup effort, removing a significant amount of deprecated V0 attention code. The changes are mostly correct and consistent. However, I found one critical issue in vllm/v1/attention/backends/pallas.py where the PallasAttentionBackend class fails to implement an abstract method from its base class, which will lead to a TypeError at runtime. Please see my comment for the details and a suggested fix.
| @staticmethod | ||
| def get_metadata_cls() -> type["PallasMetadata"]: | ||
| return PallasMetadata | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The abstract method get_builder_cls is defined in the base class AttentionBackend, but it's not implemented in PallasAttentionBackend. This will cause a TypeError when trying to instantiate PallasAttentionBackend. Since the Pallas backend does not use a metadata builder, you should add an implementation that raises NotImplementedError.
@staticmethod
def get_builder_cls():
raise NotImplementedError("Pallas backend does not use a metadata builder.")Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
vllm-project/vllm#25541 --------- Signed-off-by: Chendi Xue <[email protected]>
vllm-project/vllm#25541 --------- Signed-off-by: Chendi Xue <[email protected]> Signed-off-by: Kavulya, Soila P <[email protected]>
vllm-project/vllm#25541 --------- Signed-off-by: Chendi Xue <[email protected]> Signed-off-by: Iryna Boiko <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Woosuk Kwon <[email protected]>
No description provided.