[Bugfix] fix mindiesd laserattention unsupported error #2674
Signed-off-by: fan2956 <zhoufan53@huawei.com>
No regression test for this bugfix. Please add a test that verifies Also: this appears to be a duplicate of #2673. Please close one.
Let's wait until the NPU CI is on.
91fa5c5 into vllm-project:release/v0.18.0.post1
Purpose
This PR fixes a RuntimeError encountered when calling the mindie custom op LaserAttention in diffusion models. The error occurred with the following message:
RuntimeError: call aclnnLaserAttention failed, detail:[PID: 1114958] 2026-04-10-10:18:07.438.311 AclNN_Parameter_Error(EZ1001): Get regInfo failed, The binary_info_config.json of socVersion [ascend910b] does not support opType [LaserAttention].
The root cause was that the ASCEND_CUSTOM_OPP_PATH environment variable was not properly configured before attempting to use the custom op. This PR ensures mindiesd is imported before the model's forward pass, which configures ASCEND_CUSTOM_OPP_PATH and allows LaserAttention to be called correctly.
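The import-before-forward idea can be sketched roughly as follows. This is only an illustration, not the code in this PR: the helper name `ensure_custom_ops_registered` is hypothetical, and it assumes (per the description above) that importing mindiesd sets ASCEND_CUSTOM_OPP_PATH as a side effect.

```python
import importlib
import os


def ensure_custom_ops_registered(module_name="mindiesd",
                                 env_var="ASCEND_CUSTOM_OPP_PATH"):
    """Import `module_name` for its side effects, then check `env_var`.

    Hypothetical helper: returns (imported, configured), where `imported`
    says whether the module could be imported and `configured` says
    whether the environment variable is set to a non-empty value.
    """
    try:
        # Assumption: the import itself configures the custom op path.
        importlib.import_module(module_name)
        imported = True
    except ImportError:
        # The package is not installed on this machine.
        imported = False
    configured = bool(os.environ.get(env_var))
    return imported, configured
```

Calling such a helper once during model setup, before the first forward pass, and falling back to the default attention path when either value is False would avoid hitting the aclnnLaserAttention error at inference time.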
Test Plan
Test Result
DiT inference time
original: 44 s
LaserAttention: 30 s