[bugfix] fix test_camem failed with triton-ascend by Meihan-chen · Pull Request #5492 · vllm-project/vllm-ascend

Meihan-chen · 2025-12-29T13:26:15Z

What this PR does / why we need it?

This fixes a bug that occurred when running test_camem.py in the triton-ascend environment NPU function error: aclrtGetMemInfo(ACL_HBM_MEM, &device_free, &device_total)

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.13.0
vLLM main: vllm-project/vllm@5326c89

gemini-code-assist

Code Review

This pull request refactors the codebase to address an e2e test failure involving Triton. The core change involves centralizing the import of torch_npu._inductor from multiple Triton kernel files into a single location within the NPUWorker's initialization. This side-effect-only import is now performed once when a worker starts, ensuring proper and timely initialization before any Triton operations are executed. This change improves code structure and resolves potential issues arising from multiple or improperly timed imports. The implementation appears correct and aligns with the project's established coding practices.

github-actions · 2025-12-29T15:01:37Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

Yikun · 2025-12-30T00:32:34Z

        adapt_patch()
+        from vllm.triton_utils import HAS_TRITON
+        if HAS_TRITON:
+            import torch_npu._inductor  # noqa: F401


Would mind adding a note to show why need import this?

Done! I've added a comment explaining.

Yikun · 2025-12-30T00:37:01Z

-
-if HAS_TRITON:
-    import torch_npu._inductor  # noqa: F401
+from vllm.triton_utils import tl, triton


It seems we need to add a check Forbid import torch_npu._inductor in vllm_ascend/ops/triton/
https://github.com/vllm-project/vllm/blob/main/.pre-commit-config.yaml#L132

This import of torch_npu._inductor fixes graph mode running errors with triton-ascend. Once added, the issue no longer occurs, so future Triton ops won't need similar imports in ops/triton. Therefore, we don't need a dedicated pre-commit specifically for this.

Yikun · 2025-12-30T00:37:56Z

Please fullfill commit msg and change commit title to a meaningful title.

Meihan-chen · 2025-12-30T03:13:50Z

Please fullfill commit msg and change commit title to a meaningful title.

Thanks, I have modified it

Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>

### What this PR does / why we need it? This fixes a bug that occurred when running `test_camem.py` in the triton-ascend environment `NPU function error: aclrtGetMemInfo(ACL_HBM_MEM, &device_free, &device_total)` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@5326c89 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>

…to FIA_rebase * 'main' of https://github.com/vllm-project/vllm-ascend: (58 commits) [Main2Main] Upgrade vllm commit to 0106 (vllm-project#5617) [CI]update bisheng version (vllm-project#5621) [UT][PCP&DCP] UT for block_table.py (vllm-project#5032) [Main2Main] Upgrade vllm commit to 0105 (vllm-project#5595) [CI] mv ops to correct path (vllm-project#5615) [BugFix] Fix Smoke Testing Bug for DSR1 longseq (vllm-project#5613) Revert "[Feat] enable hierarchical mc2 ops on A2 by default (vllm-project#5545)" (vllm-project#5611) [TRITON][TEST]Add nightly test for triton split_qkv_rmsnorm_rope (vllm-project#5267) [perf] Fix MLAPO weight disposal for KV-consumer MLA in PD-mix deploy... (vllm-project#5192) [docs] Correct image about prefill phase of PCP (vllm-project#5598) [CI] update triton-ascend version (vllm-project#5584) [P/D]Remove mooncake kvpool unused parameter `local_hostname` (vllm-project#5574) [Bugfix] record cos and sin cache in AscendRotaryEmbedding (vllm-project#5516) [bugfix] fix test_camem failed with triton-ascend (vllm-project#5492) [UT]add triton ops ut : test_fused_qkvzba_split_reshape_cat (vllm-project#5474) [CI] Download models from ms (vllm-project#5405) Docs: Add A3 Docker image guidance for Atlas A3 machines (vllm-project#5256) [Doc] Add NNAL installation guide and requirements (vllm-project#5235) Add the requirement of arctic-inference which speculative decoding with suffix_decode (vllm-project#5045) [BugFix][Fusion] Fix graph fusion failure problem (vllm-project#5253) ...

### What this PR does / why we need it? This fixes a bug that occurred when running `test_camem.py` in the triton-ascend environment `NPU function error: aclrtGetMemInfo(ACL_HBM_MEM, &device_free, &device_total)` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@5326c89 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>

### What this PR does / why we need it? This fixes a bug that occurred when running `test_camem.py` in the triton-ascend environment `NPU function error: aclrtGetMemInfo(ACL_HBM_MEM, &device_free, &device_total)` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@5326c89 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? This fixes a bug that occurred when running `test_camem.py` in the triton-ascend environment `NPU function error: aclrtGetMemInfo(ACL_HBM_MEM, &device_free, &device_total)` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@5326c89 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>

### What this PR does / why we need it? This fixes a bug that occurred when running `test_camem.py` in the triton-ascend environment `NPU function error: aclrtGetMemInfo(ACL_HBM_MEM, &device_free, &device_total)` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@5326c89 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? This fixes a bug that occurred when running `test_camem.py` in the triton-ascend environment `NPU function error: aclrtGetMemInfo(ACL_HBM_MEM, &device_free, &device_total)` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@5326c89 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>

wxsIcey added ready read for review ready-for-test start test by label for PR labels Dec 29, 2025

gemini-code-assist Bot reviewed Dec 29, 2025

View reviewed changes

Meihan-chen force-pushed the triton_ci branch 2 times, most recently from d1bffdc to 41cd56e Compare December 29, 2025 14:32

github-actions Bot added ci/build module:ops labels Dec 29, 2025

Yikun reviewed Dec 30, 2025

View reviewed changes

Meihan-chen changed the title ~~[bugfix] fix e2e test failed with triton~~ [bugfix] fix test_camem.py aclrtGetMemInfo failed with triton-ascend Dec 30, 2025

Meihan-chen changed the title ~~[bugfix] fix test_camem.py aclrtGetMemInfo failed with triton-ascend~~ [bugfix] fix aclrtGetMemInfo faild when running test_camem with triton-ascend Dec 30, 2025

Meihan-chen changed the title ~~[bugfix] fix aclrtGetMemInfo faild when running test_camem with triton-ascend~~ [bugfix] fix aclrtGetMemInfo faild issue when running test_camem with triton-ascend Dec 30, 2025

Meihan-chen changed the title ~~[bugfix] fix aclrtGetMemInfo faild issue when running test_camem with triton-ascend~~ [bugfix] fix test_camem failed with triton-ascend Dec 30, 2025

Meihan-chen force-pushed the triton_ci branch from 03d96a9 to 723aed4 Compare December 30, 2025 11:46

Meihan-chen added 2 commits January 4, 2026 18:18

[bugfix] fix test file failed with triton

3554796

Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>

add note

c111b5d

Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>

Meihan-chen force-pushed the triton_ci branch from 723aed4 to c111b5d Compare January 4, 2026 10:19

wangxiyuan merged commit 16b1bee into vllm-project:main Jan 5, 2026
19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] fix test_camem failed with triton-ascend#5492

[bugfix] fix test_camem failed with triton-ascend#5492
wangxiyuan merged 2 commits intovllm-project:mainfrom
Meihan-chen:triton_ci

Meihan-chen commented Dec 29, 2025 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

github-actions Bot commented Dec 29, 2025

Uh oh!

Yikun Dec 30, 2025

Uh oh!

Meihan-chen Dec 30, 2025

Uh oh!

Yikun Dec 30, 2025

Uh oh!

Meihan-chen Dec 30, 2025

Uh oh!

Yikun commented Dec 30, 2025 •

edited

Loading

Uh oh!

Meihan-chen commented Dec 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Meihan-chen commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

github-actions Bot commented Dec 29, 2025

Uh oh!

Yikun Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Meihan-chen Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Yikun Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Meihan-chen Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Yikun commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Meihan-chen commented Dec 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Meihan-chen commented Dec 29, 2025 •

edited

Loading

Yikun commented Dec 30, 2025 •

edited

Loading