[UT] add triton ops ut: test_fused_qkvzba_split_reshape_cat #5474
wangxiyuan merged 10 commits into vllm-project:main
Conversation
Signed-off-by: ZT-AIA <1028681969@qq.com>
Code Review
This pull request introduces a new unit test for the fused_qkvzba_split_reshape_cat Triton operation. The test coverage provided by the parameterization is good. However, I've identified two high-severity issues that should be addressed to improve the quality and maintainability of the test code: dead code that unnecessarily allocates a tensor, and a fragile pattern used to build the reference implementation, which should be refactored to make the test more robust.
hidden_size = 2048

hidden_states = torch.randn(
    seq_len,
    hidden_size,
    dtype=dtype,
    device=device
)
gdn = Qwen3NextGatedDeltaNet.__new__(Qwen3NextGatedDeltaNet)
gdn.num_k_heads = num_heads_qk
gdn.num_v_heads = num_heads_v
gdn.head_k_dim = head_qk_dim
gdn.head_v_dim = head_v_dim
gdn.tp_size = 1
Using __new__ to create an uninitialized object and then monkey-patching its attributes is a fragile testing pattern: it tightly couples the test to the implementation details of Qwen3NextGatedDeltaNet. If the dependencies of fix_query_key_value_ordering change (e.g., it starts relying on an attribute set in __init__), this test will fail in a non-obvious way.
A more robust approach would be to extract the reference logic into a pure, standalone function within this test file. This would make the test self-contained and resilient to changes in the model class. If that's not feasible, Qwen3NextGatedDeltaNet should be instantiated properly, using mocks where necessary to avoid the overhead of full model initialization.
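As a hypothetical illustration of the suggested refactor (the function name, argument order, and split layout below are assumptions for the sketch, not the actual vllm-ascend code), the reference logic could live in a standalone helper that takes the head dimensions explicitly, so no Qwen3NextGatedDeltaNet instance is needed:

```python
import torch


def reference_qkvz_split(mixed, num_k_heads, head_k_dim, num_v_heads, head_v_dim):
    """Hypothetical standalone reference: split a fused [seq, hidden] projection
    into per-head q/k/v/z tensors using only explicit dimension arguments."""
    # Per-component widths along the hidden dimension (assumed layout: q, k, v, z).
    q_size = num_k_heads * head_k_dim
    k_size = num_k_heads * head_k_dim
    v_size = num_v_heads * head_v_dim
    z_size = num_v_heads * head_v_dim
    q, k, v, z = torch.split(mixed, [q_size, k_size, v_size, z_size], dim=-1)
    seq = mixed.shape[0]
    # Reshape each flat chunk into [seq, num_heads, head_dim].
    return (
        q.view(seq, num_k_heads, head_k_dim),
        k.view(seq, num_k_heads, head_k_dim),
        v.view(seq, num_v_heads, head_v_dim),
        z.view(seq, num_v_heads, head_v_dim),
    )
```

A test parameterized over head counts and dimensions can then call this helper directly, staying resilient to any refactor of the model class's __init__.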
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to the Contributing and Testing guides.
@@ -0,0 +1,100 @@
import pytest
What this PR does / why we need it?
[UT] add triton ops ut: test_fused_qkvzba_split_reshape_cat

Does this PR introduce any user-facing change?

How was this patch tested?
pytest -sv tests/ut/ops/test_fused_qkvzba_split_reshape_cat.py

- vLLM version: v0.13.0
- vLLM main: vllm-project/vllm@5326c89
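For context on the testing pattern being discussed, a self-contained parameterized test comparing a fused op against a pure-PyTorch reference could be sketched as follows. The fused_op stub below is a plain-PyTorch stand-in, not the real Triton kernel; the actual kernel name and signature in vllm-ascend are not assumed here:

```python
import pytest
import torch


def fused_op(x, n_heads, head_dim):
    # Stand-in for the fused kernel under test: reshape a flat
    # [seq, n_heads * head_dim] tensor into [seq, n_heads, head_dim].
    return x.view(x.shape[0], n_heads, head_dim)


def reference_op(x, n_heads, head_dim):
    # Pure-PyTorch reference with the same contract.
    return x.reshape(x.shape[0], n_heads, head_dim)


@pytest.mark.parametrize("seq_len", [1, 8])
@pytest.mark.parametrize("dtype", [torch.float16, torch.float32])
def test_fused_matches_reference(seq_len, dtype):
    x = torch.randn(seq_len, 4 * 16, dtype=dtype)
    out = fused_op(x, n_heads=4, head_dim=16)
    ref = reference_op(x, n_heads=4, head_dim=16)
    assert torch.equal(out, ref)
```

Keeping the reference as a standalone function, as the review suggests, means the parameterized cases exercise only the op's contract and never touch model-class internals.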