[Ops] update causal_conv1d_update by SunnyLee151064 · Pull Request #5984 · vllm-project/vllm-ascend

SunnyLee151064 · 2026-01-19T02:18:33Z

What this PR does / why we need it?

Update causal_conv1d_update ops for better perf.

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.13.0
vLLM main: vllm-project/vllm@2c24bc6

Signed-off-by: SunnyLee219 <3294305115@qq.com>

github-actions · 2026-01-19T02:18:48Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This pull request updates the causal_conv1d_update operation for better performance by changing the data layout of several tensors (weight, conv_state, x) to be more cache-friendly for the Triton kernel. The changes throughout the causal_conv1d_update_npu function are consistent with these new data layouts. My main feedback is to remove a leftover debugging print statement.

gemini-code-assist · 2026-01-19T02:19:33Z

    out: (batch, dim) or (batch, dim, seqlen) or (num_tokens, dim), same shape as `x`
    """
+    weight = weight.transpose(0, 1).contiguous()
+    print("weight's shape: ", weight.size())


This print statement appears to be a debugging artifact. It should be removed before merging to avoid polluting logs and to prevent potential performance degradation in production environments, as I/O operations can be costly.

Signed-off-by: SunnyLee219 <3294305115@qq.com>

…to FIA_rebase * 'main' of https://github.com/vllm-project/vllm-ascend: [CI] Upgrade CANN to 8.5.0 (vllm-project#6070) Default enable MLAPO (vllm-project#5952) [Doc] Supplement PD separation parameters of DeepSeek V3.1 (vllm-project#6053) [Ascend] perf: optimize rope embedding with triton kernel for huge performance gain (vllm-project#5918) [Ops] update causal_conv1d_update (vllm-project#5984) [CI]Update triton ascend version in 3.2.0 (vllm-project#6067) [bugfix] fix the complex and potentially problematic generate_kv_idx. (vllm-project#5957)

### What this PR does / why we need it? Update causal_conv1d_update ops for better perf. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: SunnyLee219 <3294305115@qq.com>

### What this PR does / why we need it? Update causal_conv1d_update ops for better perf. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: SunnyLee219 <3294305115@qq.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? Update causal_conv1d_update ops for better perf. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: SunnyLee219 <3294305115@qq.com>

### What this PR does / why we need it? Update causal_conv1d_update ops for better perf. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: SunnyLee219 <3294305115@qq.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? Update causal_conv1d_update ops for better perf. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: SunnyLee219 <3294305115@qq.com>

update causal_conv1d_update

6d940a5

Signed-off-by: SunnyLee219 <3294305115@qq.com>

SunnyLee151064 requested review from realliujiaxu and zzzzwwjj as code owners January 19, 2026 02:18

github-actions bot added the module:ops label Jan 19, 2026

gemini-code-assist bot reviewed Jan 19, 2026

View reviewed changes

update causal_conv1d_update

35b85b6

Signed-off-by: SunnyLee219 <3294305115@qq.com>

wangxiyuan added ready read for review ready-for-test start test by label for PR labels Jan 19, 2026

wangxiyuan enabled auto-merge (squash) January 19, 2026 08:27

SunnyLee151064 added 3 commits January 20, 2026 09:30

Merge branch 'up_main' into add_update

ae66eb1

Merge branch 'main' into add_update

9989edc

Fix bug

4ebe79a

Signed-off-by: SunnyLee219 <3294305115@qq.com>

auto-merge was automatically disabled January 21, 2026 03:34
Head branch was pushed to by a user without write access

wangxiyuan merged commit 2a618d2 into vllm-project:main Jan 21, 2026
20 checks passed

Yikun mentioned this pull request Feb 5, 2026

[v0.13.0rc2] FAQ / Feedback | 问题/反馈 #6186

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Ops] update causal_conv1d_update#5984

[Ops] update causal_conv1d_update#5984
wangxiyuan merged 5 commits intovllm-project:mainfrom
SunnyLee151064:add_update

SunnyLee151064 commented Jan 19, 2026 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Jan 19, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Jan 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SunnyLee151064 commented Jan 19, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Jan 19, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Jan 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SunnyLee151064 commented Jan 19, 2026 •

edited by github-actions bot

Loading