Skip to content

Revert "[0.13.0][cherry-pick][bugfix] fix bug of triton mrope"#6075

Merged
yiz-liu merged 1 commit intovllm-project:releases/v0.13.0from
shaopeng-666:revert-6009-releases/v0.13.0
Jan 21, 2026
Merged

Revert "[0.13.0][cherry-pick][bugfix] fix bug of triton mrope"#6075
yiz-liu merged 1 commit intovllm-project:releases/v0.13.0from
shaopeng-666:revert-6009-releases/v0.13.0

Conversation

@shaopeng-666
Copy link
Copy Markdown
Collaborator

Reverts #6009

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request reverts a previous bug fix related to Triton mrope. The changes involve modifying the import of triton_mrope to be unconditional and refactoring the AscendMRotaryEmbedding class. Specifically, the forward_triton method is re-introduced, but it removes explicit contiguous() calls on cos and sin that were present in the original implementation. This could potentially lead to issues if the triton_mrope kernel expects contiguous inputs for these tensors.


self._match_cos_sin_cache_dtype(query)
cos_sin = self.cos_sin_cache[positions]
cos, sin = cos_sin.chunk(2, dim=-1)
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

The forward_triton method, as re-introduced, removes the explicit contiguous() calls on cos and sin before passing them to triton_mrope. The chunk operation can return non-contiguous views. If triton_mrope expects contiguous tensors for cos and sin, this could lead to unexpected behavior or performance issues. It's safer to ensure contiguity for these inputs.

Suggested change
cos, sin = cos_sin.chunk(2, dim=-1)
cos, sin = cos_sin.chunk(2, dim=-1)
cos = cos.contiguous()
sin = sin.contiguous()

…roject#6009)"

This reverts commit 18eec9d.

Signed-off-by: 李少鹏 <lishaopeng21@huawei.com>
@shaopeng-666 shaopeng-666 force-pushed the revert-6009-releases/v0.13.0 branch from 2e10c06 to c2320a3 Compare January 21, 2026 03:44
@yiz-liu yiz-liu merged commit c80a50c into vllm-project:releases/v0.13.0 Jan 21, 2026
7 of 8 checks passed
845473182 pushed a commit to 845473182/vllm-ascend that referenced this pull request Jan 21, 2026
@shaopeng-666 shaopeng-666 deleted the revert-6009-releases/v0.13.0 branch January 21, 2026 06:24
starmountain1997 pushed a commit to starmountain1997/vllm-ascend that referenced this pull request Jan 31, 2026
tangtiangu pushed a commit to tangtiangu/jiusi-vllm-ascend that referenced this pull request Feb 24, 2026
tangtiangu pushed a commit to tangtiangu/jiusi-vllm-ascend that referenced this pull request Feb 24, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants