Skip to content

[0.13.0][cherry-pick][bugfix] fix bug of triton mrope#6090

Closed
ichaoren wants to merge 1 commit intovllm-project:releases/v0.13.0from
ichaoren:releases/v0.13.0
Closed

[0.13.0][cherry-pick][bugfix] fix bug of triton mrope#6090
ichaoren wants to merge 1 commit intovllm-project:releases/v0.13.0from
ichaoren:releases/v0.13.0

Conversation

@ichaoren
Copy link
Copy Markdown
Contributor

What this PR does / why we need it?

fix pr: #5827
keep same with main branch.

Does this PR introduce any user-facing change?

How was this patch tested?

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

The pull request refactors the AscendMRotaryEmbedding class to conditionally use Triton's mrope implementation. This involves moving the forward_triton logic into a dedicated method and guarding its invocation with a HAS_TRITON check. This improves modularity and handles the dependency on Triton more gracefully. However, a potential issue related to state management in the forward_triton method has been identified.

Comment thread vllm_ascend/ops/rotary_embedding.py
Signed-off-by: ichaoren <36871991+ichaoren@users.noreply.github.com>
@wangxiyuan
Copy link
Copy Markdown
Collaborator

replace by this one #6074

@github-actions
Copy link
Copy Markdown
Contributor

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants