[ROCm] Fix DeepSeek R1/V3 incorrect output in eager mode. by Duyi-Wang · Pull Request #27392 · vllm-project/vllm

Duyi-Wang · 2025-10-23T06:10:28Z

Purpose

On ROCm, DeepSeek R1/V3 models produce incorrect output when running in eager mode, while graph mode works correctly.

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results.
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

gemini-code-assist

Code Review

This pull request addresses an issue where DeepSeek models produce incorrect output on ROCm in eager mode. The fix involves changing how forward_hip calls forward_cuda for rotary embeddings, correctly using the returned values instead of assuming an in-place operation. This change appears correct based on the problem description. I've added one comment to improve code clarity by removing a now-misleading comment.
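The pattern the review describes can be sketched with a toy example. This is not vLLM's actual code: the class, the method bodies, and the doubling "rotation" are hypothetical stand-ins, and plain Python lists replace tensors. The sketch only assumes, as the review does, that the CUDA path may return new values rather than mutate its inputs.

```python
# Toy model of the bug pattern described in the review; NOT vLLM's actual code.
# Plain Python lists stand in for tensors; all names here are hypothetical.


class ToyRotary:
    def forward_cuda(self, query, key):
        # Assumed for illustration: this path builds new lists (out-of-place).
        # Doubling each element is a placeholder for the real rotation.
        new_query = [2 * x for x in query]
        new_key = [2 * x for x in key]
        return new_query, new_key

    def forward_hip_buggy(self, query, key):
        # Assumes forward_cuda mutated its inputs, so the result is discarded
        # and the untouched inputs are handed back to the caller.
        self.forward_cuda(query, key)
        return query, key

    def forward_hip_fixed(self, query, key):
        # Uses whatever forward_cuda returns, whether or not it is in-place.
        return self.forward_cuda(query, key)


rope = ToyRotary()
q, k = [1.0, 2.0], [3.0, 4.0]
buggy_q, buggy_k = rope.forward_hip_buggy(q, k)
fixed_q, fixed_k = rope.forward_hip_fixed(q, k)
```

In this sketch the buggy variant returns the original, unrotated values, while the fixed variant returns the transformed ones. Returning the callee's result is safe in both cases, which is why it is the more defensive choice when the in-place behavior of the callee is uncertain.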

gemini-code-assist · 2025-10-23T06:12:42Z

vllm/model_executor/layers/rotary_embedding/base.py

            # ops.rotary_embedding() is an in-place operation
            # that updates the query and key tensors.


These comments state that an internal operation is in-place, which contradicts the FIXME on the next line and the logic of the fix (which treats forward_cuda as not in-place by using its return value). To avoid confusion for future readers and maintainers, it would be better to remove these now-misleading comments.

divakar-amd · 2025-10-24T02:21:28Z

A similar fix recently got merged: #27373

github-project-automation bot added this to DeepSeek V3/R1 Oct 23, 2025

github-project-automation bot moved this to Backlog in DeepSeek V3/R1 Oct 23, 2025

mergify bot added the deepseek (Related to DeepSeek models) and rocm (Related to AMD ROCm) labels Oct 23, 2025

gemini-code-assist bot reviewed Oct 23, 2025

fix: walk around acc issue in eager mode for rope forward_hip (vllm-p…

8e17ff2

…roject#19) Signed-off-by: Duyi-Wang <duyi.wang@amd.com>

Duyi-Wang force-pushed the fix_eager_mode_issue branch from 47bdbdf to 8e17ff2 October 23, 2025 12:21

Duyi-Wang closed this Oct 24, 2025

github-project-automation bot moved this from Backlog to Done in DeepSeek V3/R1 Oct 24, 2025

Duyi-Wang deleted the fix_eager_mode_issue branch November 18, 2025 03:05

Duyi-Wang wants to merge 1 commit into vllm-project:main from Duyi-Wang:fix_eager_mode_issue
