Skip to content

Fix RoPE for bf16#3766

Merged
BrianHarrisonAMD merged 1 commit into
developfrom
users/bsiklosi/fix_rope_bf16
Jan 13, 2026
Merged

Fix RoPE for bf16#3766
BrianHarrisonAMD merged 1 commit into
developfrom
users/bsiklosi/fix_rope_bf16

Conversation

@sikba
Copy link
Copy Markdown
Contributor

@sikba sikba commented Jan 10, 2026

Motivation

During ivestigating MIOpen error tolerance #912 , CI failed for RoPE kernels in bf16.

Technical Details

This PR fixes the wrong casting and negating operation order for fwd and bwd RoPE.

Test Plan

All tests pass. For some driver calls, the error got smaller. After the error tolerance fix applied, this kernel will pass the tests too.

@sikba sikba self-assigned this Jan 10, 2026
@sikba sikba requested a review from a team as a code owner January 10, 2026 12:22
@sikba sikba added the organization: streamhpc contributors from streamhpc label Jan 10, 2026
Copy link
Copy Markdown
Member

@BalintCsala BalintCsala left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Copy Markdown
Contributor

@BrianHarrisonAMD BrianHarrisonAMD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@BrianHarrisonAMD BrianHarrisonAMD merged commit 176e3f2 into develop Jan 13, 2026
73 of 78 checks passed
@BrianHarrisonAMD BrianHarrisonAMD deleted the users/bsiklosi/fix_rope_bf16 branch January 13, 2026 14:59
assistant-librarian Bot pushed a commit to ROCm/MIOpen that referenced this pull request Jan 13, 2026
Fix RoPE for bf16

## Motivation

During ivestigating MIOpen error tolerance #912 , CI failed for RoPE
kernels in bf16.

## Technical Details

This PR fixes the wrong casting and negating operation order for fwd and
bwd RoPE.

## Test Plan

All tests pass. For some driver calls, the error got smaller. After the
error tolerance fix applied, this kernel will pass the tests too.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants