MIOpen error tolerance #912
Open
assistant-librarian[bot] wants to merge 25 commits into
…evelop/StreamHPC_MIOpen/miopen-error-tolerance
jonatluu
pushed a commit
to jonatluu/rocm-libraries
that referenced
this pull request
Jul 28, 2025
ammallya
pushed a commit
that referenced
this pull request
Sep 18, 2025
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.17.1 to 1.18.1.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](ROCm/rocm-docs-core@v1.17.1...v1.18.1)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

[ROCm/rocSOLVER commit: ee9e185]
Merged
9af1e6a to
c782460
BrianHarrisonAMD
pushed a commit
that referenced
this pull request
Jan 13, 2026
## Motivation
While investigating MIOpen error tolerance #912, CI failed for the RoPE kernels in bf16.

## Technical Details
This PR fixes the incorrect order of the cast and negation operations in the forward and backward RoPE kernels.

## Test Plan
All tests pass. For some driver calls, the error decreased. Once the error tolerance fix is applied, this kernel will pass those tests as well.
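The commit message above does not show the kernel code, so the following is only a minimal sketch of why the order of a cast and a negation can matter. It assumes bf16 values are held as 16-bit integer patterns. The helper names (`widen_bf16`, `negate_then_cast`, `cast_then_negate`) are hypothetical and are not taken from MIOpen.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper: widen a bf16 bit pattern to float.
// bf16 occupies the upper 16 bits of an IEEE-754 single-precision value.
inline float widen_bf16(std::uint16_t bits) {
    std::uint32_t u = static_cast<std::uint32_t>(bits) << 16;
    float f;
    std::memcpy(&f, &u, sizeof f);
    return f;
}

// Negate first, cast second: the minus sign acts on the integer bit
// pattern (two's complement), which is not a floating-point sign flip.
inline float negate_then_cast(std::uint16_t bits) {
    std::uint16_t negated = static_cast<std::uint16_t>(-bits);
    return widen_bf16(negated);
}

// Cast first, negate second: the minus sign acts on the float value.
inline float cast_then_negate(std::uint16_t bits) {
    return -widen_bf16(bits);
}
```

For the bit pattern `0x3F80` (bf16 for 1.0), `cast_then_negate` yields -1.0, while `negate_then_cast` yields -4.0, because the integer negation produces the pattern `0xC080`.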
c782460 to
578cb4a
578cb4a to
37be11f
BradPepersAMD
pushed a commit
that referenced
this pull request
Feb 17, 2026
## Motivation
While investigating MIOpen error tolerance #912, CI failed for the above-mentioned kernel.

## Technical Details
For BF16, FLOAT maps to the ushort storage type. Accumulating directly in FLOAT therefore performs arithmetic on BF16 bit patterns and produces a corrupted dx. This PR replaces FLOAT accumulation with FLOAT_ACCUM and adds explicit convert-in/convert-out through CVT_FLOAT2ACCUM / CVT_ACCUM2FLOAT.

## Test Result
The tests that previously failed under the tightened error check now pass.
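The failure mode described above can be sketched in plain C++. This is an illustration under stated assumptions, not the kernel code. The helpers `bf16_to_float` and `float_to_bf16` are hypothetical stand-ins for the convert-in/convert-out step, and the narrowing here truncates, whereas a real conversion may round.

```cpp
#include <cstdint>
#include <cstring>

// BF16 value held as a raw 16-bit pattern (the "ushort storage type").
using bf16_bits = std::uint16_t;

// Hypothetical convert-in: bf16 is the upper 16 bits of an IEEE-754 float.
inline float bf16_to_float(bf16_bits b) {
    std::uint32_t u = static_cast<std::uint32_t>(b) << 16;
    float f;
    std::memcpy(&f, &u, sizeof f);
    return f;
}

// Hypothetical convert-out: narrow by truncation (assumption for brevity).
inline bf16_bits float_to_bf16(float f) {
    std::uint32_t u;
    std::memcpy(&u, &f, sizeof u);
    return static_cast<bf16_bits>(u >> 16);
}

// Problematic pattern: integer addition of the raw bit patterns.
inline bf16_bits sum_on_bits(const bf16_bits* x, int n) {
    bf16_bits acc = 0;
    for (int i = 0; i < n; ++i)
        acc = static_cast<bf16_bits>(acc + x[i]);
    return acc;
}

// Intended pattern: convert in, accumulate in float, convert out once.
inline bf16_bits sum_in_float(const bf16_bits* x, int n) {
    float acc = 0.0f;
    for (int i = 0; i < n; ++i)
        acc += bf16_to_float(x[i]);
    return float_to_bf16(acc);
}
```

Summing two copies of 1.0 (pattern `0x3F80`) with `sum_in_float` gives `0x4000`, which is 2.0. `sum_on_bits` gives `0x7F00`, which decodes to roughly 1.7e38.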
SamuelReeder
pushed a commit
that referenced
this pull request
Feb 17, 2026
## Motivation
While investigating MIOpen error tolerance #912, CI failed for the above-mentioned kernel.

## Technical Details
For BF16, FLOAT maps to the ushort storage type. Accumulating directly in FLOAT therefore performs arithmetic on BF16 bit patterns and produces a corrupted dx. This PR replaces FLOAT accumulation with FLOAT_ACCUM and adds explicit convert-in/convert-out through CVT_FLOAT2ACCUM / CVT_ACCUM2FLOAT.

## Test Result
The tests that previously failed under the tightened error check now pass.
kamuruga08
pushed a commit
that referenced
this pull request
Feb 19, 2026
## Motivation
While investigating MIOpen error tolerance #912, CI failed for the above-mentioned kernel.

## Technical Details
For BF16, FLOAT maps to the ushort storage type. Accumulating directly in FLOAT therefore performs arithmetic on BF16 bit patterns and produces a corrupted dx. This PR replaces FLOAT accumulation with FLOAT_ACCUM and adds explicit convert-in/convert-out through CVT_FLOAT2ACCUM / CVT_ACCUM2FLOAT.

## Test Result
The tests that previously failed under the tightened error check now pass.
mgates3
pushed a commit
to mgates3/rocm-libraries
that referenced
this pull request
Mar 6, 2026
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.17.1 to 1.18.1.
- [Release notes](https://github.com/ROCm/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](ROCm/rocm-docs-core@v1.17.1...v1.18.1)

---
updated-dependencies:
- dependency-name: rocm-docs-core
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
jovanau
pushed a commit
to jovanau/rocm-libraries
that referenced
this pull request
Mar 19, 2026
## Motivation
While investigating MIOpen error tolerance ROCm#912, CI failed for the above-mentioned kernel.

## Technical Details
For BF16, FLOAT maps to the ushort storage type. Accumulating directly in FLOAT therefore performs arithmetic on BF16 bit patterns and produces a corrupted dx. This PR replaces FLOAT accumulation with FLOAT_ACCUM and adds explicit convert-in/convert-out through CVT_FLOAT2ACCUM / CVT_ACCUM2FLOAT.

## Test Result
The tests that previously failed under the tightened error check now pass.
EwanC
approved these changes
Apr 21, 2026
Contributor
EwanC
left a comment
LGTM, but the PR description that got ported over from the original MIOpen PR is missing all the information in backticks. It would be worth restoring that information so that the git commit message retains those details when this is merged (assuming the PR description is used as part of the commit message).
Scope
While testing variant 0 of bnorm backward spatial single, we observed that the error tolerance in MIOpenDriver is too lenient. For instance, the following results were observed:
Note that the run succeeds according to the driver checks. However, the reported error is high, so the run should actually fail.
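To make the concern concrete, here is a minimal sketch of a tolerance-based verification step. It assumes a relative RMS error metric. The function names and the metric are illustrative assumptions and do not reproduce MIOpenDriver's actual check.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical metric: RMS of the element-wise difference, normalized by
// the largest magnitude seen in either buffer.
inline double relative_rms_error(const std::vector<float>& out,
                                 const std::vector<float>& ref) {
    if (out.size() != ref.size())
        return std::numeric_limits<double>::infinity(); // shape mismatch
    if (ref.empty())
        return 0.0;
    double sum_sq = 0.0;
    double max_mag = 0.0;
    for (std::size_t i = 0; i < ref.size(); ++i) {
        const double o = static_cast<double>(out[i]);
        const double r = static_cast<double>(ref[i]);
        sum_sq += (o - r) * (o - r);
        max_mag = std::max({max_mag, std::fabs(o), std::fabs(r)});
    }
    if (max_mag == 0.0)
        return 0.0; // both buffers are all zeros
    return std::sqrt(sum_sq / static_cast<double>(ref.size())) / max_mag;
}

// Hypothetical pass/fail decision: the outcome depends entirely on the
// tolerance the caller supplies.
inline bool passes(const std::vector<float>& out,
                   const std::vector<float>& ref,
                   double tolerance) {
    return relative_rms_error(out, ref) <= tolerance;
}
```

With a reference of `{1, 2, 3, 4}` and an output of `{1, 2, 3, 4.4}`, the relative RMS error is about 0.045. A tolerance of 0.1 reports success, while a tolerance of 1e-3 reports failure. This is the kind of gap between "verified" and "numerically acceptable" that the description above points to.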
Notes for the reviewer
🔁 Imported from ROCm/MIOpen#3638
🧑💻 Originally authored by @Beanavil