
[Origami] fp4 fix #3407

Closed
bnemanich wants to merge 2 commits into develop from users/bnemanich/origami_fp4_fix


@bnemanich
Contributor

@bnemanich bnemanich commented Dec 16, 2025

Motivation

Origami's use of data_type_to_bytes produces incorrect calculations for sub-byte data types (such as FP4). This has caused a performance regression in hipBLASLt.

Technical Details

data_type_to_bytes currently returns an integer, so the size of any sub-byte data type is rounded up to 1 byte. This PR changes data_type_to_bytes to return a double, so FP4 is represented as 0.5 bytes.
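
The rounding issue described above can be illustrated with a minimal sketch. This is not Origami's actual code: the `DataType` enum, `data_type_to_bits`, `data_type_to_bytes_rounded`, and `tile_bytes` are hypothetical names introduced only for illustration, and only the name `data_type_to_bytes` and its integer-to-double change come from this PR.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical enum for illustration; Origami's real data type list may differ.
enum class DataType { FP32, FP16, FP8, FP4 };

// Bit width of each (hypothetical) data type.
inline int data_type_to_bits(DataType t) {
    switch (t) {
        case DataType::FP32: return 32;
        case DataType::FP16: return 16;
        case DataType::FP8:  return 8;
        case DataType::FP4:  return 4;
    }
    return 0;
}

// Sketch of the old behavior: an integer result, so sub-byte types round up to 1.
inline int data_type_to_bytes_rounded(DataType t) {
    return (data_type_to_bits(t) + 7) / 8;
}

// Sketch of the new behavior: a double result, so FP4 is 0.5 bytes.
inline double data_type_to_bytes(DataType t) {
    return data_type_to_bits(t) / 8.0;
}

// Hypothetical helper: memory footprint of a rows x cols tile, in bytes.
inline double tile_bytes(std::size_t rows, std::size_t cols, DataType t) {
    assert(data_type_to_bits(t) > 0 && "unknown data type");
    return static_cast<double>(rows * cols) * data_type_to_bytes(t);
}
```

Under these assumptions, a 256 x 128 FP4 tile occupies 16384 bytes, whereas the integer version would count 32768 bytes, doubling any memory traffic estimate derived from it.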

Test Plan

A new test was added to check a few FP4 rank_configs calculations. #3301 should also be updated to add more FP4 unit tests.

Test Result

All Origami tests passing locally.

Submission Checklist

@bnemanich bnemanich requested a review from a team as a code owner December 16, 2025 05:19
COrruDXC pushed a commit to COrruDXC/rocm-libraries that referenced this pull request Dec 16, 2025
@bnemanich bnemanich changed the title Users/bnemanich/origami fp4 fix [Origami] fp4 fix Dec 16, 2025
@neoblizz
Member

Merged in #3452

@neoblizz neoblizz closed this Dec 19, 2025
@neoblizz neoblizz deleted the users/bnemanich/origami_fp4_fix branch December 19, 2025 17:04
ammallya pushed a commit that referenced this pull request Feb 3, 2026
[ROCm/composable_kernel commit: 45c4ea5]