[hipBLASLt] Enable MX data generation for Tensile host and support calling Tensile MX kernel by amd-chunxlin · Pull Request #4599 · ROCm/rocm-libraries

amd-chunxlin · 2026-02-16T22:51:16Z

Motivation

This PR enables using mxDataGenerator when Tensile is the host and supports calling FP4 kernels generated from Tensile.

Technical Details

Add a FP4 library (yaml) generated by Tensile under GridBased category: YAML
Remove macros to use mxDataGenerator regardless which host used. Now the default C++ standard is set to C++20 as it is required by mxDataGenerator.
Support calling Tensile FP4 solutions

Test Plan

Use cmake preset build with rocRoller host off (i.e., use Tensile as host) , gpu target set to gfx950 and -DBUILD_TESTING:BOOL=OFF (turn off tensileLite test which will error out during build)

Use hipblaslt-test
./clients/hipblaslt-test --gtest_filter=*matmul_tensile_fp4*
Use hipblaslt-bench
./clients/hipblaslt-bench --iters 0 --cold_iters 0 --transA T --transB N --a_type f4_r --b_type f4_r --c_type f32_r --d_type f32_r -m 256 -n 256 -k 256 --alpha 2.1 --beta 0.7 --scaleA 3 --scaleB 3 --scale_type f32_r --verify

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

bnemanich · 2026-02-18T20:11:46Z

        tensileProblem.setSwizzleTensorA(prob.swizzleA);
        tensileProblem.setSwizzleTensorB(prob.swizzleB);
+
+	if(prob.scaleAType == RocblasltContractionProblem::ScalingFormat::Block_32_UE8M0)


This should also handle Block_32_UE8M0_32_8_EXT

Added Block_32_UE8M0_32_8_EXT, thanks!

bnemanich · 2026-02-18T20:15:25Z

+	// NOTE: an assumption here is A & B must be both MX data types or non-MX data types.
+	//       Mixing is not supported.
+        if(!problemType.useScaleAB.empty() or
+	    (problemType.mxBlockA == 32 && problemType.mxBlockB == 32)) //kernel input data


Maybe should check that mxBlockA != 0 instead.

Changed to !=0 instead of ==32, thanks!

amd-chunxlin added 3 commits February 16, 2026 22:01

Add Tensile FP4 library

b28fbf5

Enable mxDataGenerator without rocRoller

b7f1bcd

Add support for calling Tensile FP4 kernel and include a test

1848898

github-actions Bot added the project: hipblaslt label Feb 16, 2026

assistant-librarian Bot added the organization: ROCm label Feb 16, 2026

amd-chunxlin marked this pull request as ready for review February 17, 2026 19:56

amd-chunxlin requested review from a team as code owners February 17, 2026 19:56

bnemanich reviewed Feb 18, 2026

View reviewed changes

Address reviewers' feedback

bc7c813

bnemanich approved these changes Feb 20, 2026

View reviewed changes

amd-chunxlin merged commit e0a7991 into gfx950_mx_rebase Feb 20, 2026
13 of 22 checks passed

amd-chunxlin deleted the users/chunxlin/mxGen branch February 20, 2026 19:36

amd-chunxlin mentioned this pull request Feb 23, 2026

[hipBLASLt] Fix numeric regression of MXFP4 #4808

Closed

1 task

amd-chunxlin restored the users/chunxlin/mxGen branch March 2, 2026 20:57

talumbau mentioned this pull request Mar 11, 2026

Fix testSolutionStructsUtilities Unit Test #5350

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[hipBLASLt] Enable MX data generation for Tensile host and support calling Tensile MX kernel#4599

[hipBLASLt] Enable MX data generation for Tensile host and support calling Tensile MX kernel#4599
amd-chunxlin merged 4 commits into
gfx950_mx_rebasefrom
users/chunxlin/mxGen

amd-chunxlin commented Feb 16, 2026 •

edited

Loading

Uh oh!

bnemanich Feb 18, 2026

Uh oh!

amd-chunxlin Feb 19, 2026

Uh oh!

bnemanich Feb 18, 2026

Uh oh!

amd-chunxlin Feb 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

amd-chunxlin commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

bnemanich Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

amd-chunxlin Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

bnemanich Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

amd-chunxlin Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

amd-chunxlin commented Feb 16, 2026 •

edited

Loading