Skip to content

[hipblaslt] Fixing build issues for gfx_950_mx_rebase#4465

Merged
NineKa merged 33 commits into
gfx950_mx_rebasefrom
users/hongjche/gfx950_mx_rebase_build
Feb 25, 2026
Merged

[hipblaslt] Fixing build issues for gfx_950_mx_rebase#4465
NineKa merged 33 commits into
gfx950_mx_rebasefrom
users/hongjche/gfx950_mx_rebase_build

Conversation

@NineKa
Copy link
Copy Markdown
Contributor

@NineKa NineKa commented Feb 10, 2026

Motivation

Solving various issues blocking hipblaslt from built

Technical Details

Solving various issues blocking hipblaslt from built

Submission Checklist

@NineKa NineKa requested a review from amd-chunxlin February 10, 2026 17:47
@NineKa NineKa self-assigned this Feb 10, 2026
@NineKa NineKa marked this pull request as ready for review February 10, 2026 19:40
@NineKa NineKa requested a review from a team as a code owner February 10, 2026 19:40
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/Common/Parallel.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/Common/Parallel.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py
Comment thread projects/hipblaslt/tensilelite/Tensile/KernelWriterAssembly.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/KernelWriterAssembly.py Outdated
@NineKa NineKa force-pushed the users/hongjche/gfx950_mx_rebase_build branch from ccc586c to 72db936 Compare February 19, 2026 20:58
@NineKa NineKa force-pushed the users/hongjche/gfx950_mx_rebase_build branch from 72db936 to 3c1f207 Compare February 19, 2026 21:01
@NineKa
Copy link
Copy Markdown
Contributor Author

NineKa commented Feb 19, 2026

Build verified for a455dcb with install.sh -c -a gfx950

Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
@NineKa
Copy link
Copy Markdown
Contributor Author

NineKa commented Feb 23, 2026

build verified for 3ef482e

Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
@NineKa NineKa requested a review from nakajee February 24, 2026 17:42
@NineKa NineKa force-pushed the users/hongjche/gfx950_mx_rebase_build branch from 7f724a9 to f720d64 Compare February 24, 2026 18:03
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py
Comment thread projects/hipblaslt/tensilelite/Tensile/Components/LraTileAssignment.py Outdated
@NineKa NineKa requested a review from nakajee February 25, 2026 00:05
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py
else:
if state["ProblemType"]["Sparse"] and state["MIInputPerThread"] * state["ProblemType"]["MacDataTypeA"].numBytes() > 16:
autoLRVWA = 1
if state["TransposeLDS"] and (not state["DirectToLds"]):
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Better to use "DirectToLdsA" if the check is for A only.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another thing is why we need to check if it is DirectToLds or not here?
No need to change LocalReadVectorWidth depending on whether it is DirectToLds or not.

Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
Comment thread projects/hipblaslt/tensilelite/Tensile/SolutionStructs/Solution.py Outdated
ldsNumBytesA, ldsNumBytesAlignedA, ldsNumBytesB, ldsNumBytesAlignedB, ldsNumBytesMetadata, ldsNumBytesAlignedMetadata, \
ldsNumBytesMXSA, ldsNumBytesAlignedMXSA, ldsNumBytesMXSB, ldsNumBytesAlignedMXSB \
= calcLdsNumBytes(padA, ldsBlockSizePerPadA, padB, ldsBlockSizePerPadB)
if (ldsNumBytesAlignedA + ldsNumBytesAlignedB) > state["MaxLDS"]:
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should consider ldsNumBytesAlignedMXSA and ldsNumBytesAlignedMXSB here

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Probably, ldsNumBytesAlignedMetadata as well

@NineKa NineKa requested a review from nakajee February 25, 2026 16:35
Copy link
Copy Markdown
Contributor

@nakajee nakajee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good

@NineKa NineKa merged commit a3654aa into gfx950_mx_rebase Feb 25, 2026
4 of 5 checks passed
@NineKa NineKa deleted the users/hongjche/gfx950_mx_rebase_build branch February 25, 2026 17:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants