Cherry-pick 2 amd-llvm reverts for performance regressions by ronlieb · Pull Request #4013 · ROCm/TheRock

ronlieb · 2026-03-17T15:41:41Z

Compiler Submodule Update

Cherry-picks 2 revert commits from amd-llvm to address performance regressions.

Submodule Changes

| Submodule | Old | New |
| --- | --- | --- |
| amd-llvm | 376decc81 | 910937524 |

Cherry-picked Commits

| Commit | Description |
| --- | --- |
| 28700f4d2228 | Revert "AMDGPU: Fix runtime unrolling when cascaded GEPs present" - llama.cpp regression |
| 910937524277 | Revert SLP vectorizer external uses estimation - tree throttling issue |
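
The abbreviated hash in the submodule table can be matched against the full-length cherry-pick hashes by prefix. The following is a minimal sketch, assuming a plain prefix comparison is adequate here (git itself resolves abbreviations against the object database). `matches_abbrev` is a hypothetical helper for illustration, not part of TheRock tooling.

```python
def matches_abbrev(abbrev: str, full: str) -> bool:
    """Return True if `abbrev` is a case-insensitive prefix of `full`."""
    return full.lower().startswith(abbrev.lower())


# Hashes copied from the tables in this PR description.
cherry_picks = ["28700f4d2228", "910937524277"]
new_pointer = "910937524"

# Find which cherry-picked commit the new submodule pointer refers to.
tip = [c for c in cherry_picks if matches_abbrev(new_pointer, c)]
print(tip)
```

Here the new pointer resolves to the second cherry-picked commit, the SLP vectorizer revert.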

ronlieb · 2026-03-18T02:57:58Z

Please review; we'd like to land this sometime Wednesday.

lamb-j · 2026-03-18T05:50:44Z

@ronlieb, @searlmc1, @kzhuravl, do you think adding something like this to the GitHub PR description for our submodule updates would be useful, or just noise? I know this one is out of date after we rebase this PR (or do we even still need this PR after #3834?), but I'm just using it as an example.

Branch base date: 2026-02-17

Submodule updates

| Component | From | To |
| --- | --- | --- |
| amd-llvm | `c849bc16b0` | `91093752427` (amd-compiler-2026-06) |
| hipify | `05290949a8` | `86c76dc618` |
| spirv-llvm-translator | `3bceafa607` | `d575617fd4` |

llvm-project changes (5708 commits)

AMDGPU-specific:

Add gfx12-5-generic subtarget support
Add gfx1250 revision kernel note and B0-specific option
GFX1250 A0-specific patches
Asynchronous loads from global/buffer to LDS on pre-GFX12
Introduce asyncmark/wait intrinsics
GlobalISel RegBankLegalize rules for buffer load LDS and atomics

Compiler infrastructure:

Multiple merges from LLVM main into amd-staging
SLP vectorizer fixes and improvements
DAGCombiner crash fix
Loop vectorization: early exit loops with multiple exits
New llvm.looptrap intrinsic

hipify changes (1 commit)

Remove unlicensed files

spirv-llvm-translator changes (24 commits)

Upstream merges and LLVM API updates
Bug fixes: FMA, MergeBlock, LoopMerge, vload/vstore

Cherry-picked commits (after branch base date)

| Commit | Date | Description |
| --- | --- | --- |
| `910937524277` | 2026-03-17 | revert [SLP] external uses estimations |
| `9868d54e96fb` | 2026-03-08 | regen lit test for cluster.load.async.to.lds |
| `9760a5ffbf77` | 2026-03-05 | [AMDGPU] Add .gfx1250_revision kernel note |
| `080153b39b5b` | 2026-03-04 | [AMDGPU] Add -amdgpu-gfx1250-b0-specific option |
| `fcc41f00ef54` | 2026-02-27 | [SLP] Reject duplicate shift amounts |
| `28700f4d2228` | 2026-02-26 | Revert AMDGPU runtime unrolling fix |
| `739776a9841d` | 2026-02-25 | [AMDGPU] Add gfx12-5-generic subtarget |
| `65b91fabafc5` | 2026-02-19 | [GFX1250] A0-specific patches |
| `376decc81273` | 2026-02-17 | Revert [IndVarsSimplify] sinkUnusedInvariants |

Patches removed (now upstreamed)

0001-Ensure-to-use-libamdhip64-with-major-version.patch
0009-Add-gcc-toolset-13-prefix-detection.patch
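
A "Submodule updates" table like the one in the example above could be rendered from a small list of records instead of being written by hand. The following is a rough sketch only: `SubmoduleBump` and `render_table` are hypothetical names, and the hashes are copied from the example rather than read from a repository.

```python
from dataclasses import dataclass


@dataclass
class SubmoduleBump:
    """One submodule pointer change (hypothetical record type)."""
    name: str
    old: str
    new: str
    note: str = ""  # optional label, e.g. a branch or tag name


def render_table(bumps):
    """Render the bumps as a markdown table for a PR description."""
    lines = ["| Component | From | To |", "| --- | --- | --- |"]
    for b in bumps:
        to = f"`{b.new}`" + (f" ({b.note})" if b.note else "")
        lines.append(f"| {b.name} | `{b.old}` | {to} |")
    return "\n".join(lines)


bumps = [
    SubmoduleBump("amd-llvm", "c849bc16b0", "91093752427", "amd-compiler-2026-06"),
    SubmoduleBump("hipify", "05290949a8", "86c76dc618"),
    SubmoduleBump("spirv-llvm-translator", "3bceafa607", "d575617fd4"),
]
table = render_table(bumps)
print(table)
```

In practice the old and new hashes would come from the submodule diff of the PR; that step is left out here because it depends on the repository layout.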

marbre

There is a merge conflict that needs to be resolved first.

ronlieb · 2026-03-18T10:40:27Z

this looks awesome, thanks. hoping we land this one today

marbre · 2026-03-18T12:07:48Z

> this looks awesome, thanks. hoping we land this one today

We can't without resolving the conflict first :(

ronlieb · 2026-03-18T12:10:23Z

> > this looks awesome, thanks. hoping we land this one today
>
> We can't without resolving the conflict first :(

Ack

amd-llvm: 376decc81 -> 910937524

Cherry-picked commits:

1. Revert AMDGPU runtime unrolling fix for cascaded GEPs (#183641)
   Addresses llama.cpp performance regression
2. Revert SLP vectorizer external uses estimation (fc648683cd75)
   Reverts tree throttling estimation changes

Co-Authored-By: Claude <noreply@anthropic.com>

ronlieb requested review from SyamaAmd, kzhuravl, lajagapp, lamb-j and searlmc1 March 17, 2026 15:41

ronlieb requested review from ScottTodd and marbre as code owners March 17, 2026 15:41

github-project-automation Bot added this to TheRock Triage Mar 17, 2026

github-project-automation Bot moved this to TODO in TheRock Triage Mar 17, 2026

ronlieb requested a review from macurtis-amd March 17, 2026 15:44

marbre reviewed Mar 18, 2026


lamb-j force-pushed the amd/dev/rlieberm/SMPbumpWW06-7.13-2prs branch from 6a154ff to 4f92953 Compare March 18, 2026 15:27

lamb-j force-pushed the amd/dev/rlieberm/SMPbumpWW06-7.13-2prs branch from 4f92953 to ac68de6 Compare March 18, 2026 15:34

lamb-j changed the title ~~update submodule pointer for amd-llvm:amd-compiler-2026-06 91093752427~~ Cherry-pick 2 amd-llvm reverts for performance regressions Mar 18, 2026

lamb-j approved these changes Mar 18, 2026


ronlieb merged commit 376a534 into main Mar 18, 2026
173 of 181 checks passed

github-project-automation Bot moved this from TODO to Done in TheRock Triage Mar 18, 2026

ronlieb deleted the amd/dev/rlieberm/SMPbumpWW06-7.13-2prs branch March 18, 2026 20:21
