Bump IREE requirement pins to 3.8.0rc20250923 by shark-pr-automator[bot] · Pull Request #2205 · nod-ai/amd-shark-ai

shark-pr-automator · 2025-09-09T03:16:10Z

Diff: iree-org/iree@iree-3.7.0rc20250828...iree-3.8.0rc20250923

Auto-generated by GitHub Actions using .github/workflows/update_iree_requirement_pins.yml.
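A bump of this kind amounts to rewriting the IREE version pins in the requirements files. The sketch below illustrates the idea only; it is not the logic of `update_iree_requirement_pins.yml`. The `==` pin style, the helper name `bump_iree_pins`, and the sample requirements text are assumptions made for the example.

```python
import re

# Matches a whole requirements line of the form "iree-<name>==<version>".
# Assumption: pins use "==" and the package names start with "iree-".
PIN_RE = re.compile(r"^(iree-[a-z0-9-]+)==\S+$", re.MULTILINE)


def bump_iree_pins(requirements_text: str, new_version: str) -> str:
    """Return the requirements text with every iree-* pin set to new_version."""
    return PIN_RE.sub(lambda m: f"{m.group(1)}=={new_version}", requirements_text)


if __name__ == "__main__":
    # Hypothetical file contents, for illustration only.
    old = (
        "iree-base-compiler==3.7.0rc20250828\n"
        "iree-base-runtime==3.7.0rc20250828\n"
        "numpy==1.26.4\n"
    )
    print(bump_iree_pins(old, "3.8.0rc20250923"))
```

Lines that do not start with `iree-` are left untouched, so unrelated pins keep their versions.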

Picks up the specialized microkernels for gfx950.
Refer to this IREE commit: iree-org/iree@a523efe

Test results on gfx950 show that adding the following option to iree-compile gives better performance:
--iree-hip-enable-tensor-ukernels
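As a rough sketch of how the option might be passed, the helper below assembles an `iree-compile` command line with the flag switched on or off. Only `--iree-hip-enable-tensor-ukernels` comes from this PR; the other flags (`--iree-hal-target-device=hip`, `--iree-hip-target=gfx950`), the file names, and the helper name `build_iree_compile_cmd` are assumptions for the example, and the command is printed rather than executed.

```python
import shlex


def build_iree_compile_cmd(mlir_path, vmfb_path, hip_target="gfx950",
                           tensor_ukernels=True):
    """Build an iree-compile argument list for a HIP target."""
    cmd = [
        "iree-compile",
        mlir_path,
        "--iree-hal-target-device=hip",      # assumed target-device flag
        f"--iree-hip-target={hip_target}",   # assumed GPU architecture flag
        "-o",
        vmfb_path,
    ]
    if tensor_ukernels:
        # The option this PR reports as improving performance on gfx950.
        cmd.append("--iree-hip-enable-tensor-ukernels")
    return cmd


if __name__ == "__main__":
    # Hypothetical input and output paths.
    print(shlex.join(build_iree_compile_cmd("model.mlir", "model.vmfb")))
```

Keeping the flag behind a boolean makes it easy to compile the same model both ways and compare timings.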

github-actions · 2025-09-09T03:24:51Z

Coverage report

Coverage table (columns: File, Statements, Missing, Coverage, Coverage (new stmts), Lines missing; rows: sharktank, conftest.py, Project Total); the values were not captured.

This report was generated by python-coverage-comment-action.

dezhiAmd · 2025-09-16T15:45:24Z

I am working on the bump to 3.8.0rc20250909.
We need to disable a few llvm-compile-target-as-cpu tests; please refer to #2242.

BTW, what fixes or new features are in 3.8.0rc20250910?

amd-vivekag · 2025-09-17T06:47:40Z

> I am working on the bump to 3.8.0rc20250909. We need to disable a few llvm-compile-target-as-cpu tests; please refer to #2242.
>
> BTW, what fixes or new features are in 3.8.0rc20250910?

Hi @dezhiAmd, have you made any progress on this? I hadn't heard from you since last night, so I also looked into it today and found that 2 more toyLlama tests started failing with the following PR commit: iree-org/iree#21851

I'm adding xfail on those 2 testcases and will try to update the PR with the latest IREE build.

amd-vivekag · 2025-09-17T08:52:33Z

XFailed 2 toy llama testcases and filed an issue for that: iree-org/iree#22015
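For readers unfamiliar with the mechanism, a temporary xfail tied to a tracking issue can look roughly like the sketch below. The test name, the stand-in helper, and the reason text are hypothetical and do not reproduce the actual sharktank tests; only the issue reference comes from this thread.

```python
import pytest

# Tracking issue mentioned in this thread.
IREE_ISSUE = "https://github.com/iree-org/iree/issues/22015"


def compile_and_run_toy_llama():
    """Hypothetical stand-in for the real compile-and-run step."""
    raise RuntimeError("simulated compiler regression")


# strict=False: an unexpected pass is reported as XPASS instead of failing CI,
# which signals that the marker can be removed once the upstream fix lands.
@pytest.mark.xfail(reason=f"Regressed after IREE bump; see {IREE_ISSUE}",
                   strict=False)
def test_toy_llama_decode():
    assert compile_and_run_toy_llama() == "expected"
```

Putting the issue link in `reason` keeps the marker traceable, so it is easy to find and revert once the regression is fixed.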

amd-vivekag · 2025-09-17T09:20:44Z

@dezhiAmd I just noticed that you have already worked on the xfailing testcases and created issues for them in the following PR: #2242

I'll let you continue with that now and won't make any further changes. We can close this PR once your PR is merged.

Alex-Vasile · 2025-09-17T10:48:24Z

We should be doing our IREE bumps using these automated PRs. They ensure we don't forget to change one of the files, and they use a standardized naming scheme, which makes them easy to find later.

Additionally, the automation will only run if this branch does not exist. We have done manual bumps before and forgotten to delete this branch, which left us on an old IREE pin for weeks before we realized the issue.
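The branch guard described above can be pictured as a check on the remote's branch list. The sketch below is an illustration, not the workflow's actual implementation: it parses text in the format printed by `git ls-remote --heads origin` (one `<sha><TAB><ref>` pair per line), and the helper names and sample SHA are hypothetical.

```python
def branch_exists(ls_remote_output: str, branch: str) -> bool:
    """Return True if `branch` appears in `git ls-remote --heads` output."""
    ref = f"refs/heads/{branch}"
    for line in ls_remote_output.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[1] == ref:
            return True
    return False


def should_open_bump_pr(ls_remote_output: str,
                        branch: str = "integrates/iree") -> bool:
    """Open a new automated bump only when the branch is absent."""
    return not branch_exists(ls_remote_output, branch)


if __name__ == "__main__":
    # Hypothetical remote state with a stale bump branch left behind.
    stale = (
        "1111111111111111111111111111111111111111\trefs/heads/main\n"
        "2222222222222222222222222222222222222222\trefs/heads/integrates/iree\n"
    )
    print(should_open_bump_pr(stale))
```

With a leftover `integrates/iree` branch the guard returns `False`, which matches the failure mode described: no new bump is opened until the branch is deleted.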

amd-vivekag · 2025-09-17T11:03:16Z

> We should be doing our IREE bumps using these automated PRs. They ensure we don't forget to change one of the files, and they use a standardized naming scheme, which makes them easy to find later.
>
> Additionally, the automation will only run if this branch does not exist. We have done manual bumps before and forgotten to delete this branch, which left us on an old IREE pin for weeks before we realized the issue.

In that case, I can continue on this PR and will try to merge it.

amd-vivekag · 2025-09-17T12:46:23Z

Somehow the smoke tests are failing in CI but passing in my local run. Any suggestions on what I should do locally to reproduce the issue?

dezhiAmd · 2025-09-17T21:44:16Z

> Somehow the smoke tests are failing in CI but passing in my local run. Any suggestions on what I should do locally to reproduce the issue?

I reproduced the issue locally and reported the IREE issue here: iree-org/iree#22026

dezhiAmd · 2025-09-17T22:00:52Z

> I am working on the bump to 3.8.0rc20250909. We need to disable a few llvm-compile-target-as-cpu tests; please refer to #2242.
>
> BTW, what fixes or new features are in 3.8.0rc20250910?

Also, I am curious about the scenarios where compiling MLIR to a VMFB for a CPU target would be beneficial. From my understanding, AMD's strengths lie in GPU hardware, and AI inference workloads are typically GPU-accelerated. So I'm trying to better understand the rationale or use cases behind targeting the CPU in this context.

amd-vivekag · 2025-09-18T05:35:50Z

We can revert the changes to the toy llama testcase because the failure has been fixed by PR: iree-org/iree#22010

amd-vivekag · 2025-09-18T06:21:48Z

> I am working on the bump to 3.8.0rc20250909. We need to disable a few llvm-compile-target-as-cpu tests; please refer to #2242.
> BTW, what fixes or new features are in 3.8.0rc20250910?
>
> Also, I am curious about the scenarios where compiling MLIR to a VMFB for a CPU target would be beneficial. From my understanding, AMD's strengths lie in GPU hardware, and AI inference workloads are typically GPU-accelerated. So I'm trying to better understand the rationale or use cases behind targeting the CPU in this context.

As far as I know, the rationale behind supporting CPU is that the complete workflow should support both CPU and GPU, because there are a lot of lightweight models that can be run on CPU.

dezhiAmd · 2025-09-18T22:26:11Z

The failure of CI - sharktank / Unit Tests (linux-mi325-8gpu-ossci-nod-ai, 3.12, 2.6.0) is caused by this IREE issue: https://github.com/iree-org/iree/issues/22018

archana-ramalingam · 2025-09-19T18:58:54Z

> The failure of CI - sharktank / Unit Tests (linux-mi325-8gpu-ossci-nod-ai, 3.12, 2.6.0) is caused by this IREE issue: https://github.com/iree-org/iree/issues/22018

Do we wait for this issue to be fixed or plan on xfailing and landing this for now? Preferably the latter, as long as it's only impacting toy llama.

amd-vivekag · 2025-09-22T09:11:18Z

> > The failure of CI - sharktank / Unit Tests (linux-mi325-8gpu-ossci-nod-ai, 3.12, 2.6.0) is caused by this IREE issue: https://github.com/iree-org/iree/issues/22018
>
> Do we wait for this issue to be fixed or plan on xfailing and landing this for now? Preferably the latter, as long as it's only impacting toy llama.

It seems the error has changed, so I have created another issue for it: iree-org/iree#22055
I'm not sure if xfailing the testcase is the right approach.

@rsuderman, @IanNod please share your thoughts.

Diff: iree-org/iree@iree-3.7.0rc20250828...iree-3.8.0rc20250923

Auto-generated by GitHub Actions using [`.github/workflows/update_iree_requirement_pins.yml`](https://github.com/nod-ai/shark-ai/blob/main/.github/workflows/update_iree_requirement_pins.yml).

pickup specialization microkernels for gfx950.
Refer to this IREE [commit](iree-org/iree@a523efe)

Test result on gfx950 shows including the below compiling option when using iree-compile get better performance:
--iree-hip-enable-tensor-ukernels

---------

Signed-off-by: dezhliao <dezhi.liao@amd.com>
Signed-off-by: dezhliao <dezhliao@amd.com>
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: dezhliao <dezhi.liao@amd.com>
Co-authored-by: dezhliao <dezhliao@amd.com>
Co-authored-by: shark-pr-automator[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: archana-ramalingam <archana.ramalingam@amd.com>
Co-authored-by: Archana Ramalingam <98564406+archana-ramalingam@users.noreply.github.com>
Co-authored-by: Vivek Agrawal <197589114+amd-vivekag@users.noreply.github.com>

archana-ramalingam changed the title ~~Bump IREE requirement pins to 3.7.1~~ Bump IREE requirement pins to 3.8.0rc20250910 Sep 10, 2025

archana-ramalingam requested a review from IanNod September 11, 2025 05:49

dezhiAmd and others added 3 commits September 12, 2025 14:07

test 3.8.0rc20250909

919fa63

Signed-off-by: dezhliao <dezhi.liao@amd.com>

writing a temporary xfail marker for the TestToyLlamaIree::testDecode…

86b60c6

…Perplexity[False] tests Signed-off-by: dezhliao <dezhi.liao@amd.com>

ROCm 6.2 index only provides up to torch2.5.1, up ROCm to 6.4

218e6f4

Signed-off-by: dezhliao <dezhliao@amd.com>

amd-vivekag force-pushed the integrates/iree branch from 261ac00 to 0b5e6d7 Compare September 16, 2025 14:12

dezhiAmd and others added 6 commits September 16, 2025 10:23

resolve conflict

a919fdf

Signed-off-by: dezhliao <dezhi.liao@amd.com>

remove smoke_test on cpu, remove direct_to_batcher_test on cpu

b1dfc31

Signed-off-by: dezhliao <dezhi.liao@amd.com>

reformat

768fc33

Signed-off-by: dezhliao <dezhliao@amd.com>

remove added files by accident

8258cf2

Signed-off-by: dezhliao <dezhliao@amd.com>

Add details about xfail

4fa46ae

Signed-off-by: dezhliao <dezhi.liao@amd.com>

revert change to pytorch-rocm-requirements.txt

9bdf62d

Signed-off-by: dezhliao <dezhi.liao@amd.com>

github-actions Bot and others added 6 commits September 17, 2025 12:53

Bump IREE to 3.7.1.

7bfe5fb

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Fix iree versions

ad780c9

Enable extend attention for llama test

f64b1af

Update CMakeLists.txt

37bbbc3

Revert "Enable extend attention for llama test"

aa282a6

This reverts commit 7b60b61.

updates iree versions to 20250916 and xfail toy_llama testcase which …

c34fde3

…regressed

amd-vivekag force-pushed the integrates/iree branch from 9d09db1 to c34fde3 Compare September 17, 2025 07:23

Removes direct to batcher tests

cc2f7ec

amd-vivekag changed the title ~~Bump IREE requirement pins to 3.8.0rc20250910~~ Bump IREE requirement pins to 3.8.0rc20250916 Sep 17, 2025

amd-vivekag changed the title ~~Bump IREE requirement pins to 3.8.0rc20250916~~ Bump IREE requirement pins to 3.8.0rc20250917 Sep 17, 2025

remove files I added by accident

b0d5228

Signed-off-by: dezhliao <dezhliao@amd.com>

dezhiAmd mentioned this pull request Sep 17, 2025

pickup specialization microkernels for gfx950 3.8.0rc20250909 #2242

Closed

amd-vivekag force-pushed the integrates/iree branch from f08bb4e to b0d5228 Compare September 18, 2025 05:52

dezhiAmd changed the title ~~Bump IREE requirement pins to 3.8.0rc20250916~~ Bump IREE requirement pins to 3.8.0rc20250918 Sep 18, 2025

archana-ramalingam reviewed Sep 19, 2025

View reviewed changes

Comment thread .github/workflows/pkgci_shark_ai.yml Outdated

amd-vivekag mentioned this pull request Sep 22, 2025

error: function 'decode_bs4$async_dispatch_26_matmul_like_4x1536x32_f32' uses 74896 bytes of shared memory; exceeded the limit of 65536 bytes iree-org/iree#22055

Closed

amd-vivekag mentioned this pull request Sep 22, 2025

[Dispatch Creation] Rework dispatch formation logic iree-org/iree#21854

Merged

dezhiAmd added 4 commits September 22, 2025 18:32

bump iree to 20250922

fbed7e6

Signed-off-by: dezhliao <dezhliao@amd.com>

Merge branch 'main' into bump

52f143e

Merge branch 'main' into bump_new

2d7134a

Merge branch 'bump_new' into bump

70b5be9

dezhiAmd force-pushed the integrates/iree branch from 675de40 to 70b5be9 Compare September 22, 2025 18:38

Add other changes

6359ba5

Signed-off-by: dezhliao <dezhliao@amd.com>

dezhiAmd changed the title ~~Bump IREE requirement pins to 3.8.0rc20250918~~ Bump IREE requirement pins to 3.8.0rc20250922 Sep 22, 2025

amd-vivekag added 2 commits September 23, 2025 06:46

moves iree version to 0923

772e667

Merge branch 'main' into integrates/iree

3879dbf

amd-vivekag changed the title ~~Bump IREE requirement pins to 3.8.0rc20250922~~ Bump IREE requirement pins to 3.8.0rc20250923 Sep 23, 2025

amd-vivekag requested review from archana-ramalingam and pdhirajkumarprasad September 23, 2025 12:20

amd-vivekag approved these changes Sep 23, 2025

View reviewed changes

amd-vivekag merged commit 17210d9 into main Sep 23, 2025
76 of 105 checks passed

amd-vivekag deleted the integrates/iree branch September 23, 2025 12:30

4 participants
