
Conversation

@anmyachev (Contributor) commented Jun 10, 2025

@anmyachev (Contributor, Author) commented:

@whitneywhtsang could you remind me how to disable this mode using an env var?

@whitneywhtsang (Contributor) commented:

> @whitneywhtsang could you remind me how to disable this mode using an env var?

By default, fast math is not enabled but allow contract is; it can be disabled with TRITON_INTEL_FAST_MATH=0 or TRITON_DEFAULT_FP_FUSION=0.
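
For context, here is a minimal, self-contained sketch of how these two environment variables could map onto the defaults described above. It is illustrative only: in the real backend the options are parsed elsewhere and handed to `triton_xpu.cc` as optional values, and the `envFlag` helper below is hypothetical.

```cpp
#include <cstdlib>
#include <iostream>
#include <optional>
#include <string>

// Hypothetical helper: read an environment variable as an optional boolean
// (unset -> nullopt, "0" -> false, any other value -> true).
static std::optional<bool> envFlag(const char *name) {
  const char *val = std::getenv(name);
  if (!val)
    return std::nullopt;
  return std::string(val) != "0";
}

int main() {
  std::optional<bool> fastMath = envFlag("TRITON_INTEL_FAST_MATH");
  std::optional<bool> enableFpFusion = envFlag("TRITON_DEFAULT_FP_FUSION");

  // Defaults as described above: fast math only when explicitly requested;
  // allow contract unless fp fusion is explicitly disabled or fast math
  // was set explicitly (in which case the fast-math setting wins).
  bool useFastMath = fastMath.value_or(false);
  bool allowContract =
      useFastMath || ((!enableFpFusion.has_value() || enableFpFusion.value()) &&
                      !fastMath.has_value());

  std::cout << "fast math: " << useFastMath
            << ", allow contract: " << allowContract << '\n';
  return 0;
}
```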

@whitneywhtsang (Contributor) commented:

Instead of reverting the SPV extension, can we do the change below?

diff --git a/third_party/intel/triton_xpu.cc b/third_party/intel/triton_xpu.cc
index 908d57c2e6..286eadd7f5 100644
--- a/third_party/intel/triton_xpu.cc
+++ b/third_party/intel/triton_xpu.cc
@@ -296,7 +296,7 @@ void init_triton_intel(py::module &&m) {
         if (auto *op = dyn_cast<FPMathOperator>(&inst)) {
           FastMathFlags FMF;
           // Default to allow contract when default fp fusion is not disabled.
-          if ((!enableFpFusion.has_value() || enableFpFusion.value()) &&
+          if ((enableFpFusion.has_value() && enableFpFusion.value()) &&
               !fastMath.has_value()) {
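
To make the difference concrete, here is a small standalone sketch (assuming only that `enableFpFusion` behaves like a `std::optional<bool>`, as in the diff above) that evaluates the current and suggested conditions for the three possible states of the option:

```cpp
#include <iostream>
#include <optional>

int main() {
  // The three possible states of enableFpFusion: unset, explicitly off, explicitly on.
  const std::optional<bool> states[] = {std::nullopt, false, true};
  const char *labels[] = {"unset", "false", "true"};

  for (int i = 0; i < 3; ++i) {
    const std::optional<bool> &enableFpFusion = states[i];
    // Current code: allow contract unless fp fusion is explicitly disabled.
    bool current = !enableFpFusion.has_value() || enableFpFusion.value();
    // Suggested change: allow contract only when fp fusion is explicitly enabled.
    bool suggested = enableFpFusion.has_value() && enableFpFusion.value();
    std::cout << "enableFpFusion=" << labels[i] << "  current=" << current
              << "  suggested=" << suggested << '\n';
  }
  return 0;
}
```

The two conditions differ only when the option is unset (i.e. when TRITON_DEFAULT_FP_FUSION is not set explicitly): the current code then allows contraction by default, while the suggested change would not.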

@anmyachev (Contributor, Author) commented Jun 11, 2025

> Instead of reverting the SPV extension, can we do the change below? […]

I'll try locally first.

Update: it works as well. However, we may not have to make this change, since without the freezing option one of the models passes: #4479 (comment). Let's look at the rest.

@anmyachev force-pushed the amyachev/test-e2e branch from 9662132 to 87e68ae on June 12, 2025 17:59
@anmyachev changed the base branch from main to release/3.4.x on June 12, 2025 18:00
@anmyachev force-pushed the amyachev/test-e2e branch 2 times, most recently from 6a9a54b to d299f9c on June 12, 2025 18:11
@anmyachev changed the base branch from release/3.4.x to main on June 13, 2025 08:52
@anmyachev force-pushed the amyachev/test-e2e branch from d299f9c to 2d8e1d2 on June 13, 2025 08:54
@anmyachev (Contributor, Author) commented:

> Instead of reverting the SPV extension, can we do the change below? […]

@whitneywhtsang according to the results from the Inductor tests (https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/15630616934/job/44033895910), it's not an option.

@alexbaden (Contributor) commented:

What is the impact of this change on our micro benchmarks?

@whitneywhtsang (Contributor) commented:

> @whitneywhtsang according to the results from the Inductor tests (https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/15630616934/job/44033895910), it's not an option.

The reversal of 353d6ff and the suggested change should behave the same unless TRITON_DEFAULT_FP_FUSION is set explicitly.
Do those test cases only fail with the suggested change?

@anmyachev (Contributor, Author) commented:

> What is the impact of this change on our micro benchmarks?

I don't know.

> Do those test cases only fail with the suggested change?

It looks like we have a regression regardless of these changes, between https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/15632063117 and https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/15607709679.

@anmyachev (Contributor, Author) commented:

> What is the impact of this change on our micro benchmarks?

Should we run benchmarks to get information?

@anmyachev force-pushed the amyachev/test-e2e branch from 2d8e1d2 to 0bf0876 on June 13, 2025 15:47
@anmyachev force-pushed the amyachev/test-e2e branch from 0bf0876 to de86a60 on June 13, 2025 18:05
@anmyachev (Contributor, Author) commented:

Benchmarks run: https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/15641089963. I'm probably done for today. If you can confirm there are no regressions, we can merge it today, since there is no longer a regression in the Inductor tests.

@anmyachev marked this pull request as ready for review on June 13, 2025 18:10
@whitneywhtsang (Contributor) commented:

Started another one https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/15641240442 with special tag pr-4473, so it will be easier to use Grafana to check for performance impact later.

@whitneywhtsang (Contributor) left a review comment:

Please create an issue to track reverting this change.
The change LGTM assuming no performance degradation.

@whitneywhtsang merged commit 38a1984 into main on June 15, 2025
33 of 34 checks passed
@whitneywhtsang deleted the amyachev/test-e2e branch on June 15, 2025 20:37
@anmyachev (Contributor, Author) commented:

> Please create an issue to track reverting this change. The change LGTM assuming no performance degradation.

#4514

david-hls pushed a commit to david-hls/intel-xpu-backend-for-triton that referenced this pull request Jun 18, 2025
Previously we only filtered leaf nodes. This PR improves the filter function by supporting filtering of both internal and leaf nodes.

`-i` finds frames that match the given regular expression and returns *all nodes* on the paths that pass through the matching frames.

`-e` excludes frames that match the given regular expression and their
children.
chuanqi129 pushed a commit that referenced this pull request Jun 20, 2025
@anmyachev restored the amyachev/test-e2e branch on June 20, 2025 12:42
whitneywhtsang added a commit that referenced this pull request Jun 25, 2025
anmyachev added a commit that referenced this pull request Jun 26, 2025
anmyachev added a commit that referenced this pull request Jul 22, 2025
…4473)" (#4576)

This reverts commit 38a1984.

Known cases of accuracy impact for the following models: detectron2 and doctr_reco_predictor from #4412 on PVC, and LayoutLMForSequenceClassification from #4509 on ARL.

---------

Signed-off-by: Anatoly Myachev <[email protected]>