[VPlan] Don't drop NUW flag on tail folded canonical IVs by lukel97 · Pull Request #183301 · llvm/llvm-project

lukel97 · 2026-02-25T13:56:18Z

After #183080 vscale can no longer be a non-power of 2, which means the canonical IV can't overflow with tail folding w/ scalable vectors anymore. Therefore we don't need to drop the NUW flag.

IVUpdateMayOverflow is left to be removed in a separate PR since it removes further runtime checks.

After llvm#183080 vscale can no longer be a power of 2, which means the canonical IV can't overflow with tail folding w/ scalable vectors anymore. Therefore we don't need to drop the NUW flag. IVUpdateMayOverflow is left to be removed in a separate PR since it removes further runtime checks.

llvmbot · 2026-02-25T13:56:54Z

@llvm/pr-subscribers-backend-powerpc
@llvm/pr-subscribers-vectorizers
@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-backend-risc-v

Author: Luke Lau (lukel97)

Changes

After #183080 vscale can no longer be a power of 2, which means the canonical IV can't overflow with tail folding w/ scalable vectors anymore. Therefore we don't need to drop the NUW flag.

IVUpdateMayOverflow is left to be removed in a separate PR since it removes further runtime checks.

Patch is 177.50 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/183301.diff

46 Files Affected:

(modified) llvm/lib/Transforms/Vectorize/LoopVectorize.cpp (-15)
(modified) llvm/test/Transforms/LoopVectorize/AArch64/induction-costs-sve.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-epilogue.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-neon.ll (+3-3)
(modified) llvm/test/Transforms/LoopVectorize/AArch64/tail-folding-styles.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/bf16.ll (+5-5)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/blocks-with-dead-instructions.ll (+4-4)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/dead-ops-cost.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/evl-compatible-loops.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/f16.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/induction-costs.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/inloop-reduction.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/predicated-reverse-store.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/reductions.ll (+36-36)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/select-cmp-reduction.ll (+17-17)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-call-intrinsics.ll (+18-18)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-cast-intrinsics.ll (+22-22)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-complex-mask.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-cond-reduction.ll (+8-8)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-fixed-order-recurrence.ll (+10-10)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-inloop-reduction.ll (+14-14)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-interleave.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-intermediate-store.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-iv32.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-known-no-overflow.ll (+4-4)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-masked-loadstore.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-ordered-reduction.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-reduction.ll (+14-14)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/tail-folding-uniform-store.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/RISCV/vectorize-vp-intrinsics.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/X86/divs-with-tail-folding.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/X86/tail_loop_folding.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/X86/vectorize-force-tail-with-evl.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/dont-fold-tail-for-divisible-TC.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/first-order-recurrence-tail-folding.ll (+9-9)
(modified) llvm/test/Transforms/LoopVectorize/fmax-without-fast-math-flags-interleave.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/no-fold-tail-by-masking-iv-external-uses.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/reduction-inloop-uf4.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/scalable-predication.ll (+2-2)
(modified) llvm/test/Transforms/LoopVectorize/store-reduction-results-in-tail-folded-loop.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/tail-folding-iv-outside-user.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/tail-folding-optimize-vector-induction-width.ll (+3-3)
(modified) llvm/test/Transforms/LoopVectorize/tail-folding-vectorization-factor-1.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/use-scalar-epilogue-if-tp-fails.ll (+2-2)

diff --git a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
index 2342c8bfa502e..4eccf79afad51 100644
--- a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+++ b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
@@ -8201,22 +8201,7 @@ VPlanPtr LoopVectorizationPlanner::tryToBuildVPlanWithVPRecipes(
     IVUpdateMayOverflow |= !isIndvarOverflowCheckKnownFalse(&CM, VF);
 
   TailFoldingStyle Style = CM.getTailFoldingStyle(IVUpdateMayOverflow);
-  // Use NUW for the induction increment if we proved that it won't overflow in
-  // the vector loop or when not folding the tail. In the later case, we know
-  // that the canonical induction increment will not overflow as the vector trip
-  // count is >= increment and a multiple of the increment.
   VPRegionBlock *LoopRegion = Plan->getVectorLoopRegion();
-  bool HasNUW = !IVUpdateMayOverflow || Style == TailFoldingStyle::None;
-  if (!HasNUW) {
-    auto *IVInc =
-        LoopRegion->getExitingBasicBlock()->getTerminator()->getOperand(0);
-    assert(match(IVInc,
-                 m_VPInstruction<Instruction::Add>(
-                     m_Specific(LoopRegion->getCanonicalIV()), m_VPValue())) &&
-           "Did not find the canonical IV increment");
-    cast<VPRecipeWithIRFlags>(IVInc)->dropPoisonGeneratingFlags();
-  }
-
   // ---------------------------------------------------------------------------
   // Pre-construction: record ingredients whose recipes we'll need to further
   // process after constructing the initial VPlan.
diff --git a/llvm/test/Transforms/LoopVectorize/AArch64/induction-costs-sve.ll b/llvm/test/Transforms/LoopVectorize/AArch64/induction-costs-sve.ll
index 9b4d07925a57b..e6f75cd814b22 100644
--- a/llvm/test/Transforms/LoopVectorize/AArch64/induction-costs-sve.ll
+++ b/llvm/test/Transforms/LoopVectorize/AArch64/induction-costs-sve.ll
@@ -618,7 +618,7 @@ define void @exit_cond_zext_iv(ptr %dst, i64 %N) {
 ; PRED-NEXT:    store i32 0, ptr [[TMP12]], align 8
 ; PRED-NEXT:    br label %[[PRED_STORE_CONTINUE5]]
 ; PRED:       [[PRED_STORE_CONTINUE5]]:
-; PRED-NEXT:    [[INDEX_NEXT]] = add i64 [[INDEX]], 2
+; PRED-NEXT:    [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 2
 ; PRED-NEXT:    [[TMP13:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
 ; PRED-NEXT:    br i1 [[TMP13]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP4:![0-9]+]]
 ; PRED:       [[MIDDLE_BLOCK]]:
diff --git a/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-epilogue.ll b/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-epilogue.ll
index 5c2c67337625a..5dc1d654c1600 100644
--- a/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-epilogue.ll
+++ b/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-epilogue.ll
@@ -416,7 +416,7 @@ define i32 @dotp_predicated(i64 %N, ptr %a, ptr %b) {
 ; CHECK-NEXT:    [[TMP179:%.*]] = mul nsw <16 x i32> [[TMP178]], [[TMP97]]
 ; CHECK-NEXT:    [[TMP180:%.*]] = select <16 x i1> [[TMP16]], <16 x i32> [[TMP179]], <16 x i32> zeroinitializer
 ; CHECK-NEXT:    [[PARTIAL_REDUCE]] = call <4 x i32> @llvm.vector.partial.reduce.add.v4i32.v16i32(<4 x i32> [[VEC_PHI]], <16 x i32> [[TMP180]])
-; CHECK-NEXT:    [[INDEX_NEXT]] = add i64 [[INDEX]], 16
+; CHECK-NEXT:    [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 16
 ; CHECK-NEXT:    [[VEC_IND_NEXT]] = add <16 x i64> [[VEC_IND]], splat (i64 16)
 ; CHECK-NEXT:    [[TMP181:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
 ; CHECK-NEXT:    br i1 [[TMP181]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP8:![0-9]+]]
diff --git a/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-neon.ll b/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-neon.ll
index ad92b56218bb5..d588aa06d7156 100644
--- a/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-neon.ll
+++ b/llvm/test/Transforms/LoopVectorize/AArch64/partial-reduce-dot-product-neon.ll
@@ -1227,7 +1227,7 @@ define i32 @dotp_predicated(i64 %N, ptr %a, ptr %b) {
 ; CHECK-INTERLEAVE1-NEXT:    [[TMP179:%.*]] = mul nsw <16 x i32> [[TMP178]], [[TMP97]]
 ; CHECK-INTERLEAVE1-NEXT:    [[TMP180:%.*]] = select <16 x i1> [[TMP16]], <16 x i32> [[TMP179]], <16 x i32> zeroinitializer
 ; CHECK-INTERLEAVE1-NEXT:    [[PARTIAL_REDUCE]] = call <4 x i32> @llvm.vector.partial.reduce.add.v4i32.v16i32(<4 x i32> [[VEC_PHI]], <16 x i32> [[TMP180]])
-; CHECK-INTERLEAVE1-NEXT:    [[INDEX_NEXT]] = add i64 [[INDEX]], 16
+; CHECK-INTERLEAVE1-NEXT:    [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 16
 ; CHECK-INTERLEAVE1-NEXT:    [[VEC_IND_NEXT]] = add <16 x i64> [[VEC_IND]], splat (i64 16)
 ; CHECK-INTERLEAVE1-NEXT:    [[TMP181:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
 ; CHECK-INTERLEAVE1-NEXT:    br i1 [[TMP181]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP8:![0-9]+]]
@@ -1483,7 +1483,7 @@ define i32 @dotp_predicated(i64 %N, ptr %a, ptr %b) {
 ; CHECK-INTERLEAVED-NEXT:    [[TMP179:%.*]] = mul nsw <16 x i32> [[TMP178]], [[TMP97]]
 ; CHECK-INTERLEAVED-NEXT:    [[TMP180:%.*]] = select <16 x i1> [[TMP16]], <16 x i32> [[TMP179]], <16 x i32> zeroinitializer
 ; CHECK-INTERLEAVED-NEXT:    [[PARTIAL_REDUCE]] = call <4 x i32> @llvm.vector.partial.reduce.add.v4i32.v16i32(<4 x i32> [[VEC_PHI]], <16 x i32> [[TMP180]])
-; CHECK-INTERLEAVED-NEXT:    [[INDEX_NEXT]] = add i64 [[INDEX]], 16
+; CHECK-INTERLEAVED-NEXT:    [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 16
 ; CHECK-INTERLEAVED-NEXT:    [[VEC_IND_NEXT]] = add <16 x i64> [[VEC_IND]], splat (i64 16)
 ; CHECK-INTERLEAVED-NEXT:    [[TMP181:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
 ; CHECK-INTERLEAVED-NEXT:    br i1 [[TMP181]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP8:![0-9]+]]
@@ -1739,7 +1739,7 @@ define i32 @dotp_predicated(i64 %N, ptr %a, ptr %b) {
 ; CHECK-MAXBW-NEXT:    [[TMP179:%.*]] = mul nsw <16 x i32> [[TMP178]], [[TMP97]]
 ; CHECK-MAXBW-NEXT:    [[TMP180:%.*]] = select <16 x i1> [[TMP16]], <16 x i32> [[TMP179]], <16 x i32> zeroinitializer
 ; CHECK-MAXBW-NEXT:    [[PARTIAL_REDUCE]] = call <4 x i32> @llvm.vector.partial.reduce.add.v4i32.v16i32(<4 x i32> [[VEC_PHI]], <16 x i32> [[TMP180]])
-; CHECK-MAXBW-NEXT:    [[INDEX_NEXT]] = add i64 [[INDEX]], 16
+; CHECK-MAXBW-NEXT:    [[INDEX_NEXT]] = add nuw i64 [[INDEX]], 16
 ; CHECK-MAXBW-NEXT:    [[VEC_IND_NEXT]] = add <16 x i64> [[VEC_IND]], splat (i64 16)
 ; CHECK-MAXBW-NEXT:    [[TMP181:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
 ; CHECK-MAXBW-NEXT:    br i1 [[TMP181]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP8:![0-9]+]]
diff --git a/llvm/test/Transforms/LoopVectorize/AArch64/tail-folding-styles.ll b/llvm/test/Transforms/LoopVectorize/AArch64/tail-folding-styles.ll
index b4a3921974d35..cde86eaf9f23d 100644
--- a/llvm/test/Transforms/LoopVectorize/AArch64/tail-folding-styles.ll
+++ b/llvm/test/Transforms/LoopVectorize/AArch64/tail-folding-styles.ll
@@ -67,7 +67,7 @@ define void @simple_memset_tailfold(i32 %val, ptr %ptr, i64 %n) "target-features
 ; DATA-NEXT:    [[ACTIVE_LANE_MASK:%.*]] = call <vscale x 4 x i1> @llvm.get.active.lane.mask.nxv4i1.i64(i64 [[INDEX1]], i64 [[UMAX]])
 ; DATA-NEXT:    [[TMP10:%.*]] = getelementptr i32, ptr [[PTR:%.*]], i64 [[INDEX1]]
 ; DATA-NEXT:    call void @llvm.masked.store.nxv4i32.p0(<vscale x 4 x i32> [[BROADCAST_SPLAT]], ptr align 4 [[TMP10]], <vscale x 4 x i1> [[ACTIVE_LANE_MASK]])
-; DATA-NEXT:    [[INDEX_NEXT2]] = add i64 [[INDEX1]], [[TMP5]]
+; DATA-NEXT:    [[INDEX_NEXT2]] = add nuw i64 [[INDEX1]], [[TMP5]]
 ; DATA-NEXT:    [[TMP14:%.*]] = icmp eq i64 [[INDEX_NEXT2]], [[N_VEC]]
 ; DATA-NEXT:    br i1 [[TMP14]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP0:![0-9]+]]
 ; DATA:       middle.block:
@@ -102,7 +102,7 @@ define void @simple_memset_tailfold(i32 %val, ptr %ptr, i64 %n) "target-features
 ; DATA_NO_LANEMASK-NEXT:    [[TMP12:%.*]] = icmp ule <vscale x 4 x i64> [[VEC_IV]], [[BROADCAST_SPLAT]]
 ; DATA_NO_LANEMASK-NEXT:    [[TMP13:%.*]] = getelementptr i32, ptr [[PTR:%.*]], i64 [[INDEX1]]
 ; DATA_NO_LANEMASK-NEXT:    call void @llvm.masked.store.nxv4i32.p0(<vscale x 4 x i32> [[BROADCAST_SPLAT5]], ptr align 4 [[TMP13]], <vscale x 4 x i1> [[TMP12]])
-; DATA_NO_LANEMASK-NEXT:    [[INDEX_NEXT6]] = add i64 [[INDEX1]], [[TMP5]]
+; DATA_NO_LANEMASK-NEXT:    [[INDEX_NEXT6]] = add nuw i64 [[INDEX1]], [[TMP5]]
 ; DATA_NO_LANEMASK-NEXT:    [[TMP17:%.*]] = icmp eq i64 [[INDEX_NEXT6]], [[N_VEC]]
 ; DATA_NO_LANEMASK-NEXT:    br i1 [[TMP17]], label [[MIDDLE_BLOCK:%.*]], label [[VECTOR_BODY]], !llvm.loop [[LOOP0:![0-9]+]]
 ; DATA_NO_LANEMASK:       middle.block:
diff --git a/llvm/test/Transforms/LoopVectorize/RISCV/bf16.ll b/llvm/test/Transforms/LoopVectorize/RISCV/bf16.ll
index 024194db39332..05e5a52408ea2 100644
--- a/llvm/test/Transforms/LoopVectorize/RISCV/bf16.ll
+++ b/llvm/test/Transforms/LoopVectorize/RISCV/bf16.ll
@@ -30,7 +30,7 @@ define void @fadd(ptr noalias %a, ptr noalias %b, i64 %n) {
 ; ZVFBFMIN:       [[VECTOR_PH]]:
 ; ZVFBFMIN-NEXT:    br label %[[VECTOR_BODY:.*]]
 ; ZVFBFMIN:       [[VECTOR_BODY]]:
-; ZVFBFMIN-NEXT:    [[TMP0:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; ZVFBFMIN-NEXT:    [[TMP0:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[CURRENT_ITERATION_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; ZVFBFMIN-NEXT:    [[AVL:%.*]] = phi i64 [ [[N]], %[[VECTOR_PH]] ], [ [[AVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; ZVFBFMIN-NEXT:    [[TMP6:%.*]] = call i32 @llvm.experimental.get.vector.length.i64(i64 [[AVL]], i32 8, i1 true)
 ; ZVFBFMIN-NEXT:    [[TMP1:%.*]] = getelementptr bfloat, ptr [[A]], i64 [[TMP0]]
@@ -40,7 +40,7 @@ define void @fadd(ptr noalias %a, ptr noalias %b, i64 %n) {
 ; ZVFBFMIN-NEXT:    [[TMP11:%.*]] = fadd <vscale x 8 x bfloat> [[WIDE_LOAD]], [[WIDE_LOAD1]]
 ; ZVFBFMIN-NEXT:    call void @llvm.vp.store.nxv8bf16.p0(<vscale x 8 x bfloat> [[TMP11]], ptr align 2 [[TMP1]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP6]])
 ; ZVFBFMIN-NEXT:    [[TMP13:%.*]] = zext i32 [[TMP6]] to i64
-; ZVFBFMIN-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP13]], [[TMP0]]
+; ZVFBFMIN-NEXT:    [[CURRENT_ITERATION_NEXT]] = add nuw i64 [[TMP13]], [[TMP0]]
 ; ZVFBFMIN-NEXT:    [[AVL_NEXT]] = sub nuw i64 [[AVL]], [[TMP13]]
 ; ZVFBFMIN-NEXT:    [[TMP7:%.*]] = icmp eq i64 [[AVL_NEXT]], 0
 ; ZVFBFMIN-NEXT:    br i1 [[TMP7]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP0:![0-9]+]]
@@ -122,7 +122,7 @@ define void @vfwmaccbf16.vv(ptr noalias %a, ptr noalias %b, ptr noalias %c, i64
 ; ZVFBFMIN:       [[VECTOR_PH]]:
 ; ZVFBFMIN-NEXT:    br label %[[VECTOR_BODY:.*]]
 ; ZVFBFMIN:       [[VECTOR_BODY]]:
-; ZVFBFMIN-NEXT:    [[TMP6:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; ZVFBFMIN-NEXT:    [[TMP6:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[CURRENT_ITERATION_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; ZVFBFMIN-NEXT:    [[AVL:%.*]] = phi i64 [ [[N]], %[[VECTOR_PH]] ], [ [[AVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; ZVFBFMIN-NEXT:    [[TMP11:%.*]] = call i32 @llvm.experimental.get.vector.length.i64(i64 [[AVL]], i32 4, i1 true)
 ; ZVFBFMIN-NEXT:    [[TMP7:%.*]] = getelementptr bfloat, ptr [[A]], i64 [[TMP6]]
@@ -136,10 +136,10 @@ define void @vfwmaccbf16.vv(ptr noalias %a, ptr noalias %b, ptr noalias %c, i64
 ; ZVFBFMIN-NEXT:    [[TMP15:%.*]] = call <vscale x 4 x float> @llvm.fmuladd.nxv4f32(<vscale x 4 x float> [[TMP13]], <vscale x 4 x float> [[TMP14]], <vscale x 4 x float> [[WIDE_LOAD2]])
 ; ZVFBFMIN-NEXT:    call void @llvm.vp.store.nxv4f32.p0(<vscale x 4 x float> [[TMP15]], ptr align 4 [[TMP9]], <vscale x 4 x i1> splat (i1 true), i32 [[TMP11]])
 ; ZVFBFMIN-NEXT:    [[TMP12:%.*]] = zext i32 [[TMP11]] to i64
-; ZVFBFMIN-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP12]], [[TMP6]]
+; ZVFBFMIN-NEXT:    [[CURRENT_ITERATION_NEXT]] = add nuw i64 [[TMP12]], [[TMP6]]
 ; ZVFBFMIN-NEXT:    [[AVL_NEXT]] = sub nuw i64 [[AVL]], [[TMP12]]
 ; ZVFBFMIN-NEXT:    [[TMP10:%.*]] = icmp eq i64 [[AVL_NEXT]], 0
-; ZVFBFMIN-NEXT:    br i1 [[TMP10]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP4:![0-9]+]]
+; ZVFBFMIN-NEXT:    br i1 [[TMP10]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP3:![0-9]+]]
 ; ZVFBFMIN:       [[MIDDLE_BLOCK]]:
 ; ZVFBFMIN-NEXT:    br label %[[EXIT:.*]]
 ; ZVFBFMIN:       [[EXIT]]:
diff --git a/llvm/test/Transforms/LoopVectorize/RISCV/blocks-with-dead-instructions.ll b/llvm/test/Transforms/LoopVectorize/RISCV/blocks-with-dead-instructions.ll
index 263c200c28801..69af9999a9bd0 100644
--- a/llvm/test/Transforms/LoopVectorize/RISCV/blocks-with-dead-instructions.ll
+++ b/llvm/test/Transforms/LoopVectorize/RISCV/blocks-with-dead-instructions.ll
@@ -386,7 +386,7 @@ define void @empty_block_with_phi_1(ptr %src, i64 %N) #0 {
 ; CHECK:       [[VECTOR_PH]]:
 ; CHECK-NEXT:    br label %[[VECTOR_BODY:.*]]
 ; CHECK:       [[VECTOR_BODY]]:
-; CHECK-NEXT:    [[TMP9:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; CHECK-NEXT:    [[TMP9:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[CURRENT_ITERATION_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; CHECK-NEXT:    [[AVL:%.*]] = phi i64 [ [[N]], %[[VECTOR_PH]] ], [ [[AVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; CHECK-NEXT:    [[TMP13:%.*]] = call i32 @llvm.experimental.get.vector.length.i64(i64 [[AVL]], i32 8, i1 true)
 ; CHECK-NEXT:    [[TMP10:%.*]] = getelementptr i16, ptr [[SRC]], i64 [[TMP9]]
@@ -395,7 +395,7 @@ define void @empty_block_with_phi_1(ptr %src, i64 %N) #0 {
 ; CHECK-NEXT:    [[PREDPHI:%.*]] = select <vscale x 8 x i1> [[TMP2]], <vscale x 8 x i16> splat (i16 99), <vscale x 8 x i16> [[VP_OP_LOAD]]
 ; CHECK-NEXT:    call void @llvm.vp.store.nxv8i16.p0(<vscale x 8 x i16> [[PREDPHI]], ptr align 2 [[TMP10]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP13]])
 ; CHECK-NEXT:    [[TMP11:%.*]] = zext i32 [[TMP13]] to i64
-; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP11]], [[TMP9]]
+; CHECK-NEXT:    [[CURRENT_ITERATION_NEXT]] = add nuw i64 [[TMP11]], [[TMP9]]
 ; CHECK-NEXT:    [[AVL_NEXT]] = sub nuw i64 [[AVL]], [[TMP11]]
 ; CHECK-NEXT:    [[TMP12:%.*]] = icmp eq i64 [[AVL_NEXT]], 0
 ; CHECK-NEXT:    br i1 [[TMP12]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP8:![0-9]+]]
@@ -438,7 +438,7 @@ define void @empty_block_with_phi_2(ptr %src, i64 %N) #0 {
 ; CHECK:       [[VECTOR_PH]]:
 ; CHECK-NEXT:    br label %[[VECTOR_BODY:.*]]
 ; CHECK:       [[VECTOR_BODY]]:
-; CHECK-NEXT:    [[TMP9:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; CHECK-NEXT:    [[TMP9:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[CURRENT_ITERATION_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; CHECK-NEXT:    [[AVL:%.*]] = phi i64 [ [[N]], %[[VECTOR_PH]] ], [ [[AVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; CHECK-NEXT:    [[TMP13:%.*]] = call i32 @llvm.experimental.get.vector.length.i64(i64 [[AVL]], i32 8, i1 true)
 ; CHECK-NEXT:    [[TMP10:%.*]] = getelementptr i16, ptr [[SRC]], i64 [[TMP9]]
@@ -447,7 +447,7 @@ define void @empty_block_with_phi_2(ptr %src, i64 %N) #0 {
 ; CHECK-NEXT:    [[PREDPHI:%.*]] = select <vscale x 8 x i1> [[TMP12]], <vscale x 8 x i16> [[WIDE_LOAD]], <vscale x 8 x i16> splat (i16 99)
 ; CHECK-NEXT:    call void @llvm.vp.store.nxv8i16.p0(<vscale x 8 x i16> [[PREDPHI]], ptr align 2 [[TMP10]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP13]])
 ; CHECK-NEXT:    [[TMP11:%.*]] = zext i32 [[TMP13]] to i64
-; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP11]], [[TMP9]]
+; CHECK-NEXT:    [[CURRENT_ITERATION_NEXT]] = add nuw i64 [[TMP11]], [[TMP9]]
 ; CHECK-NEXT:    [[AVL_NEXT]] = sub nuw i64 [[AVL]], [[TMP11]]
 ; CHECK-NEXT:    [[TMP15:%.*]] = icmp eq i64 [[AVL_NEXT]], 0
 ; CHECK-NEXT:    br i1 [[TMP15]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP9:![0-9]+]]
diff --git a/llvm/test/Transforms/LoopVectorize/RISCV/dead-ops-cost.ll b/llvm/test/Transforms/LoopVectorize/RISCV/dead-ops-cost.ll
index bb926ab181cb7..870f02fa9c762 100644
--- a/llvm/test/Transforms/LoopVectorize/RISCV/dead-ops-cost.ll
+++ b/llvm/test/Transforms/LoopVectorize/RISCV/dead-ops-cost.ll
@@ -349,7 +349,7 @@ define void @gather_interleave_group_with_dead_insert_pos(i64 %N, ptr noalias %s
 ; CHECK-NEXT:    [[TMP4:%.*]] = mul nsw <vscale x 4 x i64> [[TMP9]], splat (i64 2)
 ; CHECK-NEXT:    br label %[[VECTOR_BODY:.*]]
 ; CHECK:       [[VECTOR_BODY]]:
-; CHECK-NEXT:    [[EVL_BASED_IV:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; CHECK-NEXT:    [[EVL_BASED_IV:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[CURRENT_ITERATION_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; CHECK-NEXT:    [[VEC_IND:%.*]] = phi <vscale x 4 x i64> [ [[TMP4]], %[[VECTOR_PH]] ], [ [[VEC_IND_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; CHECK-NEXT:    [[AVL:%.*]] = phi i64 [ [[TMP2]], %[[VECTOR_PH]] ], [ [[AVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; CHECK-NEXT:    [[TMP10:%.*]] = call i32 @llvm.experimental.get.vector.length.i64(i64 [[AVL]], i32 4, i1 true)
@@ -367,7 +367,7 @@ define void @gather_interleave_group_with_dead_insert_pos(i64 %N, ptr noalias %s
 ; CHECK-NEXT:    [[TMP18:%.*]] = zext <vscale x 4 x i8> [[TMP17]] to <vscale x 4 x i32>
 ; CHECK-NEXT:    [[TMP19:%.*]] = getelementptr i32, ptr [[DST]], <vscale x 4 x i64> [[VEC_IND]]
 ; CHECK-NEXT:    call void @llvm.vp.scatter.nxv4i32.nxv4p0(<vscale x 4 x i32> [[TMP18]], <vscale x 4 x ptr> align 4 [[TMP19]], <vscale x 4 x i1> splat (i1 true), i32 [[TMP10]])
-; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP16]], [[EVL_BASED_IV]]
+; CHECK-NEXT:    [[CURRENT_ITERATION_NEXT]] = add nuw i64 [[TMP16]], [[EVL_BASED_IV]]
 ; CHECK-NEXT:    [[AVL_NEXT]] = sub nuw i64 [[AVL]], [[TMP16]]
 ; CHECK-NEXT:    [[VEC_IND_NEXT]] = add nsw <vscale x 4 x i64> [[VEC_IND]], [[BROADCAST_SPLAT]]
 ; CHECK-NEXT:    [[TMP21:%.*]] = icmp eq i64 [[AVL_NEXT]], 0
diff --git a/llvm/test/Transforms/LoopVectorize/RISCV/evl-compatible-loops.ll b/llvm/test/Transforms/LoopVectorize/RISCV/evl-compatible-loops.ll
index 7953ba30cf9a8..05fe5cacaabec 100644
--- a/llvm/test/Transforms/LoopVectorize/RISCV/evl-compatible-loops.ll
+++ b/llvm/test/Transforms/LoopVectorize/RISCV/evl-compatible-loops.ll
@@ -22,7 +22,7 @@ define void @test_wide_integer_induction(ptr noalias %a, i64 %N) {
 ; CHECK-NEXT:    [[BROADCAST_SPLAT:%.*]] = shufflevector <vscale x 2 x i64> [[BROADCAST_SPLATINSERT]], <vscale x 2 x i64> poison, <vscale x 2 x i32> zeroinitializer
 ; CHECK-NEXT:    [[TMP14:%.*]] = getelementptr inbounds i64, ptr [[A]], i64 [[EVL_BASED_IV]]
 ; CHECK-NEXT:    call void @llvm.vp.store.nxv2i64.p0(<vscale x 2 x i64> [[VEC_IND]], ptr align 8 [[TMP14]], <vscale x 2 x i1> splat (i1 true), i32 [[TMP11]])
-; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP12]], [[EVL_BASED_IV]]
+; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add nuw i64 [[TMP12]], [[EVL_BASED_IV]]
 ; CHECK-NEXT:    [[AVL_NEXT]] = sub nuw i64 [[AVL]], [[TMP12]]
 ; CHECK-NEXT:    [[VEC_IND_NEXT]] = add <vscale x 2 x i64> [[VEC_IND]], [[BROADCAST_SPLAT]]
 ; CHECK-NEXT:    [[TMP15:%.*]] = icmp eq i64 [[AVL_NEXT]], 0
@@ -66,7 +66,7 @@ define void @test_wide_ptr_induction(ptr noalias %a, ptr noalias %b, i64 %N) {
 ; CHECK-NEXT:    [[TMP8:%.*]] = getelementptr inbounds i64, ptr [[A]], i64 [[EVL_BASED_IV]]
 ; CHECK-NEXT:    call void @llvm.vp.store.nxv2p0.p0(<vscale x 2 x ptr> [[VECTOR_GEP]], ptr align 8 [[TMP8]], <vscale x 2 x i1> splat (i1 true), i32 [[TMP7]])
 ; CHECK-NEXT:    [[TMP9:%.*]] = zext i32 [[TMP7]] to i64
-; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP9]], [[EVL_BASED_IV]]
+; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add nuw i64 [[TMP9]], [[EVL_BASED_IV]]
 ; CHECK-NEXT:    [[AVL_NEXT]] = sub nuw i64 [[AVL]], [[TMP9]]
 ; CHECK-NEXT:    [[TMP10:%.*]] = shl i64 [[TMP9]], 3
 ; CHECK-NEXT:    [[PTR_IND]] = getelementptr i8, ptr [[POINTER_PHI]], i64 [[TMP10]]
diff --git a/llvm/test/Transforms/LoopVectorize/RISCV/f16.ll b/llvm/test/Transforms/LoopVectorize/RISCV/f16.ll
index 143a51dc811f1..963e634495e38 100644
--- a/llvm/test/Transforms/LoopVectorize/RISCV/f16.ll
+++ b/llvm/test/Transforms/LoopVectorize/RISCV/f16.ll
@@ -30,7 +30,7 @@ define void @fadd(ptr noalias %a, ptr noalias %b, i64 %n) {
 ; ZVFHMIN:       [[VECTOR_PH]]:
 ; ZVFHMIN-NEXT:    br label %[[VECTOR_BODY:.*]]
 ; ZVFHMIN:       [[VECTOR_BODY]]:
-; ZVFHMIN-NEXT:    [[INDEX:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; ZVFHMIN-NEXT:    [[INDEX:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[CURRENT_ITERATION_NEXT:%.*]], %[[VECTOR_BODY]] ]
 ; ...
[truncated]

github-actions · 2026-02-25T14:36:14Z

🐧 Linux x64 Test Results

170203 tests passed
3085 tests skipped
3 tests failed

Failed Tests

(click on a test name to see its output)

LLVM

LLVM.Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll

Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
/home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -passes=loop-vectorize -vplan-print-after="optimize$"  -force-vector-width=4 -disable-output 2>&1 < /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll | /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -passes=loop-vectorize '-vplan-print-after=optimize$' -force-vector-width=4 -disable-output
# note: command had no output on stdout or stderr
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll
# note: command had no output on stdout or stderr
# RUN: at line 4
/home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -passes=loop-vectorize -vplan-print-after="optimize$"  -force-vector-width=4 -prefer-predicate-over-epilogue=predicate-dont-vectorize  -disable-output 2>&1 < /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll | /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll --check-prefix=CHECK-TF
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -passes=loop-vectorize '-vplan-print-after=optimize$' -force-vector-width=4 -prefer-predicate-over-epilogue=predicate-dont-vectorize -disable-output
# note: command had no output on stdout or stderr
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll --check-prefix=CHECK-TF
# .---command stderr------------
# | /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll:123:18: error: CHECK-TF-NEXT: expected string not found in input
# | ; CHECK-TF-NEXT: EMIT vp<%index.next> = add vp<[[VP4]]>, vp<[[VP1]]>
# |                  ^
# | <stdin>:46:54: note: scanning from here
# |  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi>
# |                                                      ^
# | <stdin>:46:54: note: with "VP4" equal to "%4"
# |  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi>
# |                                                      ^
# | <stdin>:46:54: note: with "VP1" equal to "%1"
# |  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi>
# |                                                      ^
# | <stdin>:47:2: note: possible intended match here
# |  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1>
# |  ^
# | /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll:323:18: error: CHECK-TF-NEXT: expected string not found in input
# | ; CHECK-TF-NEXT: EMIT vp<%index.next> = add vp<[[VP4]]>, vp<[[VP1]]>
# |                  ^
# | <stdin>:143:55: note: scanning from here
# |  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi>
# |                                                       ^
# | <stdin>:143:55: note: with "VP4" equal to "%4"
# |  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi>
# |                                                       ^
# | <stdin>:143:55: note: with "VP1" equal to "%1"
# |  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi>
# |                                                       ^
# | <stdin>:144:2: note: possible intended match here
# |  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1>
# |  ^
# | 
# | Input file: <stdin>
# | Check file: /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |             .
# |             .
# |             .
# |            41:  loop.0: 
# |            42:  WIDEN ir<%select.cmp> = icmp slt ir<%a>, vp<%8> 
# |            43:  EMIT vp<%9> = logical-and vp<%6>, ir<%select.cmp> 
# |            44:  EMIT vp<%10> = any-of vp<%9> 
# |            45:  EMIT vp<%11> = select vp<%10>, vp<%9>, vp<%5> 
# |            46:  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi> 
# | next:123'0                                                          X error: no match found
# | next:123'1                                                            with "VP4" equal to "%4"
# | next:123'2                                                            with "VP1" equal to "%1"
# |            47:  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1> 
# | next:123'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:123'3      ?                                              possible intended match
# |            48:  EMIT branch-on-count vp<%index.next>, vp<%2> 
# | next:123'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            49:  No successors 
# | next:123'0     ~~~~~~~~~~~~~~~
# |            50: } 
# | next:123'0     ~~
# |            51: Successor(s): middle.block 
# | next:123'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            52:  
# | next:123'0     ~
# |             .
# |             .
# |             .
# |           138:  
# |           139:  if.then.1: 
# |           140:  EMIT vp<%11> = logical-and vp<%7>, vp<%9> 
# |           141:  EMIT vp<%12> = any-of vp<%11> 
# |           142:  EMIT vp<%13> = select vp<%12>, vp<%11>, vp<%5> 
# |           143:  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi> 
# | next:323'0                                                           X error: no match found
# | next:323'1                                                             with "VP4" equal to "%4"
# | next:323'2                                                             with "VP1" equal to "%1"
# |           144:  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1> 
# | next:323'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:323'3      ?                                              possible intended match
# |           145:  EMIT branch-on-count vp<%index.next>, vp<%2> 
# | next:323'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           146:  No successors 
# | next:323'0     ~~~~~~~~~~~~~~~
# |           147: } 
# | next:323'0     ~~
# |           148: Successor(s): middle.block 
# | next:323'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           149:  
# | next:323'0     ~
# |             .
# |             .
# |             .
# | >>>>>>
# `-----------------------------
# error: command failed with exit status: 1

--

LLVM.Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll

Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
/home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -p loop-vectorize -S /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll | /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -p loop-vectorize -S /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll
# note: command had no output on stdout or stderr
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll
# .---command stderr------------
# | /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll:72:15: error: CHECK-NEXT: expected string not found in input
# | ; CHECK-NEXT: [[INDEX_NEXT]] = add i64 [[INDEX]], 64
# |               ^
# | <stdin>:54:105: note: scanning from here
# |  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3)
# |                                                                                                         ^
# | <stdin>:54:105: note: with "INDEX_NEXT" equal to "%index.next"
# |  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3)
# |                                                                                                         ^
# | <stdin>:54:105: note: with "INDEX" equal to "%index"
# |  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3)
# |                                                                                                         ^
# | <stdin>:55:12: note: possible intended match here
# |  %index.next = add nuw i64 %index, 64
# |            ^
# | /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll:128:91: error: undefined variable: PROF1
# | ; CHECK-NEXT: br i1 [[TMP5]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !prof [[PROF1]], !llvm.loop [[LOOP6:![0-9]+]]
# |                                                                                           ^
# | <stdin>:95:38: note: with "TMP5" equal to "%5"
# |  %5 = icmp eq i64 %index.next, %n.vec
# |                                      ^
# | <stdin>:95:38: note: with "VECTOR_BODY" equal to "vector.body"
# |  %5 = icmp eq i64 %index.next, %n.vec
# |                                      ^
# | <stdin>:96:26: note: possible intended match here
# |  br i1 %5, label %middle.block, label %vector.body, !prof !1, !llvm.loop !6
# |                          ^
# | 
# | Input file: <stdin>
# | Check file: /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |             .
# |             .
# |             .
# |            49:  %vec.iv = add <64 x i64> %broadcast.splat4, <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64 6, i64 7, i64 8, i64 9, i64 10, i64 11, i64 12, i64 13, i64 14, i64 15, i64 16, i64 17, i64 18, i64 19, i64 20, i64 21, i64 22, i64 23, i64 24, i64 25, i64 26, i64 27, i64 28, i64 29, i64 30, i64 31, i64 32, i64 33, i64 34, i64 35, i64 36, i64 37, i64 38, i64 39, i64 40, i64 41, i64 42, i64 43, i64 44, i64 45, i64 46, i64 47, i64 48, i64 49, i64 50, i64 51, i64 52, i64 53, i64 54, i64 55, i64 56, i64 57, i64 58, i64 59, i64 60, i64 61, i64 62, i64 63> 
# |            50:  %3 = icmp ule <64 x i64> %vec.iv, %broadcast.splat 
# |            51:  %4 = add i64 %index, 1 
# |            52:  %5 = getelementptr i8, ptr %src, i64 %4 
# |            53:  %wide.masked.load = call <64 x i8> @llvm.masked.load.v64i8.p0(ptr align 1 %5, <64 x i1> %3, <64 x i8> poison) 
# |            54:  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3) 
# | next:72'0                                                                                                              X error: no match found
# | next:72'1                                                                                                                with "INDEX_NEXT" equal to "%index.next"
# | next:72'2                                                                                                                with "INDEX" equal to "%index"
# |            55:  %index.next = add nuw i64 %index, 64 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:72'3                 ?                           possible intended match
# |            56:  %6 = icmp eq i64 %index.next, %n.vec 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            57:  br i1 %6, label %middle.block, label %vector.body, !prof !1, !llvm.loop !2 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            58:  
# | next:72'0      ~
# |            59: middle.block: ; preds = %vector.body 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            60:  br label %exit 
# | next:72'0      ~~~~~~~~~~~~~~~~
# |             .
# |             .
# |             .
# |            90:  %3 = add i64 %index, 1 
# |            91:  %4 = getelementptr i8, ptr %src, i64 %3 
# |            92:  %wide.masked.load = call <64 x i8> @llvm.masked.load.v64i8.p0(ptr align 1 %4, <64 x i1> %2, <64 x i8> poison) 
# |            93:  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %2) 
# |            94:  %index.next = add nuw i64 %index, 64 
# |            95:  %5 = icmp eq i64 %index.next, %n.vec 
# | next:128'0                                          X error: match failed for invalid pattern
# | next:128'1                                            undefined variable: PROF1
# | next:128'2                                            with "TMP5" equal to "%5"
# | next:128'3                                            with "VECTOR_BODY" equal to "vector.body"
# |            96:  br i1 %5, label %middle.block, label %vector.body, !prof !1, !llvm.loop !6 
# | next:128'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:128'4                              ?                                                   possible intended match
# |            97:  
# | next:128'0     ~
# |            98: middle.block: ; preds = %vector.body 
# | next:128'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            99:  br label %exit 
# | next:128'0     ~~~~~~~~~~~~~~~~
# |           100:  
# | next:128'0     ~
# |           101: exit: ; preds = %middle.block 
# | next:128'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |             .
# |             .
# |             .
# | >>>>>>
# `-----------------------------
# error: command failed with exit status: 1

--

LLVM.Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll

Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
/home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -p loop-vectorize -prefer-predicate-over-epilogue=predicate-dont-vectorize -force-vector-width=4 -S /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll | /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/opt -p loop-vectorize -prefer-predicate-over-epilogue=predicate-dont-vectorize -force-vector-width=4 -S /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll
# note: command had no output on stdout or stderr
# executed command: /home/gha/actions-runner/_work/llvm-project/llvm-project/build/bin/FileCheck /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll
# .---command stderr------------
# | /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll:71:15: error: CHECK-NEXT: expected string not found in input
# | ; CHECK-NEXT: [[INDEX_NEXT]] = add i64 [[INDEX]], 4
# |               ^
# | <stdin>:78:62: note: scanning from here
# |  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi
# |                                                              ^
# | <stdin>:78:62: note: with "INDEX_NEXT" equal to "%index.next"
# |  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi
# |                                                              ^
# | <stdin>:78:62: note: with "INDEX" equal to "%index"
# |  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi
# |                                                              ^
# | <stdin>:80:14: note: possible intended match here
# |  %vec.ind.next = add <4 x i64> %vec.ind, splat (i64 4)
# |              ^
# | 
# | Input file: <stdin>
# | Check file: /home/gha/actions-runner/_work/llvm-project/llvm-project/llvm/test/Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |            .
# |            .
# |            .
# |           73:  %27 = icmp eq <4 x i32> %26, zeroinitializer 
# |           74:  %28 = select <4 x i1> %2, <4 x i1> %27, <4 x i1> zeroinitializer 
# |           75:  %29 = freeze <4 x i1> %28 
# |           76:  %30 = call i1 @llvm.vector.reduce.or.v4i1(<4 x i1> %29) 
# |           77:  %31 = select i1 %30, <4 x i1> %28, <4 x i1> %1 
# |           78:  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi 
# | next:71'0                                                                  X error: no match found
# | next:71'1                                                                    with "INDEX_NEXT" equal to "%index.next"
# | next:71'2                                                                    with "INDEX" equal to "%index"
# |           79:  %index.next = add nuw i64 %index, 4 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           80:  %vec.ind.next = add <4 x i64> %vec.ind, splat (i64 4) 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:71'3                  ?                                          possible intended match
# |           81:  %vec.ind.next8 = add <4 x i32> %vec.ind1, splat (i32 4) 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           82:  %33 = icmp eq i64 %index.next, %n.vec 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           83:  br i1 %33, label %middle.block, label %vector.body, !llvm.loop !0 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           84:  
# | next:71'0     ~
# |           85: middle.block: ; preds = %pred.load.continue7 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            .
# |            .
# |            .
# | >>>>>>
# `-----------------------------
# error: command failed with exit status: 1

--

If these failures are unrelated to your changes (for example tests are broken or flaky at HEAD), please open an issue at https://github.com/llvm/llvm-project/issues and add the infrastructure label.

github-actions · 2026-02-25T14:36:14Z

🪟 Windows x64 Test Results

131411 tests passed
2950 tests skipped
3 tests failed

Failed Tests

(click on a test name to see its output)

LLVM

LLVM.Transforms/LoopVectorize/VPlan/conditional-scalar-assignment-vplan.ll

Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
c:\_work\llvm-project\llvm-project\build\bin\opt.exe -passes=loop-vectorize -vplan-print-after="optimize$"  -force-vector-width=4 -disable-output 2>&1 < C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll | c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\opt.exe' -passes=loop-vectorize '-vplan-print-after=optimize$' -force-vector-width=4 -disable-output
# note: command had no output on stdout or stderr
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe' 'C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll'
# note: command had no output on stdout or stderr
# RUN: at line 4
c:\_work\llvm-project\llvm-project\build\bin\opt.exe -passes=loop-vectorize -vplan-print-after="optimize$"  -force-vector-width=4 -prefer-predicate-over-epilogue=predicate-dont-vectorize  -disable-output 2>&1 < C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll | c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll --check-prefix=CHECK-TF
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\opt.exe' -passes=loop-vectorize '-vplan-print-after=optimize$' -force-vector-width=4 -prefer-predicate-over-epilogue=predicate-dont-vectorize -disable-output
# note: command had no output on stdout or stderr
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe' 'C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll' --check-prefix=CHECK-TF
# .---command stderr------------
# | C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll:123:18: error: CHECK-TF-NEXT: expected string not found in input
# | ; CHECK-TF-NEXT: EMIT vp<%index.next> = add vp<[[VP4]]>, vp<[[VP1]]>
# |                  ^
# | <stdin>:46:54: note: scanning from here
# |  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi>
# |                                                      ^
# | <stdin>:46:54: note: with "VP4" equal to "%4"
# |  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi>
# |                                                      ^
# | <stdin>:46:54: note: with "VP1" equal to "%1"
# |  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi>
# |                                                      ^
# | <stdin>:47:2: note: possible intended match here
# |  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1>
# |  ^
# | C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll:323:18: error: CHECK-TF-NEXT: expected string not found in input
# | ; CHECK-TF-NEXT: EMIT vp<%index.next> = add vp<[[VP4]]>, vp<[[VP1]]>
# |                  ^
# | <stdin>:143:55: note: scanning from here
# |  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi>
# |                                                       ^
# | <stdin>:143:55: note: with "VP4" equal to "%4"
# |  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi>
# |                                                       ^
# | <stdin>:143:55: note: with "VP1" equal to "%1"
# |  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi>
# |                                                       ^
# | <stdin>:144:2: note: possible intended match here
# |  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1>
# |  ^
# | 
# | Input file: <stdin>
# | Check file: C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\VPlan\conditional-scalar-assignment-vplan.ll
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |             .
# |             .
# |             .
# |            41:  loop.0: 
# |            42:  WIDEN ir<%select.cmp> = icmp slt ir<%a>, vp<%8> 
# |            43:  EMIT vp<%9> = logical-and vp<%6>, ir<%select.cmp> 
# |            44:  EMIT vp<%10> = any-of vp<%9> 
# |            45:  EMIT vp<%11> = select vp<%10>, vp<%9>, vp<%5> 
# |            46:  EMIT vp<%12> = select vp<%10>, vp<%8>, ir<%data.phi> 
# | next:123'0                                                          X error: no match found
# | next:123'1                                                            with "VP4" equal to "%4"
# | next:123'2                                                            with "VP1" equal to "%1"
# |            47:  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1> 
# | next:123'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:123'3      ?                                              possible intended match
# |            48:  EMIT branch-on-count vp<%index.next>, vp<%2> 
# | next:123'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            49:  No successors 
# | next:123'0     ~~~~~~~~~~~~~~~
# |            50: } 
# | next:123'0     ~~
# |            51: Successor(s): middle.block 
# | next:123'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            52:  
# | next:123'0     ~
# |             .
# |             .
# |             .
# |           138:  
# |           139:  if.then.1: 
# |           140:  EMIT vp<%11> = logical-and vp<%7>, vp<%9> 
# |           141:  EMIT vp<%12> = any-of vp<%11> 
# |           142:  EMIT vp<%13> = select vp<%12>, vp<%11>, vp<%5> 
# |           143:  EMIT vp<%14> = select vp<%12>, vp<%10>, ir<%data.phi> 
# | next:323'0                                                           X error: no match found
# | next:323'1                                                             with "VP4" equal to "%4"
# | next:323'2                                                             with "VP1" equal to "%1"
# |           144:  EMIT vp<%index.next> = add nuw vp<%4>, vp<%1> 
# | next:323'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:323'3      ?                                              possible intended match
# |           145:  EMIT branch-on-count vp<%index.next>, vp<%2> 
# | next:323'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           146:  No successors 
# | next:323'0     ~~~~~~~~~~~~~~~
# |           147: } 
# | next:323'0     ~~
# |           148: Successor(s): middle.block 
# | next:323'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           149:  
# | next:323'0     ~
# |             .
# |             .
# |             .
# | >>>>>>
# `-----------------------------
# error: command failed with exit status: 1

--

LLVM.Transforms/LoopVectorize/X86/fold-tail-low-trip-count.ll

Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
c:\_work\llvm-project\llvm-project\build\bin\opt.exe -p loop-vectorize -S C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\X86\fold-tail-low-trip-count.ll | c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\X86\fold-tail-low-trip-count.ll
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\opt.exe' -p loop-vectorize -S 'C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\X86\fold-tail-low-trip-count.ll'
# note: command had no output on stdout or stderr
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe' 'C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\X86\fold-tail-low-trip-count.ll'
# .---command stderr------------
# | C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\X86\fold-tail-low-trip-count.ll:72:15: error: CHECK-NEXT: expected string not found in input
# | ; CHECK-NEXT: [[INDEX_NEXT]] = add i64 [[INDEX]], 64
# |               ^
# | <stdin>:54:105: note: scanning from here
# |  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3)
# |                                                                                                         ^
# | <stdin>:54:105: note: with "INDEX_NEXT" equal to "%index.next"
# |  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3)
# |                                                                                                         ^
# | <stdin>:54:105: note: with "INDEX" equal to "%index"
# |  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3)
# |                                                                                                         ^
# | <stdin>:55:12: note: possible intended match here
# |  %index.next = add nuw i64 %index, 64
# |            ^
# | C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\X86\fold-tail-low-trip-count.ll:128:91: error: undefined variable: PROF1
# | ; CHECK-NEXT: br i1 [[TMP5]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !prof [[PROF1]], !llvm.loop [[LOOP6:![0-9]+]]
# |                                                                                           ^
# | <stdin>:95:38: note: with "TMP5" equal to "%5"
# |  %5 = icmp eq i64 %index.next, %n.vec
# |                                      ^
# | <stdin>:95:38: note: with "VECTOR_BODY" equal to "vector.body"
# |  %5 = icmp eq i64 %index.next, %n.vec
# |                                      ^
# | <stdin>:96:26: note: possible intended match here
# |  br i1 %5, label %middle.block, label %vector.body, !prof !1, !llvm.loop !6
# |                          ^
# | 
# | Input file: <stdin>
# | Check file: C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\X86\fold-tail-low-trip-count.ll
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |             .
# |             .
# |             .
# |            49:  %vec.iv = add <64 x i64> %broadcast.splat4, <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64 6, i64 7, i64 8, i64 9, i64 10, i64 11, i64 12, i64 13, i64 14, i64 15, i64 16, i64 17, i64 18, i64 19, i64 20, i64 21, i64 22, i64 23, i64 24, i64 25, i64 26, i64 27, i64 28, i64 29, i64 30, i64 31, i64 32, i64 33, i64 34, i64 35, i64 36, i64 37, i64 38, i64 39, i64 40, i64 41, i64 42, i64 43, i64 44, i64 45, i64 46, i64 47, i64 48, i64 49, i64 50, i64 51, i64 52, i64 53, i64 54, i64 55, i64 56, i64 57, i64 58, i64 59, i64 60, i64 61, i64 62, i64 63> 
# |            50:  %3 = icmp ule <64 x i64> %vec.iv, %broadcast.splat 
# |            51:  %4 = add i64 %index, 1 
# |            52:  %5 = getelementptr i8, ptr %src, i64 %4 
# |            53:  %wide.masked.load = call <64 x i8> @llvm.masked.load.v64i8.p0(ptr align 1 %5, <64 x i1> %3, <64 x i8> poison) 
# |            54:  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %3) 
# | next:72'0                                                                                                              X error: no match found
# | next:72'1                                                                                                                with "INDEX_NEXT" equal to "%index.next"
# | next:72'2                                                                                                                with "INDEX" equal to "%index"
# |            55:  %index.next = add nuw i64 %index, 64 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:72'3                 ?                           possible intended match
# |            56:  %6 = icmp eq i64 %index.next, %n.vec 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            57:  br i1 %6, label %middle.block, label %vector.body, !prof !1, !llvm.loop !2 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            58:  
# | next:72'0      ~
# |            59: middle.block: ; preds = %vector.body 
# | next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            60:  br label %exit 
# | next:72'0      ~~~~~~~~~~~~~~~~
# |             .
# |             .
# |             .
# |            90:  %3 = add i64 %index, 1 
# |            91:  %4 = getelementptr i8, ptr %src, i64 %3 
# |            92:  %wide.masked.load = call <64 x i8> @llvm.masked.load.v64i8.p0(ptr align 1 %4, <64 x i1> %2, <64 x i8> poison) 
# |            93:  call void @llvm.masked.store.v64i8.p0(<64 x i8> %wide.masked.load, ptr align 1 %next.gep, <64 x i1> %2) 
# |            94:  %index.next = add nuw i64 %index, 64 
# |            95:  %5 = icmp eq i64 %index.next, %n.vec 
# | next:128'0                                          X error: match failed for invalid pattern
# | next:128'1                                            undefined variable: PROF1
# | next:128'2                                            with "TMP5" equal to "%5"
# | next:128'3                                            with "VECTOR_BODY" equal to "vector.body"
# |            96:  br i1 %5, label %middle.block, label %vector.body, !prof !1, !llvm.loop !6 
# | next:128'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:128'4                              ?                                                   possible intended match
# |            97:  
# | next:128'0     ~
# |            98: middle.block: ; preds = %vector.body 
# | next:128'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            99:  br label %exit 
# | next:128'0     ~~~~~~~~~~~~~~~~
# |           100:  
# | next:128'0     ~
# |           101: exit: ; preds = %middle.block 
# | next:128'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |             .
# |             .
# |             .
# | >>>>>>
# `-----------------------------
# error: command failed with exit status: 1

--

LLVM.Transforms/LoopVectorize/iv-select-cmp-fold-tail.ll

Exit Code: 1

Command Output (stdout):
--
# RUN: at line 2
c:\_work\llvm-project\llvm-project\build\bin\opt.exe -p loop-vectorize -prefer-predicate-over-epilogue=predicate-dont-vectorize -force-vector-width=4 -S C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\iv-select-cmp-fold-tail.ll | c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\iv-select-cmp-fold-tail.ll
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\opt.exe' -p loop-vectorize -prefer-predicate-over-epilogue=predicate-dont-vectorize -force-vector-width=4 -S 'C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\iv-select-cmp-fold-tail.ll'
# note: command had no output on stdout or stderr
# executed command: 'c:\_work\llvm-project\llvm-project\build\bin\filecheck.exe' 'C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\iv-select-cmp-fold-tail.ll'
# .---command stderr------------
# | C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\iv-select-cmp-fold-tail.ll:71:15: error: CHECK-NEXT: expected string not found in input
# | ; CHECK-NEXT: [[INDEX_NEXT]] = add i64 [[INDEX]], 4
# |               ^
# | <stdin>:78:62: note: scanning from here
# |  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi
# |                                                              ^
# | <stdin>:78:62: note: with "INDEX_NEXT" equal to "%index.next"
# |  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi
# |                                                              ^
# | <stdin>:78:62: note: with "INDEX" equal to "%index"
# |  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi
# |                                                              ^
# | <stdin>:80:14: note: possible intended match here
# |  %vec.ind.next = add <4 x i64> %vec.ind, splat (i64 4)
# |              ^
# | 
# | Input file: <stdin>
# | Check file: C:\_work\llvm-project\llvm-project\llvm\test\Transforms\LoopVectorize\iv-select-cmp-fold-tail.ll
# | 
# | -dump-input=help explains the following input dump.
# | 
# | Input was:
# | <<<<<<
# |            .
# |            .
# |            .
# |           73:  %27 = icmp eq <4 x i32> %26, zeroinitializer 
# |           74:  %28 = select <4 x i1> %2, <4 x i1> %27, <4 x i1> zeroinitializer 
# |           75:  %29 = freeze <4 x i1> %28 
# |           76:  %30 = call i1 @llvm.vector.reduce.or.v4i1(<4 x i1> %29) 
# |           77:  %31 = select i1 %30, <4 x i1> %28, <4 x i1> %1 
# |           78:  %32 = select i1 %30, <4 x i32> %vec.ind1, <4 x i32> %vec.phi 
# | next:71'0                                                                  X error: no match found
# | next:71'1                                                                    with "INDEX_NEXT" equal to "%index.next"
# | next:71'2                                                                    with "INDEX" equal to "%index"
# |           79:  %index.next = add nuw i64 %index, 4 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           80:  %vec.ind.next = add <4 x i64> %vec.ind, splat (i64 4) 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# | next:71'3                  ?                                          possible intended match
# |           81:  %vec.ind.next8 = add <4 x i32> %vec.ind1, splat (i32 4) 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           82:  %33 = icmp eq i64 %index.next, %n.vec 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           83:  br i1 %33, label %middle.block, label %vector.body, !llvm.loop !0 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |           84:  
# | next:71'0     ~
# |           85: middle.block: ; preds = %pred.load.continue7 
# | next:71'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# |            .
# |            .
# |            .
# | >>>>>>
# `-----------------------------
# error: command failed with exit status: 1

--

If these failures are unrelated to your changes (for example tests are broken or flaky at HEAD), please open an issue at https://github.com/llvm/llvm-project/issues and add the infrastructure label.

…ze/dont-drop-can-iv-tail-fold

fhahn · 2026-02-25T15:03:03Z

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

-  // the vector loop or when not folding the tail. In the later case, we know
-  // that the canonical induction increment will not overflow as the vector trip
-  // count is >= increment and a multiple of the increment.
  VPRegionBlock *LoopRegion = Plan->getVectorLoopRegion();


can sink this now closer to first use

Done in 853c211

fhahn · 2026-02-25T15:03:49Z

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

@@ -8201,22 +8201,7 @@ VPlanPtr LoopVectorizationPlanner::tryToBuildVPlanWithVPRecipes(
    IVUpdateMayOverflow |= !isIndvarOverflowCheckKnownFalse(&CM, VF);

  TailFoldingStyle Style = CM.getTailFoldingStyle(IVUpdateMayOverflow);


can also sink this and IVUpdateMayOverflow closer to use

Done in 853c211

fhahn

LGTM, thanks!

…ze/dont-drop-can-iv-tail-fold

llvm-ci · 2026-02-27T08:22:52Z

LLVM Buildbot has detected a new failure on builder sanitizer-x86_64-linux-fast running on sanitizer-buildbot4 while building llvm at step 2 "annotate".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/169/builds/20391

Here is the relevant piece of the build log for the reference

Step 2 (annotate) failure: 'python ../sanitizer_buildbot/sanitizers/zorg/buildbot/builders/sanitizers/buildbot_selector.py' (failure)
...
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using lld-link: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/lld-link
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld64.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/ld64.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using wasm-ld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/wasm-ld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/ld.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using lld-link: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/lld-link
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld64.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/ld64.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using wasm-ld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/wasm-ld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/main.py:74: note: The test suite configuration requested an individual test timeout of 0 seconds but a timeout of 900 seconds was requested on the command line. Forcing timeout to be 900 seconds.
-- Testing: 96703 tests, 64 workers --
Testing:  0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90.
FAIL: LLVM :: ExecutionEngine/JITLink/x86-64/COFF_directive_include.s (75007 of 96703)
******************** TEST 'LLVM :: ExecutionEngine/JITLink/x86-64/COFF_directive_include.s' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 1
/home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-mc -filetype=obj -triple=x86_64-windows-msvc /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s -o /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-mc -filetype=obj -triple=x86_64-windows-msvc /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s -o /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp
# note: command had no output on stdout or stderr
# RUN: at line 2
not /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-jitlink -noexec /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp 2>&1 | /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/FileCheck /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s
# executed command: not /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-jitlink -noexec /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp
# note: command had no output on stdout or stderr
# error: command failed with exit status: 1
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/FileCheck /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s
# note: command had no output on stdout or stderr

--

********************
Testing:  0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90..
Slowest Tests:
--------------------------------------------------------------------------
386.65s: LLVM :: CodeGen/AMDGPU/sched-group-barrier-pipeline-solver.mir
205.77s: Clang :: Preprocessor/riscv-target-features.c
200.88s: LLVM :: CodeGen/AMDGPU/amdgcn.bitcast.1024bit.ll
180.59s: Clang :: OpenMP/target_update_codegen.cpp
172.97s: Clang :: OpenMP/target_defaultmap_codegen_01.cpp
170.44s: Clang :: Driver/arm-cortex-cpus-1.c
168.41s: Clang :: Driver/arm-cortex-cpus-2.c
165.02s: Clang :: Preprocessor/aarch64-target-features.c
160.00s: Clang :: Preprocessor/arm-target-features.c
140.83s: Clang :: ClangScanDeps/build-session-validation-outdated-module.c
138.75s: Clang :: Preprocessor/predefined-arch-macros.c
137.47s: Clang :: Analysis/a_flaky_crash.cpp
124.27s: LLVM :: CodeGen/RISCV/attributes.ll
119.35s: LLVM :: CodeGen/AMDGPU/memintrinsic-unroll.ll
118.84s: Clang :: Driver/fsanitize.c
Step 10 (stage2/asan_ubsan check) failure: stage2/asan_ubsan check (failure)
...
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using lld-link: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/lld-link
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld64.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/ld64.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using wasm-ld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/wasm-ld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/ld.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using lld-link: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/lld-link
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld64.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/ld64.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using wasm-ld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/wasm-ld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/main.py:74: note: The test suite configuration requested an individual test timeout of 0 seconds but a timeout of 900 seconds was requested on the command line. Forcing timeout to be 900 seconds.
-- Testing: 96703 tests, 64 workers --
Testing:  0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90.
FAIL: LLVM :: ExecutionEngine/JITLink/x86-64/COFF_directive_include.s (75007 of 96703)
******************** TEST 'LLVM :: ExecutionEngine/JITLink/x86-64/COFF_directive_include.s' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 1
/home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-mc -filetype=obj -triple=x86_64-windows-msvc /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s -o /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-mc -filetype=obj -triple=x86_64-windows-msvc /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s -o /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp
# note: command had no output on stdout or stderr
# RUN: at line 2
not /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-jitlink -noexec /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp 2>&1 | /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/FileCheck /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s
# executed command: not /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/llvm-jitlink -noexec /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/test/ExecutionEngine/JITLink/x86-64/Output/COFF_directive_include.s.tmp
# note: command had no output on stdout or stderr
# error: command failed with exit status: 1
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan_ubsan/bin/FileCheck /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/ExecutionEngine/JITLink/x86-64/COFF_directive_include.s
# note: command had no output on stdout or stderr

--

********************
Testing:  0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90..
Slowest Tests:
--------------------------------------------------------------------------
386.65s: LLVM :: CodeGen/AMDGPU/sched-group-barrier-pipeline-solver.mir
205.77s: Clang :: Preprocessor/riscv-target-features.c
200.88s: LLVM :: CodeGen/AMDGPU/amdgcn.bitcast.1024bit.ll
180.59s: Clang :: OpenMP/target_update_codegen.cpp
172.97s: Clang :: OpenMP/target_defaultmap_codegen_01.cpp
170.44s: Clang :: Driver/arm-cortex-cpus-1.c
168.41s: Clang :: Driver/arm-cortex-cpus-2.c
165.02s: Clang :: Preprocessor/aarch64-target-features.c
160.00s: Clang :: Preprocessor/arm-target-features.c
140.83s: Clang :: ClangScanDeps/build-session-validation-outdated-module.c
138.75s: Clang :: Preprocessor/predefined-arch-macros.c
137.47s: Clang :: Analysis/a_flaky_crash.cpp
124.27s: LLVM :: CodeGen/RISCV/attributes.ll
119.35s: LLVM :: CodeGen/AMDGPU/memintrinsic-unroll.ll
118.84s: Clang :: Driver/fsanitize.c
Step 14 (stage2/msan check) failure: stage2/msan check (failure)
...
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using lld-link: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/lld-link
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld64.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/ld64.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using wasm-ld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/wasm-ld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/ld.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using lld-link: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/lld-link
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using ld64.lld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/ld64.lld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/llvm/config.py:561: note: using wasm-ld: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/wasm-ld
llvm-lit: /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/utils/lit/lit/main.py:74: note: The test suite configuration requested an individual test timeout of 0 seconds but a timeout of 900 seconds was requested on the command line. Forcing timeout to be 900 seconds.
-- Testing: 96701 tests, 64 workers --
Testing: 
FAIL: LLVM :: CodeGen/X86/basic-block-sections-clusters-bb-hash.ll (1 of 96701)
******************** TEST 'LLVM :: CodeGen/X86/basic-block-sections-clusters-bb-hash.ll' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 11
/home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/llc /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/X86/basic-block-sections-clusters-bb-hash.ll -O0 -mtriple=x86_64-pc-linux -function-sections -filetype=obj -basic-block-address-map -emit-bb-hash -o /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp.o
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/llc /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/X86/basic-block-sections-clusters-bb-hash.ll -O0 -mtriple=x86_64-pc-linux -function-sections -filetype=obj -basic-block-address-map -emit-bb-hash -o /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp.o
# note: command had no output on stdout or stderr
# RUN: at line 16
echo 'v1' > /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp1
# executed command: echo v1
# note: command had no output on stdout or stderr
# RUN: at line 17
echo 'f foo' >> /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp1
# executed command: echo 'f foo'
# note: command had no output on stdout or stderr
# RUN: at line 18
echo 'g 0:100,1:100,2:0 1:100,3:100 2:0,3:0 3:100' >> /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp1
# executed command: echo 'g 0:100,1:100,2:0 1:100,3:100 2:0,3:0 3:100'
# note: command had no output on stdout or stderr
# RUN: at line 22
/home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/llvm-readobj /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp.o --bb-addr-map |  awk 'BEGIN {printf "h"}      /ID: [0-9]+/ {id=$2}      /Hash: 0x[0-9A-Fa-f]+/ {gsub(/^0x/, "", $2); hash=$2; printf " %s:%s", id, hash}      END {print ""}'  >> /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp1
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/llvm-readobj /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp.o --bb-addr-map
# note: command had no output on stdout or stderr
# executed command: awk 'BEGIN {printf "h"}      /ID: [0-9]+/ {id=$2}      /Hash: 0x[0-9A-Fa-f]+/ {gsub(/^0x/, "", $2); hash=$2; printf " %s:%s", id, hash}      END {print ""}'
# note: command had no output on stdout or stderr
# RUN: at line 29
/home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/llc < /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/X86/basic-block-sections-clusters-bb-hash.ll -O0 -mtriple=x86_64-pc-linux -function-sections -basic-block-sections=/home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp1 -basic-block-section-match-infer |  /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/FileCheck /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/X86/basic-block-sections-clusters-bb-hash.ll -check-prefixes=CHECK,LINUX-SECTIONS1
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/llc -O0 -mtriple=x86_64-pc-linux -function-sections -basic-block-sections=/home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/test/CodeGen/X86/Output/basic-block-sections-clusters-bb-hash.ll.tmp1 -basic-block-section-match-infer
# note: command had no output on stdout or stderr
# executed command: /home/b/sanitizer-x86_64-linux-fast/build/llvm_build_msan/bin/FileCheck /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/X86/basic-block-sections-clusters-bb-hash.ll -check-prefixes=CHECK,LINUX-SECTIONS1
# .---command stderr------------
# | /home/b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/X86/basic-block-sections-clusters-bb-hash.ll:80:26: error: LINUX-SECTIONS1-LABEL: expected string not found in input
# | ; LINUX-SECTIONS1-LABEL: # %bb.1:
# |                          ^
# | <stdin>:7:5: note: scanning from here
# | foo: # @foo

…m#183301)" This reverts commit b0b3e3e. After thinking about this for a bit, I don't think this is correct. vscale being a power-of-2 only guarantees the canonical IV increment overflows to zero, but not overflows in general.

lukel97 · 2026-02-27T08:46:01Z

I thought about this a bit but I don't think this is correct, since vscale being a power of two only guarantees that the can IV increment overflows to zero, so it can still exhibit unsigned wrap. Reverting in #183698

…3301)" (#183698) This reverts commit b0b3e3e. After thinking about this for a bit, I don't think this is correct. vscale being a power-of-2 only guarantees the canonical IV increment overflows to zero, but not overflows in general.

fhahn · 2026-02-27T09:43:23Z

I thought about this a bit but I don't think this is correct, since vscale being a power of two only guarantees that the can IV increment overflows to zero, so it can still exhibit unsigned wrap. Reverting in #183698

yeah that sounds right; the only case this can happen is when the the trip count is -1, which is already be handled by IVUpdateMayOverflow

lukel97 · 2026-02-27T09:54:33Z

I thought about this a bit but I don't think this is correct, since vscale being a power of two only guarantees that the can IV increment overflows to zero, so it can still exhibit unsigned wrap. Reverting in #183698

yeah that sounds right; the only case this can happen is when the the trip count is -1, which is already be handled by IVUpdateMayOverflow

I think it can happen for any TC >= UINT_MAX - VF*UF, but yeah I think IVUpdateMayOverflow should check for that.

We still have the simplification now that the canonical IV itself (not the increment) can't wrap, which I think means we can add nuw to unrolled canonical IV steps.

After llvm#183080 vscale can no longer be a non-power of 2, which means the canonical IV can't overflow with tail folding w/ scalable vectors anymore. Therefore we don't need to drop the NUW flag. IVUpdateMayOverflow is left to be removed in a separate PR since it removes further runtime checks.

…m#183301)" (llvm#183698) This reverts commit b0b3e3e. After thinking about this for a bit, I don't think this is correct. vscale being a power-of-2 only guarantees the canonical IV increment overflows to zero, but not overflows in general.

lukel97 requested review from david-arm, fhahn and paulwalker-arm February 25, 2026 13:56

llvmbot added backend:RISC-V vectorizers llvm:transforms labels Feb 25, 2026

Merge branch 'main' of github.com:llvm/llvm-project into loop-vectori…

0dc2884

…ze/dont-drop-can-iv-tail-fold

fhahn reviewed Feb 25, 2026

View reviewed changes

lukel97 added 2 commits February 25, 2026 23:07

Update tests

90e8d1d

Sink to uses, update comment and replace with TODO to remove

853c211

llvmbot added the backend:PowerPC label Feb 25, 2026

Update phaseordering test

4752ae8

paulwalker-arm approved these changes Feb 26, 2026

View reviewed changes

fhahn approved these changes Feb 26, 2026

View reviewed changes

Merge branch 'main' into loop-vectorize/dont-drop-can-iv-tail-fold

06f32d9

lukel97 enabled auto-merge (squash) February 27, 2026 06:10

lukel97 disabled auto-merge February 27, 2026 06:39

lukel97 added 2 commits February 27, 2026 14:39

Merge branch 'main' of github.com:llvm/llvm-project into loop-vectori…

2598db1

…ze/dont-drop-can-iv-tail-fold

Update tests after merge

dc1f66c

lukel97 enabled auto-merge (squash) February 27, 2026 06:47

lukel97 merged commit b0b3e3e into llvm:main Feb 27, 2026
9 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[VPlan] Don't drop NUW flag on tail folded canonical IVs#183301

[VPlan] Don't drop NUW flag on tail folded canonical IVs#183301
lukel97 merged 8 commits intollvm:mainfrom
lukel97:loop-vectorize/dont-drop-can-iv-tail-fold

lukel97 commented Feb 25, 2026 •

edited by paulwalker-arm

Loading

Uh oh!

llvmbot commented Feb 25, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 25, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 25, 2026 •

edited

Loading

Uh oh!

fhahn Feb 25, 2026

Uh oh!

lukel97 Feb 25, 2026

Uh oh!

fhahn Feb 25, 2026

Uh oh!

lukel97 Feb 25, 2026

Uh oh!

fhahn left a comment

Uh oh!

Uh oh!

llvm-ci commented Feb 27, 2026

Uh oh!

lukel97 commented Feb 27, 2026 •

edited

Loading

Uh oh!

fhahn commented Feb 27, 2026

Uh oh!

lukel97 commented Feb 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

		@@ -8201,22 +8201,7 @@ VPlanPtr LoopVectorizationPlanner::tryToBuildVPlanWithVPRecipes(
		IVUpdateMayOverflow \|= !isIndvarOverflowCheckKnownFalse(&CM, VF);

		TailFoldingStyle Style = CM.getTailFoldingStyle(IVUpdateMayOverflow);

Conversation

lukel97 commented Feb 25, 2026 • edited by paulwalker-arm Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🐧 Linux x64 Test Results

Failed Tests

LLVM

Uh oh!

github-actions bot commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🪟 Windows x64 Test Results

Failed Tests

LLVM

Uh oh!

fhahn Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

lukel97 Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

fhahn Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

lukel97 Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

fhahn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

llvm-ci commented Feb 27, 2026

Uh oh!

lukel97 commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fhahn commented Feb 27, 2026

Uh oh!

lukel97 commented Feb 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

lukel97 commented Feb 25, 2026 •

edited by paulwalker-arm

Loading

llvmbot commented Feb 25, 2026 •

edited

Loading

github-actions bot commented Feb 25, 2026 •

edited

Loading

github-actions bot commented Feb 25, 2026 •

edited

Loading

lukel97 commented Feb 27, 2026 •

edited

Loading