[mlir][linalg] Fix vectorizer generating invalid vector.gather for 0-D tensor.extract (#187085)
Conversation
@llvm/pr-subscribers-mlir @llvm/pr-subscribers-mlir-linalg

Author: Edgar (edg-l)

Changes

When vectorizing a rank-0 `linalg.generic` whose body contains a `tensor.extract` with data-dependent indices, the vectorizer incorrectly classified the access as a Gather (the 0-D result vector has no dimension > 1). This produced an invalid `vector.gather` with a scalar index operand where a vector of indices is required.

Error seen in practice: `vector.gather` fails verification because operand #2 must be a vector of index values but gets a scalar `index` instead.

Fix in `getTensorExtractMemoryAccessPattern`: classify a 0-D result vector as ScalarBroadcast, and skip the masking logic in the ScalarBroadcast path when the result rank is 0 (0-D vectors don't support masking).
Reproducer: ONNX Gather ops lowered to …

Full diff: https://github.com/llvm/llvm-project/pull/187085.diff

1 File Affected:
diff --git a/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp b/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp
index 0477815f329bf..d2439ef1f2bf4 100644
--- a/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp
+++ b/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp
@@ -1093,6 +1093,11 @@ getTensorExtractMemoryAccessPattern(tensor::ExtractOp extractOp,
if (inputShape.getShape().empty())
return VectorMemoryAccessKind::ScalarBroadcast;
+ // 0a. Is the result a 0-D vector? If yes, there are no iteration dimensions
+ // so the tensor.extract is a single scalar load regardless of the index.
+ if (resType.getRank() == 0)
+ return VectorMemoryAccessKind::ScalarBroadcast;
+
// True for vectors that are effectively 1D, e.g. `vector<1x4x1xi32>`, false
// otherwise.
bool isOutput1DVector =
@@ -1254,19 +1259,21 @@ vectorizeTensorExtract(RewriterBase &rewriter, VectorizationState &state,
rewriter, loc, resultType, extractOp.getTensor(), transferReadIdxs,
/*padding=*/std::nullopt, permutationMap, inBounds);
- // Mask this broadcasting xfer_read here rather than relying on the generic
- // path (the generic path assumes identity masking map, which wouldn't be
- // valid here).
- SmallVector<int64_t> readMaskShape = {1};
- auto readMaskType = VectorType::get(readMaskShape, rewriter.getI1Type());
- auto allTrue = vector::ConstantMaskOp::create(
- rewriter, loc, readMaskType, vector::ConstantMaskKind::AllTrue);
- auto *maskedReadOp =
- mlir::vector::maskOperation(rewriter, transferReadOp, allTrue);
+ Operation *resultOp = transferReadOp;
+ if (dstRank > 0) {
+ // Mask this broadcasting xfer_read here rather than relying on the
+ // generic path (the generic path assumes identity masking map, which
+ // wouldn't be valid here).
+ SmallVector<int64_t> readMaskShape = {1};
+ auto readMaskType = VectorType::get(readMaskShape, rewriter.getI1Type());
+ auto allTrue = vector::ConstantMaskOp::create(
+ rewriter, loc, readMaskType, vector::ConstantMaskKind::AllTrue);
+ resultOp =
+ mlir::vector::maskOperation(rewriter, transferReadOp, allTrue);
+ }
LDBG() << "Vectorised as scalar broadcast load: " << extractOp;
- return VectorizationHookResult{VectorizationHookStatus::NewOp,
- maskedReadOp};
+ return VectorizationHookResult{VectorizationHookStatus::NewOp, resultOp};
}
// 2b. Handle contiguous access.
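
For context, a minimal reproducer of the bug looks roughly like the following (a sketch; the function, operand names, and shapes here are assumed for illustration, not taken from the PR). The key ingredients are a rank-0 `linalg.generic` (empty `iterator_types`) whose body performs a `tensor.extract` at a data-dependent index:

```mlir
// Hypothetical rank-0 generic: no loop dimensions, so the result vector
// after vectorization is 0-D. The extract index comes from an input
// tensor, i.e. it is data-dependent.
func.func @extract_0d(%src: tensor<8xf32>, %idx: tensor<i32>) -> tensor<f32> {
  %init = tensor.empty() : tensor<f32>
  %res = linalg.generic
      {indexing_maps = [affine_map<() -> ()>, affine_map<() -> ()>],
       iterator_types = []}
      ins(%idx : tensor<i32>) outs(%init : tensor<f32>) {
  ^bb0(%i: i32, %out: f32):
    %c = arith.index_cast %i : i32 to index
    %e = tensor.extract %src[%c] : tensor<8xf32>
    linalg.yield %e : f32
  } -> tensor<f32>
  return %res : tensor<f32>
}
```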
✅ With the latest revision this PR passed the C/C++ code formatter.

Thanks for the fix! Please add tests :)
Force-pushed 97daa08 to 2fccbf9.
…D tensor.extract

When vectorizing a rank-0 `linalg.generic` whose body contains `tensor.extract` with data-dependent indices, the vectorizer incorrectly classified the access as a Gather (since the 0-D result vector has no dimension > 1). This produced an invalid `vector.gather` with a scalar index operand where a vector of indices is required. Fix by classifying 0-D result vectors as ScalarBroadcast in `getTensorExtractMemoryAccessPattern`, and skipping the masking logic in the ScalarBroadcast path when the result rank is 0 (0-D vectors don't support masking).
Force-pushed 2fccbf9 to a18ee37.
banach-space left a comment
LGTM % nits
Thanks!
@edg-l, do you have commit access to land this?
No, I don't have any permission.
Force-pushed 8d9649e to b68108e.
…D tensor.extract (llvm#187085)

Vectorizing a rank-0 `linalg.generic` whose body contains `tensor.extract` with data-dependent indices hits the Gather classification in `getTensorExtractMemoryAccessPattern` because `isOutput1DVector` returns false for a 0-D result. This produces an invalid `vector.gather` where operand #2 must be a vector of index values but gets a scalar `index` instead. Fix classifies a 0-D result as ScalarBroadcast rather than Gather, and skips mask generation for 0-D in that path.
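
To make the failure mode concrete, here is a hedged sketch of the vectorized IR before and after the fix (operand names and types are assumed for illustration, not taken from the PR):

```mlir
// Before the fix (rejected by the verifier): the access was classified as
// Gather, so the vectorizer emitted something along the lines of
//   %g = vector.gather %src[%c0] [%idx], %mask, %pass : ...
// where the index operand %idx is a scalar `index`, but vector.gather
// requires a vector of index values (operand #2).
//
// After the fix the access is classified as ScalarBroadcast, yielding a
// plain, unmasked 0-D transfer_read -- unmasked because 0-D vectors do
// not support masking:
%r = vector.transfer_read %src[%idx], %pad
       {in_bounds = [], permutation_map = affine_map<(d0) -> ()>}
       : tensor<8xf32>, vector<f32>
```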