[Enhancement] Improve handling of negative indices for ramp and broadcast node #1207
Conversation
…ve_index pass

* Added logic to handle scalar and vector indices separately, enhancing the ability to determine non-negativity and negativity of indices.
* Introduced detailed logging for cases where non-negativity cannot be proven, improving debugging capabilities.
* Refactored index state determination for vector types, including support for Ramp and Broadcast nodes.
Walkthrough

Enhanced index legalization logic in a single transformation file. Scalar index handling now proves both non-negativity and negativity conditions. Vector index handling added with per-pattern analysis for RampNode and BroadcastNode to determine state (NonNegative, Negative, or Unknown). State propagation updated accordingly with appropriate logging when proofs fail.
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~10 minutes
Actionable comments posted: 0
🧹 Nitpick comments (1)
src/transform/legalize_negative_index.cc (1)
94-110: Consider applying the fallback pattern to scalar indices.

The `BroadcastNode` handling uses a two-tier approach: first attempting `CanProve`, then falling back to `const_int_bound`. This is more robust than the scalar handling (lines 42-51), which only uses `CanProve`. Applying the same fallback pattern to scalar indices could reduce false unknowns.

For consistency, consider refactoring scalar index handling:

```diff
 if (simplified.dtype().lanes() == 1) {
   if (analyzer_.CanProve(simplified >= 0)) {
     states.push_back(IndexSignState::kNonNegative);
     continue;
   }
   if (analyzer_.CanProve(simplified < 0)) {
     states.push_back(IndexSignState::kNegative);
     needs_record = true;
     continue;
   }
+  // Fallback to bound analysis if proof unavailable
+  auto bound = analyzer_.const_int_bound(simplified);
+  if (bound->min_value >= 0) {
+    states.push_back(IndexSignState::kNonNegative);
+    continue;
+  }
+  if (bound->max_value < 0) {
+    states.push_back(IndexSignState::kNegative);
+    needs_record = true;
+    continue;
+  }
   states.push_back(IndexSignState::kUnknown);
   needs_record = true;
   LOG(WARNING) << "LegalizeNegativeIndex: cannot prove non-negative index "
                << simplified << " for buffer " << load->buffer->name
                << " (axis " << i << ").";
   continue;
 }
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
`src/transform/legalize_negative_index.cc` (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
- GitHub Check: Quick Lint
🔇 Additional comments (3)
src/transform/legalize_negative_index.cc (3)
42-58: Good improvement for scalar index handling.

The explicit separation of scalar index analysis improves code clarity. The logic correctly handles the three states (non-negative, negative, unknown) with appropriate warning messages for debugging.
113-121: LGTM!

The control flow correctly handles the determined vector states (non-negative and negative) by pushing the state and continuing, leaving the unknown case to fall through to the existing warning logic.
73-85: Manual verification required for overflow and sentinel value handling.

The code contains arithmetic operations that could potentially overflow (`s_min * (lanes - 1)` and `s_max * (lanes - 1)`), but the author's comment states this is intentionally not guarded because sentinel values (representing ±∞) are used for bounds.

While I confirmed the code matches the review comment and found similar patterns throughout the codebase without explicit overflow checks, I was unable to locate the `ConstIntBound` class definition to verify:

- Which sentinel constants are actually used
- How arithmetic operations behave with these sentinel values
- Whether the author's claim is technically correct

Please verify whether TVM's sentinel value design actually prevents the overflow issue or if overflow guards should be added.
```cpp
int64_t base_min = base_bound->min_value;
int64_t base_max = base_bound->max_value;
int64_t s_min = stride_bound->min_value;
int64_t s_max = stride_bound->max_value;

// Guard against overflow is not strictly necessary here because
// bounds may be +/-inf represented by sentinel values.
int64_t lower = base_min;
if (s_min < 0)
  lower += s_min * (lanes - 1);
int64_t upper = base_max;
if (s_max > 0)
  upper += s_max * (lanes - 1);
```
Guard against overflow when combining ramp bounds
Here `lower += s_min * (lanes - 1)` and the analogous `upper` update operate directly on the results of `const_int_bound`. When either `base_bound` or `stride_bound` returns the sentinel ±∞ (the common case when the analyzer can't tighten a bound), these additions run on `INT64_MIN`/`INT64_MAX`. That triggers signed overflow, which is UB in C++, and in practice can flip the sign so we misclassify a definitely-negative ramp as non-negative (or vice versa). That's a correctness breaker for the legalization pass.
Please short-circuit before the arithmetic (e.g. if either operand is ConstIntBoundNode::kNegInf/kPosInf, keep the sentinel) or compute the ramp endpoints via the analyzer instead of manual int64 math so we never overflow.
```diff
- int64_t lower = base_min;
- if (s_min < 0)
-   lower += s_min * (lanes - 1);
- int64_t upper = base_max;
- if (s_max > 0)
-   upper += s_max * (lanes - 1);
+ int64_t lower = base_min;
+ int64_t upper = base_max;
+ int64_t lane_span = static_cast<int64_t>(lanes - 1);
+ if (s_min < 0) {
+   if (lower == arith::ConstIntBoundNode::kNegInf ||
+       lower == arith::ConstIntBoundNode::kPosInf ||
+       s_min == arith::ConstIntBoundNode::kNegInf ||
+       s_min == arith::ConstIntBoundNode::kPosInf) {
+     lower = arith::ConstIntBoundNode::kNegInf;
+   } else {
+     __int128 tmp = static_cast<__int128>(lower) +
+                    static_cast<__int128>(s_min) * lane_span;
+     if (tmp <= std::numeric_limits<int64_t>::min()) {
+       lower = arith::ConstIntBoundNode::kNegInf;
+     } else if (tmp >= std::numeric_limits<int64_t>::max()) {
+       lower = arith::ConstIntBoundNode::kPosInf;
+     } else {
+       lower = static_cast<int64_t>(tmp);
+     }
+   }
+ }
+ if (s_max > 0) {
+   if (upper == arith::ConstIntBoundNode::kPosInf ||
+       upper == arith::ConstIntBoundNode::kNegInf ||
+       s_max == arith::ConstIntBoundNode::kPosInf ||
+       s_max == arith::ConstIntBoundNode::kNegInf) {
+     upper = arith::ConstIntBoundNode::kPosInf;
+   } else {
+     __int128 tmp = static_cast<__int128>(upper) +
+                    static_cast<__int128>(s_max) * lane_span;
+     if (tmp >= std::numeric_limits<int64_t>::max()) {
+       upper = arith::ConstIntBoundNode::kPosInf;
+     } else if (tmp <= std::numeric_limits<int64_t>::min()) {
+       upper = arith::ConstIntBoundNode::kNegInf;
+     } else {
+       upper = static_cast<int64_t>(tmp);
+     }
+   }
+ }
```

🤖 Prompt for AI Agents
In src/transform/legalize_negative_index.cc around lines 73 to 85, the code adds
base_min and s_min*(lanes-1) (and analogously for upper) without handling
sentinel +/-INF from const_int_bound, which causes signed overflow and UB when
base_bound or stride_bound is kNegInf/kPosInf; modify the logic to short-circuit
whenever base_bound->min_value or stride_bound->min_value (and the corresponding
max values) equal ConstIntBoundNode::kNegInf or kPosInf so that you preserve the
sentinel instead of performing the multiplication/addition, or else compute the
ramp endpoints using the analyzer APIs that return ConstIntBoundNodes rather
than doing raw int64 math, ensuring no arithmetic is performed on
INT64_MIN/INT64_MAX sentinels.
…cast node (tile-ai#1207)

* [Enhancement] Improve handling of negative indices in legalize_negative_index pass
* Added logic to handle scalar and vector indices separately, enhancing the ability to determine non-negativity and negativity of indices.
* Introduced detailed logging for cases where non-negativity cannot be proven, improving debugging capabilities.
* Refactored index state determination for vector types, including support for Ramp and Broadcast nodes.
* Fix incorrect lane handling in legalize_negative_index pass by dereferencing lanes to obtain the correct integer value.
* Enhance legalize_negative_index pass by including necessary header for TIR operations. This addition supports improved functionality and maintainability of the transformation logic.