[Refactor] Improve assertion handling in CodeGenCHost and ArgBinder #1352

LeiWang1999 · 2025-11-27T11:19:58Z

This commit refines the assertion message generation in CodeGenCHost by optimizing the handling of equality checks and reducing buffer size for error messages. Additionally, it enhances the ArgBinder by introducing a nullable guard mechanism for assertions, allowing for more precise error handling when binding arguments. The changes improve the clarity and efficiency of assertion handling across the codebase.

Summary by CodeRabbit

Bug Fixes
- Safer assertion messages and NULL-aware argument bindings to avoid null dereferences and spurious assertions.
Performance
- Merge/flatten consecutive identical-condition branches and add an extra simplification pass to reduce control-flow overhead.
Improvements
- More concise, informative runtime diagnostics for argument/dtype validation; unused/optional inputs are skipped where appropriate.
- New runtime helpers to produce richer dtype-mismatch errors.
Documentation
- Detailed internal guide on host-side argument checks and troubleshooting.
Tools
- New reproducible host-check scripts and an orchestrator to run and log them.



github-actions · 2025-11-27T11:20:06Z

👋 Hi! Thank you for contributing to the TileLang project.

Please remember to run pre-commit run --all-files in the root directory of the project to ensure your changes are properly linted and formatted. This will help ensure your contribution passes the format check.

We appreciate you taking this step! Our team will review your contribution, and we look forward to your awesome work! 🚀

coderabbitai · 2025-11-27T11:20:07Z

Caution

Review failed

The pull request is closed.

Walkthrough

Simplify host assertion messages to handle single EQ comparisons; propagate NULL-awareness through ArgBinder and BindDLTensor with an is_used path; add MergeIfStmt pass and invoke it in MakePackedAPI; add runtime dtype error helpers, documentation, and repro scripts for host-side tensor checks.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Host assertion formatting: `src/target/codegen_c_host.cc` | Replace multi-EQ diagnostic logic with a targeted single-`EQNode` path that prints LHS/RHS values, reduce buffer (1024→512), build message via one `snprintf`, use explicit `tvm::tir::StringImmNode` casts and `ICHECK`, preserve fallback for non-EQ conditions. |
| Nullable guards & Arg binding: `src/transform/arg_binder.cc`, `src/transform/arg_binder.h` | Add `nullable_guard` support to `BinderAddAssert`, propagate NULL-aware checks through `BindNullable` and `BindDLTensor`, convert OR-style null checks into guarded `IfThenElse`/`SeqStmt`, add `is_used` parameter to `BindDLTensor`, and update messages to avoid NULL dereference. |
| MakePackedAPI integration & used-buffer detection: `src/transform/make_packed_api.cc` | Add pre-pass maps (`data_var2param`, `shape_var2params`) and `UsedBufferDetector` to mark used buffers; pass `is_used`/display names into `BindDLTensor`; include `merge_if_stmt.h`; invoke `MergeIfStmtSubstitute` after MakePackedAPI. |
| If-statement merging pass: `src/transform/merge_if_stmt.cc`, `src/transform/merge_if_stmt.h` | New rewriter with `Apply`, `FlattenAppend`, and wrappers `MergeIfStmtSubstitute(PrimFunc&)` / `ApplyMergeIfStmt(Stmt)`; flatten `SeqStmt` and merge consecutive `IfThenElse` nodes with identical conditions (no else) into a single `IfThenElse` whose then-branch is a `SeqStmt`. |
| Runtime dtype error helpers: `src/runtime/error_helpers.cc` | Add internal helpers `DTypeMismatch` and `DTypeMismatchNoNames`, register FFI exports `tilelang_error_dtype_mismatch` and `tilelang_error_dtype_mismatch2`, and initialize registration via a static init block. |
| Adapter & JIT adjustments: `tilelang/jit/adapter/tvm_ffi.py`, `testing/python/jit/test_tilelang_jit_nullptr.py` | Remove extraction of `global_symbol` and drop an input-dtype validation block in `tvm_ffi.py`; update pointer-based test to tensor-based kernel and adjust test invocation/signatures accordingly. |
| Pipeline change: `tilelang/engine/phase.py` | Insert an extra simplify pass: call `tilelang.transform.Simplify()(mod)` after `MakePackedAPI` in OptimizeForTarget. |
| Docs & repro scripts: `docs/compiler_internals/tensor_checks.md`, `docs/index.md`, `maint/host_checks/*`, `maint/host_checks/common.py`, `maint/host_checks/run_all.py` | Add comprehensive host-side tensor-checks documentation, README, common kernel helpers, 10 repro scripts covering argument/shape/dtype/stride/device/null/scalar errors, and a `run_all.py` orchestrator that logs and classifies results. |
| Header/API update: `src/transform/arg_binder.h` | Update `ArgBinder::BindDLTensor` signature to include `bool is_used`. |
| Misc / examples / ignore: `.gitignore`, `examples/quickstart.py` | Add `.gitignore` rule for `maint/host_checks/logs/*`; adjust quickstart example call site/name for generated kernel source retrieval. |

Sequence Diagram(s)

sequenceDiagram
    autonumber
    participant Detector as UsedBufferDetector (pre-pass)
    participant MakePackedAPI as MakePackedAPI pass
    participant ArgBinder as ArgBinder / BindDLTensor
    participant MergeIf as MergeIfStmtSubstitute
    participant CodeGen as CodeGenCHost

    Detector->>MakePackedAPI: build data_var2param / shape_var2params, mark used buffers
    MakePackedAPI->>ArgBinder: call BindDLTensor(buffer, ..., arg_name, is_used)
    ArgBinder->>ArgBinder: emit guarded assertions (nullable_guard → IfThenElse) or bind non-NULL path
    MakePackedAPI->>MergeIf: invoke MergeIfStmtSubstitute on lowered PrimFunc
    MergeIf->>MergeIf: flatten SeqStmt and merge consecutive IfThenElse with same condition
    MergeIf->>MakePackedAPI: return transformed PrimFunc
    MakePackedAPI->>CodeGen: hand off transformed PrimFunc
    CodeGen->>CodeGen: emit host asserts (single‑EQ diagnostics) and call runtime dtype helpers as needed

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Review focus:
- src/transform/arg_binder.{cc,h} — correctness of nullable_guard propagation, is_used semantics, and guarded assertion sequences.
- src/transform/merge_if_stmt.cc — correctness of FlattenAppend, flattening invariants, and merging logic ensuring semantics preserved.
- src/target/codegen_c_host.cc — both AssertStmt handlers for buffer sizing, escaping, and safety.
- src/transform/make_packed_api.cc — used-buffer detection accuracy and correct propagation of is_used/display names.
- src/runtime/error_helpers.cc — FFI registration and message construction edge cases.

Possibly related PRs

[Refactor] Improve assertion handling in CodeGenCHost and ArgBinder #1352 — Directly related changes touching CodeGenCHost assertions and ArgBinder nullable-guard/BinderAddAssert updates.
[Feature] Support None type as input for T.ptr and T.Tensor #1114 — Related NULL/None input support and adapter/test adjustments overlapping BindDLTensor and nullable guards.

Poem

🐰
I nibbled asserts down to tidy bits,
Wove null-safe fences around the pits,
I folded Ifs in tidy rows,
Logged dtype cries and tidy prose —
Hop, test, patch — then munch on bits! 🥕

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 9.80% which is insufficient. The required threshold is 80.00%. | You can run `@coderabbitai generate docstrings` to improve docstring coverage. |

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title accurately describes the main focus of the changeset: refactoring assertion handling in CodeGenCHost and ArgBinder with improved mechanisms. |

📜 Recent review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b45bb37 and e79d06b.

📒 Files selected for processing (1)

examples/quickstart.py (1 hunks)


coderabbitai

Actionable comments posted: 3

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

src/transform/merge_if_stmt.cc (2)
1-4: Fix incorrect file name in header comment.

The file header comment incorrectly states \file if_stmt_binding.cc but the actual file is merge_if_stmt.cc.
 /*!
- * \file if_stmt_binding.cc
+ * \file merge_if_stmt.cc
  * \brief Merge the If Stmt in SeqStmt
  */
72-77: Bodies are visited twice when merged into SeqStmt.

When merging bodies into a SeqStmt, the code calls this->VisitStmt(SeqStmt(current_if_bodies)). However, current_if_bodies contains then_case statements from already-processed if_nodes (lines 68 and 83). These statements come from flat_seq, which was already visited at lines 53-54.

This means the bodies are visited twice, which could cause issues with transformations that aren't idempotent.

Consider removing the redundant VisitStmt call since bodies were already visited:
               auto if_stmt =
                   IfThenElse(current_condition,
                              current_if_bodies.size() == 1
                                  ? current_if_bodies[0]
-                                 : this->VisitStmt(SeqStmt(current_if_bodies)),
+                                 : SeqStmt(current_if_bodies),
                              Stmt());
Apply this change to all three occurrences (lines 76, 94, 109).

Also applies to: 90-95, 105-110

🧹 Nitpick comments (2)

src/transform/arg_binder.cc (1)

44-55: Consider whether the trailing Evaluate(0) in SeqStmt is necessary.

The pattern SeqStmt({check, Evaluate(0)}) wraps the guarded assertion with a nop statement. While this works, it creates slightly larger IR trees. If this is intentional for pass compatibility reasons, a brief comment explaining why would be helpful.
src/transform/merge_if_stmt.h (1)
40-40: Consider using const reference or value parameter.

The function takes a non-const reference but also returns the modified PrimFunc. This is an unusual API pattern - typically you'd either:

Take by value/const-ref and return the modified copy, or

Take by non-const ref, mutate in-place, and return void

The current signature allows mutation of the input while also returning it, which could be confusing.

Consider changing to:
-PrimFunc MergeIfStmtSubstitute(PrimFunc &f);
+PrimFunc MergeIfStmtSubstitute(PrimFunc f);
This would require updating the implementation in merge_if_stmt.cc accordingly.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b8240b7 and 2f9d02e.

📒 Files selected for processing (5)

src/target/codegen_c_host.cc (1 hunks)
src/transform/arg_binder.cc (16 hunks)
src/transform/make_packed_api.cc (2 hunks)
src/transform/merge_if_stmt.cc (4 hunks)
src/transform/merge_if_stmt.h (1 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-09-12T09:47:46.474Z

Learnt from: kurisu6912
Repo: tile-ai/tilelang PR: 794
File: tilelang/transform/add_bufstore_wrapper.py:30-33
Timestamp: 2025-09-12T09:47:46.474Z
Learning: In TVM's PyStmtExprMutator, visit_block_ methods typically call super().visit_block_(op) to process child nodes and update internal state, but return the original op when the block itself doesn't need transformation. The pattern `return op` is correct for blocks that serve as containers where mutations happen at deeper levels.

Applied to files:

src/target/codegen_c_host.cc

🧬 Code graph analysis (3)

src/transform/merge_if_stmt.h (1)

src/transform/merge_if_stmt.cc (9)

MergeIfStmtRewriter (35-35)

MergeIfStmtSubstitute (118-120)

MergeIfStmtSubstitute (118-118)

f (24-27)

f (24-24)

ApplyMergeIfStmt (122-122)

ApplyMergeIfStmt (122-122)

stmt (29-32)

stmt (29-29)

src/transform/make_packed_api.cc (1)

src/transform/merge_if_stmt.cc (2)

MergeIfStmtSubstitute (118-120)

MergeIfStmtSubstitute (118-118)

src/transform/merge_if_stmt.cc (1)

src/transform/if_stmt_binding.cc (2)

f (22-26)

f (22-22)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Quick Lint

🔇 Additional comments (2)

src/target/codegen_c_host.cc (1)

358-383: LGTM! The simplified single-EQ assertion path is cleaner.

The refactored code now handles only single equality checks directly, which is more straightforward than the previous multi-EQ collection approach. The 512-byte buffer is reasonable for typical assertion messages.

One minor observation: the format specifier %lld expects long long, and you're casting to (long long) which is correct.

src/transform/make_packed_api.cc (1)

469-470: LGTM! Integration of MergeIfStmt pass is correctly placed.

The MergeIfStmtSubstitute call after MakePackedAPI ensures that the consecutive if-statements generated by the nullable guard pattern get merged, which should improve the generated C code quality.
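
The merging behaviour described here can be illustrated with a toy model. The sketch below is not the C++ pass in merge_if_stmt.cc: it assumes a hypothetical representation in which an if-statement is a tuple `("if", cond, body)` without an else-branch, every other statement is an opaque string, and conditions are compared by plain equality.

```python
def merge_consecutive_ifs(stmts):
    """Merge adjacent ("if", cond, body) entries that share the same cond.

    Toy model only: tuples stand in for IfThenElse nodes and strings stand
    in for any other statement.
    """
    merged = []
    for stmt in stmts:
        is_if = isinstance(stmt, tuple) and stmt[0] == "if"
        prev = merged[-1] if merged else None
        prev_is_if = isinstance(prev, tuple) and prev[0] == "if"
        if is_if and prev_is_if and prev[1] == stmt[1]:
            # Same guard as the previous if: extend its body instead of
            # emitting a second branch.
            merged[-1] = ("if", prev[1], prev[2] + list(stmt[2]))
        else:
            merged.append(("if", stmt[1], list(stmt[2])) if is_if else stmt)
    return merged


seq = [
    ("if", "!A_is_null", ["assert ndim"]),
    ("if", "!A_is_null", ["assert dtype"]),
    "bind B",
    ("if", "!A_is_null", ["assert shape"]),
]
result = merge_consecutive_ifs(seq)
```

Only adjacent branches are merged, so the third guard stays separate because `"bind B"` sits between it and the first two. A real pass would also need the earlier body to leave the condition's value unchanged; the toy model takes that for granted.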

coderabbitai · 2025-11-27T11:23:50Z

src/transform/arg_binder.cc

+    PrimExpr cond = value == arg;
+    BinderAddAssert(&analyzer_, cond, arg_name, &asserts_, nullable_guard);
  }
+  // ICHECK(false);


🛠️ Refactor suggestion | 🟠 Major

Remove commented-out debug code.

The // ICHECK(false); appears to be leftover debug code that should be removed.

-  // ICHECK(false);
   return false;

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

// ICHECK(false);

return false;

🤖 Prompt for AI Agents

In src/transform/arg_binder.cc around line 142, remove the commented-out debug line "// ICHECK(false);" because it is leftover debug code; delete the comment so the file contains only active, meaningful code and no commented-out ICHECK statement.

coderabbitai · 2025-11-27T11:23:50Z

src/transform/arg_binder.cc

+      BindNullable(offset, truncmod(offset, factor), arg_name + ".elem_offset",
+                   true, is_null);
    }


⚠️ Potential issue | 🟠 Major

Potential incorrect usage of BindNullable for offset factor check.

The call BindNullable(offset, truncmod(offset, factor), ...) appears to be checking that offset == truncmod(offset, factor). For non-negative offsets that holds only when offset < factor, so it is not a divisibility check. The original logic should assert that truncmod(offset, factor) == 0. This seems like a logic error.

Consider this fix:

 if (buffer->offset_factor > 1) {
   PrimExpr offset = buffer->elem_offset;
   PrimExpr factor = make_const(offset.dtype(), buffer->offset_factor);
   PrimExpr zero = make_zero(offset.dtype());
-  BindNullable(offset, truncmod(offset, factor), arg_name + ".elem_offset",
-               true, is_null);
+  BinderAddAssert(&analyzer_, zero == truncmod(offset, factor),
+                  arg_name + ".elem_offset", &asserts_, is_null);
 }

Committable suggestion skipped: line range outside the PR's diff.
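
To see how the two conditions differ, here is a small Python sketch. `truncmod` below is a hand-written stand-in for truncated (C-style) remainder on integers, not the TVM function, and the factor of 4 is an arbitrary example value.

```python
def truncmod(a, b):
    """Integer remainder that takes the sign of the dividend (C-style)."""
    q = abs(a) // abs(b)
    if (a < 0) != (b < 0):
        q = -q
    return a - q * b


factor = 4
offsets = range(0, 10)

# Condition produced by binding offset to truncmod(offset, factor)
equals_own_remainder = [o for o in offsets if o == truncmod(o, factor)]

# Condition that actually enforces alignment to `factor`
divisible = [o for o in offsets if truncmod(o, factor) == 0]
```

With these inputs the first list holds the offsets smaller than the factor (0 through 3), while the second holds the multiples of the factor (0, 4, 8). The two conditions agree only at offset 0.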

🤖 Prompt for AI Agents

In src/transform/arg_binder.cc around lines 556 to 558, the BindNullable call currently uses BindNullable(offset, truncmod(offset, factor), ...), which effectively checks offset == truncmod(offset, factor) (for non-negative offsets, true only when offset < factor); instead it should assert that truncmod(offset, factor) == 0. Change the call so the first argument is truncmod(offset, factor) and the second argument is 0 (preserving the same arg_name, nullable flag and is_null), i.e. bind the truncated modulus and compare it to zero to correctly enforce divisibility by factor.

coderabbitai · 2025-11-27T11:23:50Z

src/transform/merge_if_stmt.h

+namespace tvm {
+namespace tl {
+
+using namespace tir;


🛠️ Refactor suggestion | 🟠 Major

Avoid using namespace in header files.

Placing using namespace tir; in a header pollutes the namespace for all translation units that include this header. This can cause name collisions and unexpected behavior.

Consider removing this and using qualified names in declarations:

-using namespace tir;
-
 // Forward declaration
 class MergeIfStmtRewriter;
 ...
-PrimFunc MergeIfStmtSubstitute(PrimFunc &f);
+tir::PrimFunc MergeIfStmtSubstitute(tir::PrimFunc &f);
-Stmt ApplyMergeIfStmt(Stmt stmt);
+tir::Stmt ApplyMergeIfStmt(tir::Stmt stmt);

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents

In src/transform/merge_if_stmt.h around line 14, remove the top-level 'using namespace tir;' which pollutes global namespaces in all translation units; instead, qualify all references to tir types/functions in this header with the 'tir::' prefix (or introduce a local namespace alias like 'namespace t = tir;' within the header and use 't::' qualifiers), or move any convenience 'using' directives into the corresponding .cpp file or into limited function scopes; also add any needed forward declarations or includes so the fully-qualified names resolve.

This commit enhances the matmul kernel by introducing additional tensor parameters and refining the pipeline stages for improved performance. It also updates the argument binding mechanism to include a flag indicating whether buffers are used, enhancing the efficiency of buffer management. Furthermore, the optimization phase in the engine is improved by adding a simplification step, ensuring better performance and clarity in the generated code.

…nding assertions

This commit introduces a new documentation page for host-side tensor checks, detailing the automatic validations performed by TileLang on kernel arguments. It enhances the ArgBinder by adding assertions for non-null pointers when arguments are used, improving error handling. Additionally, the optimization phase in the engine is updated to include a simplification step, ensuring better performance and clarity in the generated code.

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (2)

src/transform/arg_binder.cc (2)
142-142: Remove commented-out debug code.

The // ICHECK(false); is leftover debug code that should be removed.
-  // ICHECK(false);
   return false;
563-564: Incorrect usage of BindNullable for divisibility check.

The call BindNullable(offset, truncmod(offset, factor), ...) is the second binding of offset (it was already bound at lines 556-557), so it becomes a constraint check asserting that offset == truncmod(offset, factor). For non-negative offsets that holds only when offset < factor, which is not a divisibility check.

The correct check should be truncmod(offset, factor) == 0.
-      BindNullable(offset, truncmod(offset, factor), arg_name + ".elem_offset",
-                   true, is_null);
+      PrimExpr zero = make_zero(offset.dtype());
+      BinderAddAssert(&analyzer_, zero == truncmod(offset, factor),
+                      arg_name + ".elem_offset", &asserts_, is_null);
Based on past review comments.

🧹 Nitpick comments (4)

src/transform/arg_binder.cc (2)
44-54: Simplify the NULL-guarded assertion wrapping.

When nullable_guard is defined, the assertion is wrapped in SeqStmt({check, Evaluate(0)}). This extra wrapping appears unnecessary since check is already a Stmt.

Consider simplifying:
-    Stmt check = AssertStmt(scond, StringImm(os.str()), Evaluate(0));
-    check = IfThenElse(Not(nullable_guard), check);
-    asserts->emplace_back(SeqStmt({check, Evaluate(0)}));
+    Stmt check = IfThenElse(Not(nullable_guard), 
+                           AssertStmt(scond, StringImm(os.str()), Evaluate(0)));
+    asserts->emplace_back(check);
This would make the code more concise and avoid the unnecessary SeqStmt wrapper.

327-330: Consider simplifying the SeqStmt wrapping pattern.

The pattern SeqStmt({check, nop}) appears multiple times throughout this function (lines 330, 409, 493, 546, 629). This wrapping seems unnecessary since the check is already a Stmt.

Consider simplifying to just:
-  Stmt ndim_check = AssertStmt(a_ndim == v_ndim, msg, nop);
-  ndim_check = IfThenElse(Not(is_null), ndim_check);
-  init_nest_.emplace_back(SeqStmt({ndim_check, nop}));
+  Stmt ndim_check = IfThenElse(Not(is_null), 
+                              AssertStmt(a_ndim == v_ndim, msg, nop));
+  init_nest_.emplace_back(ndim_check);
This would make the code more concise and consistent.
docs/compiler_internals/tensor_checks.md (1)
74-114: Clarify nullability rules for symbolic runtime conditions.

The nullability examples are helpful, but example #4 (lines 107-113) states that tensors are non-nullable when the condition is only known at runtime. This is consistent with conservative static analysis, but it might be helpful to explain why this design choice was made.

Consider adding a brief note:
 4) Must be non-NULL (runtime condition)
 ```python
 @T.prim_func
 def main(A: T.Tensor((M, K), dtype), some_cond: T.bool):
     if some_cond:
         A[0] = 1
 ```
 Since some_cond is only known at runtime, static analysis cannot prove A is unused; A is thus non-nullable.
+
+Rationale: Conservative static analysis treats dynamically-gated accesses as "potentially used" to avoid runtime errors when the condition evaluates to true.
src/transform/make_packed_api.cc (1)
546-546: Document why MergeIfStmtSubstitute is applied here.

MergeIfStmtSubstitute(func) is called after MakePackedAPI to merge consecutive if statements with identical conditions. This makes sense given that NULL-guarded assertions introduce many conditional checks.

However, it's not immediately obvious why this happens inside the MakePackedAPI transformation rather than as a separate pass in the pipeline.

Consider adding a comment:
         func = MakePackedAPI(std::move(func));
+        // Merge consecutive if-statements with identical NULL-guards to simplify the generated code
         func = MergeIfStmtSubstitute(func);
Alternatively, could this be applied as a standalone pass in phase.py instead?

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2f9d02e and 3070a89.

📒 Files selected for processing (6)

docs/compiler_internals/tensor_checks.md (1 hunks)
docs/index.md (1 hunks)
src/transform/arg_binder.cc (18 hunks)
src/transform/arg_binder.h (1 hunks)
src/transform/make_packed_api.cc (4 hunks)
tilelang/engine/phase.py (1 hunks)

✅ Files skipped from review due to trivial changes (1)

docs/index.md

🧰 Additional context used

🧬 Code graph analysis (2)

tilelang/engine/phase.py (2)

src/transform/simplify.cc (2)

Simplify (530-538)

Simplify (530-530)

tilelang/transform/simplify.py (1)

Simplify (20-28)

src/transform/make_packed_api.cc (1)

src/transform/merge_if_stmt.cc (4)

op (47-115)

op (47-47)

MergeIfStmtSubstitute (118-120)

MergeIfStmtSubstitute (118-118)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Quick Lint

🔇 Additional comments (12)

tilelang/engine/phase.py (1)
228-228: Clarify the necessity of this additional Simplify pass.

An additional Simplify() pass is added immediately after MakePackedAPI(). While this may help fold redundant conditions introduced by the NULL-safety transformations, it's not clear whether this pass is essential or optional.

Please clarify:

Does this Simplify pass address specific patterns introduced by MakePackedAPI (e.g., redundant NULL checks or guards)?

Have you measured the impact on compile time vs. code quality?

Could this be combined with the existing Simplify calls at lines 183 and 188, or is the placement after MakePackedAPI critical?

Consider documenting the rationale inline:
 mod = tilelang.transform.MakePackedAPI()(mod)
+# Simplify to fold redundant NULL-guards and conditions introduced by MakePackedAPI
 mod = tilelang.transform.Simplify()(mod)
src/transform/arg_binder.cc (5)
146-168: LGTM: Bind_ correctly maintains non-nullable assertions.

The Bind_ function correctly continues to emit unconditional assertions without nullable guards, which is appropriate for required (non-nullable) bindings.

277-299: LGTM: is_used parameter correctly enforces NULL-safety.

The is_used parameter appropriately distinguishes between buffers that must be non-NULL (used by the function) and those that may be NULL (unused). The logic correctly enforces a non-NULL assertion when is_used is true.
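
The rule can be modelled in a few lines of Python. This is a hypothetical model of the behaviour described above, not the IR that ArgBinder emits: `check_tensor_arg`, its parameters, the dict-based handle, and the error strings are all illustrative assumptions.

```python
def check_tensor_arg(name, handle, is_used, expected_ndim):
    """Toy model of NULL-aware binding for one tensor argument.

    `handle` is None (standing in for NULL) or a dict with an "ndim" entry.
    Returns a list of error strings; an empty list means every check passed.
    """
    errors = []
    is_null = handle is None
    if is_used and is_null:
        # A buffer that the kernel body touches must not be NULL.
        errors.append(f"{name} must be a non-NULL tensor")
        return errors
    if not is_null:
        # Field checks are guarded so a NULL handle is never dereferenced.
        if handle["ndim"] != expected_ndim:
            errors.append(f"{name}.ndim must equal {expected_ndim}")
    return errors


unused_null = check_tensor_arg("A", None, is_used=False, expected_ndim=2)
used_null = check_tensor_arg("A", None, is_used=True, expected_ndim=2)
wrong_rank = check_tensor_arg("A", {"ndim": 3}, is_used=False, expected_ndim=2)
```

An unused argument passed as `None` yields no errors, a used one yields exactly one, and a non-NULL argument is still validated whether or not it is used.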

584-600: LGTM: Enhanced device type error messages.

The device type check is correctly guarded by is_null, and the enhanced error message with the DLPack device code legend (line 594-595) will help users diagnose device mismatch issues more easily.

622-629: LGTM: Data pointer NULL check correctly handles edge cases.

The data pointer check correctly:

Allows NULL data pointers for size-0 arrays (alloc_size == 0)

Guards the check with is_null to skip when the handle itself is NULL

Provides a clear error message

439-448: I encountered a repository access issue. To verify this review comment about the potential NULL pointer dereference, I'll need additional information from you. Could you please provide:

The full context of lines 400-450 from src/transform/arg_binder.cc (including the sections mentioned: lines 406-409 for dtype check, lines 420-426 for buf_shape initialization, and lines 439-448 for the shape binding)

The implementation of BindNullable function to understand how it guards nullable operations

Usage examples of similar patterns in the codebase where nullable buffer operations are performed

Alternatively, if you have access to the repository, you could share the output of:
sed -n '400,450p' src/transform/arg_binder.cc
Once you provide this context, I'll be able to verify whether the NULL pointer dereference concern is valid and whether the proposed fix is necessary.
src/transform/arg_binder.h (1)

106-108: LGTM: Signature change for is_used parameter.

The addition of the bool is_used parameter to BindDLTensor is consistent with the implementation in arg_binder.cc and enables NULL-safe buffer handling.

docs/compiler_internals/tensor_checks.md (2)

1-56: LGTM: Well-structured documentation introduction.

The introduction clearly explains:

Why host-side checks are performed (ABI stability, low overhead, focused errors)

How to inspect the generated host source for debugging

What checks are performed

The documentation structure is logical and easy to follow.

236-381: LGTM: Comprehensive troubleshooting examples.

The minimal reproduction examples for each error type are extremely valuable for users debugging issues. The examples cover all major error scenarios:

Argument count/type mismatches

Shape/stride/dtype mismatches

Device mismatches

NULL pointer issues

These will significantly reduce support burden.

src/transform/make_packed_api.cc (3)

301-325: LGTM: Buffer-to-parameter mapping enables usage tracking.

The data_var2param and shape_var2params mappings correctly establish the relationship between:

Buffer data variables and their parameter variables

Buffer shape/stride variables and their parameter variables

This enables accurate detection of which buffers are actually used by the function body, supporting NULL-safe binding for unused buffers.

467-470: LGTM: is_used flag correctly propagated.

The used_param_buffers.count(var.get()) check correctly determines whether each buffer is used, passing the appropriate boolean to BindDLTensor. This enables NULL-safe handling of unused buffers.

329-366: The UsedBufferDetector correctly handles buffer usage through the recursive visitor pattern.

The implementation uses StmtExprVisitor, which recursively traverses the entire expression and statement tree. This means:

Variables in Let bindings are caught by VisitExpr_(const VarNode*) during recursive traversal

Variables captured in closures or passed to functions are similarly visited as VarNode instances

BufferStore and BufferLoad operations explicitly extract the underlying buffer->data reference

The three detection methods (direct variables, BufferStore, BufferLoad) are sufficient for the visitor's purpose. However, if AddressOf operations exist in the codebase that take buffer addresses without explicit BufferLoad/BufferStore context, those might not be detected—verify this specific pattern if it's used in the codebase.
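
As a rough illustration of the three detection paths, the sketch below walks a toy nested-tuple IR. It is not the StmtExprVisitor-based C++ class: the node kinds (`"var"`, `"load"`, `"store"`, `"seq"`), the function name, and the variable names are assumptions made for this example.

```python
def collect_used_params(node, data_var2param, used=None):
    """Record every parameter whose data variable is referenced in `node`.

    Toy IR: ("var", name), ("load", data_var, index),
    ("store", data_var, index, value), ("seq", stmt, ...).
    """
    if used is None:
        used = set()
    if not isinstance(node, tuple):
        return used  # plain strings are names, not sub-nodes
    kind = node[0]
    if kind in ("var", "load", "store"):
        # In all three node kinds the referenced variable sits in slot 1.
        name = node[1]
        if name in data_var2param:
            used.add(data_var2param[name])
    # Recurse into children so nested loads inside indices/values are seen.
    for child in node[1:]:
        collect_used_params(child, data_var2param, used)
    return used


data_var2param = {"A_data": "A_handle", "B_data": "B_handle", "C_data": "C_handle"}
body = ("seq", ("store", "C_data", ("var", "i"), ("load", "A_data", ("var", "i"))))
used = collect_used_params(body, data_var2param)
```

Here `used` contains the handles for A and C but not B, so in this model only B would be eligible for the NULL-tolerant path.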

… performance

This commit adds host checks logs to the .gitignore file to prevent unnecessary log files from being tracked. Additionally, it refines the matmul kernel by adjusting pipeline stages, updating tensor parameters, and enhancing argument handling for better performance. The changes also include improved error messages in the argument binding process, ensuring clearer diagnostics for users.

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (8)

examples/quickstart.py (1)

52-55: Large matrix dimensions may be excessive for a quickstart example.

The matrix dimensions (16384×16384×16384) are very large for a quickstart example. This may cause:

Long execution times for users trying the example

High memory usage (~1.5GB+ for the tensors)

Consider using smaller dimensions (e.g., 1024 or 4096) for the example, or add a comment explaining why large sizes are needed.
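
The memory estimate can be checked with a quick calculation. The sketch assumes 2-byte elements (for example float16) for all three matrices; the dtype is not stated in this comment, so treat that as an assumption.

```python
M = N = K = 16384
bytes_per_element = 2  # assumption: float16 inputs and output

tensor_bytes = {
    "A": M * K * bytes_per_element,
    "B": K * N * bytes_per_element,
    "C": M * N * bytes_per_element,
}
total_gib = sum(tensor_bytes.values()) / 2**30

# Same calculation for a smaller size, as suggested for the quickstart
small = 1024
small_total_mib = 3 * small * small * bytes_per_element / 2**20
```

Under that assumption each matrix takes 0.5 GiB and the three together take 1.5 GiB, whereas 1024-sized matrices need about 6 MiB in total.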

src/runtime/error_helpers.cc (1)

1-3: Consider adding a license header.

This file is missing the Apache 2.0 license header that appears in other files in this project (e.g., make_packed_api.cc). For consistency and compliance, consider adding it.
src/transform/make_packed_api.cc (1)
399-404: Consider using strlen(suffix) or a named constant instead of the magic number 7.

The hardcoded 7 for the "_handle" suffix length is fragile if the suffix changes.
-        const char *suffix = "_handle";
-        if (display_name.size() >= 7 &&
-            display_name.compare(display_name.size() - 7, 7, suffix) == 0) {
-          display_name.erase(display_name.size() - 7);
+        constexpr std::string_view suffix = "_handle";
+        if (display_name.size() >= suffix.size() &&
+            display_name.compare(display_name.size() - suffix.size(), suffix.size(), suffix) == 0) {
+          display_name.erase(display_name.size() - suffix.size());
         }
maint/host_checks/06_strides_mismatch.py (1)

11-15: Consider clarifying the shape after transpose.

The transpose a.t() changes shape from (M, K) to (K, M), which may also trigger shape-related validation in addition to stride checks. If the intent is purely to test strides validation with non-contiguous memory, you could add a comment noting that both shape and stride mismatches may contribute to the error—or use a different technique to create non-contiguous memory while preserving the expected shape.

That said, this is sufficient for demonstrating strides-related failure behavior.
maint/host_checks/run_all.py (2)
44-50: Consider distinguishing environment/setup failures from “PASS” repros

Lines 44–50 treat any non-zero return code as PASS, which will also classify environment/setup errors (e.g., missing CUDA, import errors) as successful repros, even if the host checks were never exercised. If you expect this script to run in heterogeneous or partially misconfigured environments (e.g., CPU-only machines), consider special-casing well-known environment failures (like the "CUDA is not available; cannot build CUDA kernel for host-check repros." raised in maint/host_checks/common.py) as SKIP or a separate ENV_FAIL status so the summary better reflects what was actually tested.
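
One possible shape for such a classification is sketched below. It is not the code in run_all.py: `classify`, `ENV_MARKERS`, and the `ENV_FAIL` status are hypothetical names, and the sketch assumes that a zero return code means the repro did not trigger its host check.

```python
ENV_MARKERS = (
    "CUDA is not available",  # message raised in maint/host_checks/common.py
    "ModuleNotFoundError",    # assumption: a missing package in the environment
)


def classify(returncode, output):
    """Map one repro run to PASS, FAIL, or ENV_FAIL (hypothetical statuses)."""
    if returncode == 0:
        # The repro was expected to raise a host-check error but did not.
        return "FAIL"
    if any(marker in output for marker in ENV_MARKERS):
        # The script failed before the host check could be exercised.
        return "ENV_FAIL"
    return "PASS"


status_env = classify(1, "RuntimeError: CUDA is not available; cannot build CUDA kernel")
status_ok = classify(1, "host check raised an error as expected")
status_bad = classify(0, "")
```

Matching on substrings is brittle, so a marker list like this would need to track the exact messages the helper scripts raise.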

66-71: Let main() return an exit code and only call sys.exit in the CLI entrypoint

Having main() call sys.exit directly makes it harder to re-use or unit-test. A small refactor lets main() return an int and keeps the CLI behavior identical:
 def main():
@@
-    # Exit non-zero if any FAIL
-    sys.exit(1 if counts.get("FAIL", 0) else 0)
+    # Return non-zero if any FAIL
+    return 1 if counts.get("FAIL", 0) else 0
@@
 if __name__ == "__main__":
-    main()
+    raise SystemExit(main())
This preserves the current exit codes when run as a script but gives you a pure callable for tests and tooling.
maint/host_checks/common.py (2)
35-50: Align CUDA-availability guarding between matmul and scalar-check kernels

build_matmul_kernel (Line 37) guards CUDA targets with torch.cuda.is_available(), but build_scalar_check_kernel (Lines 44–50) will still attempt tilelang.compile(..., target="cuda") even when CUDA is unavailable. For consistency and clearer failures when running maint/host_checks/10_scalar_type_mismatch.py, consider factoring the guard into a shared helper and reusing it:
 import tilelang
 import tilelang.language as T
 import torch
 
 
+def _ensure_cuda_available(target: str) -> None:
+    if target.startswith("cuda") and not torch.cuda.is_available():
+        raise RuntimeError(
+            "CUDA is not available; cannot build CUDA kernel for host-check repros."
+        )
+
+
 def make_matmul_prim(M,
@@
 def build_matmul_kernel(M=1024, N=1024, K=1024, target="cuda"):
     """Compile and return a callable kernel that takes (A, B) and returns C."""
-    if target.startswith("cuda") and not torch.cuda.is_available():
-        raise RuntimeError("CUDA is not available; cannot build CUDA kernel for host-check repros.")
+    _ensure_cuda_available(target)
@@
 def build_scalar_check_kernel(target="cuda"):
-
-    @T.prim_func
+    _ensure_cuda_available(target)
+
+    @T.prim_func
     def scalar_check(x: T.int32, flag: T.bool()):
         T.evaluate(0)
This keeps behavior identical for matmul kernels and makes the scalar-check path fail fast with the same, more informative message on non-CUDA hosts.

46-49: Silence Ruff ARG001 for intentionally unused scalar_check parameters

Static analysis (Ruff ARG001) correctly notes that x and flag in scalar_check are unused, but they are required by the prim_func signature so the host-side type checker can fire before running the body. To document that this is intentional and keep linters quiet, you can add a noqa on the def line:
-    @T.prim_func
-    def scalar_check(x: T.int32, flag: T.bool()):
+    @T.prim_func
+    def scalar_check(x: T.int32, flag: T.bool()):  # noqa: ARG001
         T.evaluate(0)
This avoids changing kernel behavior while making the intent explicit. Based on static analysis hints.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3070a89 and e2cbf27.

📒 Files selected for processing (18)

.gitignore (1 hunks)
examples/quickstart.py (3 hunks)
maint/host_checks/01_num_args_mismatch.py (1 hunks)
maint/host_checks/02_pointer_type_error.py (1 hunks)
maint/host_checks/03_ndim_mismatch.py (1 hunks)
maint/host_checks/04_dtype_mismatch.py (1 hunks)
maint/host_checks/05_shape_mismatch.py (1 hunks)
maint/host_checks/06_strides_mismatch.py (1 hunks)
maint/host_checks/07_device_type_mismatch.py (1 hunks)
maint/host_checks/08_device_id_mismatch.py (1 hunks)
maint/host_checks/09_null_data_pointer.py (1 hunks)
maint/host_checks/10_scalar_type_mismatch.py (1 hunks)
maint/host_checks/README.md (1 hunks)
maint/host_checks/common.py (1 hunks)
maint/host_checks/run_all.py (1 hunks)
src/runtime/error_helpers.cc (1 hunks)
src/transform/make_packed_api.cc (8 hunks)
tilelang/jit/adapter/tvm_ffi.py (0 hunks)

💤 Files with no reviewable changes (1)

tilelang/jit/adapter/tvm_ffi.py

✅ Files skipped from review due to trivial changes (2)

.gitignore
maint/host_checks/README.md

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-11-14T07:56:11.098Z

Learnt from: lucifer1004
Repo: tile-ai/tilelang PR: 1256
File: testing/python/jit/test_tilelang_jit_gemm_nvrtc.py:55-115
Timestamp: 2025-11-14T07:56:11.098Z
Learning: In `testing/python/jit/test_tilelang_jit_gemm_nvrtc.py`, the global function `tilelang_callback_cuda_postproc` registered via `tvm.register_global_func(..., override=True)` is intentionally not restored after the test completes, as the persistent behavior is expected.

Applied to files:

examples/quickstart.py

🧬 Code graph analysis (12)

maint/host_checks/10_scalar_type_mismatch.py (2)

maint/host_checks/common.py (2)

build_scalar_check_kernel (44-50)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

maint/host_checks/06_strides_mismatch.py (3)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

tilelang/language/ast/ir.py (1)

target (1682-1713)

maint/host_checks/04_dtype_mismatch.py (2)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

tilelang/jit/adapter/tvm_ffi.py (1)

get_host_source (282-286)

examples/quickstart.py (3)

tilelang/transform/pass_config.py (1)

PassConfigKey (6-144)

tilelang/env.py (1)

disable_cache (275-276)

tilelang/jit/adapter/base.py (1)

get_kernel_source (93-97)

maint/host_checks/03_ndim_mismatch.py (5)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

maint/host_checks/01_num_args_mismatch.py (1)

main (10-17)

maint/host_checks/05_shape_mismatch.py (1)

main (7-15)

maint/host_checks/02_pointer_type_error.py (1)

main (10-18)

src/transform/make_packed_api.cc (1)

src/transform/merge_if_stmt.cc (4)

op (47-115)

op (47-47)

MergeIfStmtSubstitute (118-120)

MergeIfStmtSubstitute (118-118)

maint/host_checks/01_num_args_mismatch.py (2)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

maint/host_checks/09_null_data_pointer.py (4)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

maint/host_checks/01_num_args_mismatch.py (1)

main (10-17)

maint/host_checks/02_pointer_type_error.py (1)

main (10-18)

maint/host_checks/07_device_type_mismatch.py (2)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

maint/host_checks/02_pointer_type_error.py (2)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

maint/host_checks/common.py (15)

maint/host_checks/01_num_args_mismatch.py (1)

main (10-17)

maint/host_checks/03_ndim_mismatch.py (1)

main (7-15)

maint/host_checks/04_dtype_mismatch.py (1)

main (7-15)

maint/host_checks/05_shape_mismatch.py (1)

main (7-15)

maint/host_checks/06_strides_mismatch.py (1)

main (7-15)

maint/host_checks/07_device_type_mismatch.py (1)

main (7-14)

maint/host_checks/08_device_id_mismatch.py (1)

main (7-21)

maint/host_checks/10_scalar_type_mismatch.py (1)

main (6-11)

maint/host_checks/02_pointer_type_error.py (1)

main (10-18)

maint/host_checks/09_null_data_pointer.py (1)

main (14-21)

tilelang/language/kernel.py (1)

threads (214-218)

tilelang/language/allocate.py (2)

alloc_shared (28-43)

alloc_fragment (60-71)

tilelang/language/loop.py (1)

Pipelined (56-93)

tilelang/language/ast/ir.py (2)

target (1682-1713)

evaluate (1319-1331)

tilelang/jit/kernel.py (1)

out_idx (598-599)

maint/host_checks/08_device_id_mismatch.py (2)

maint/host_checks/common.py (2)

build_matmul_kernel (35-41)

main (16-30)

maint/host_checks/run_all.py (1)

main (6-67)

🪛 Ruff (0.14.6)

maint/host_checks/run_all.py

33-33: subprocess call: check for execution of untrusted input

(S603)

maint/host_checks/common.py

38-38: Avoid specifying long messages outside the exception class

(TRY003)

47-47: Unused function argument: x

(ARG001)

47-47: Unused function argument: flag

(ARG001)

maint/host_checks/08_device_id_mismatch.py

9-9: Avoid specifying long messages outside the exception class

(TRY003)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Test for Python 3.12 with ROCm-6.3 (on self-hosted-amd)
GitHub Check: Test for Python 3.12 with CUDA-12.8 (on self-hosted-nvidia)

🔇 Additional comments (22)

examples/quickstart.py (3)

8-12: LGTM on the pass configuration setup.

The pass_configs dictionary correctly uses PassConfigKey enum values to disable TMA lowering and warp specialization.

34-34: Setting num_stages=0 disables pipelining.

For a quickstart example intended to showcase TileLang performance, num_stages=0 disables the pipelining optimization. If this is intentional for testing the assertion handling changes, consider adding a comment explaining why, or restore a non-zero value for the example.

62-62: Good additions for demonstrating kernel introspection and profiling.

Printing the host source (line 62) and kernel source (lines 92-93) provides useful visibility into the generated code. The latency profiling (lines 96-100) is a good demonstration of TileLang's profiling capabilities.

Also applies to: 92-100

src/runtime/error_helpers.cc (3)

16-32: LGTM – clean error helper implementation.

The function correctly constructs DataType objects from the packed arguments and produces a clear, actionable error message with kernel name, buffer name, and expected vs. actual dtypes. The return value of -1 signals failure to the caller as intended.

36-49: LGTM – consistent variant without names.

This follows the same pattern as DTypeMismatch and provides a lightweight alternative when buffer/kernel names are not available.

54-60: LGTM – FFI registration looks correct.

The static init block properly registers both error helpers with descriptive names.

src/transform/make_packed_api.cc (5)

42-42: LGTM – new include for merge_if_stmt transformation.

The include is appropriately placed with other local includes.

301-325: LGTM – well-structured usage tracking infrastructure.

The reverse mapping from buffer data vars to parameters, along with the shape/stride variable tracking, enables precise detection of buffer usage. The PostOrderVisit approach correctly captures all variable references within buffer metadata.

329-366: LGTM – UsedBufferDetector correctly tracks buffer usage.

The visitor properly handles:

Direct var references via VisitExpr_(VarNode*)

Buffer stores and loads via dedicated visit methods

Transitive usage through shape/stride variables

One minor note: calling StmtExprVisitor::VisitExpr_ and StmtExprVisitor::VisitStmt_ after marking the buffer ensures child nodes are also visited.

482-488: LGTM – BindDLTensor now receives display name and is_used flag.

This change enables nullable handling for unused buffers, improving assertion precision. The display name uses the buffer's data var name for clearer diagnostics.

563-564: MergeIfStmtSubstitute declaration could not be verified.

The repository could not be cloned during review, so the following checks were not performed:

Checking the header files included in src/transform/make_packed_api.cc

Locating the declaration of MergeIfStmtSubstitute

Verifying the function signature matches the usage at lines 563-564

Confirming the function is properly declared before use

Manual verification is required to confirm that MergeIfStmtSubstitute is correctly declared in the included headers and that the function signature is compatible with its usage at lines 563-564.

maint/host_checks/07_device_type_mismatch.py (1)

1-18: LGTM – clear repro script for device-type mismatch.

The script correctly demonstrates the device-type mismatch scenario by passing CPU tensors to a CUDA kernel. CUDA availability is handled upstream in build_matmul_kernel (per common.py lines 36-37), which raises RuntimeError if CUDA is unavailable.

maint/host_checks/10_scalar_type_mismatch.py (1)

9-11: Only the first mismatch will be reproduced.

The first call fn(1.0, True) will likely raise an exception, preventing the second call fn(1, 2.5) from executing. If both scenarios need to be tested independently, consider separating them into distinct scripts or wrapping each in a try-except block.

If only demonstrating one mismatch case is sufficient for this repro script, this is fine as-is.
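The try-except option could look roughly like the sketch below. `fake_scalar_check` is a hypothetical stand-in for the compiled kernel so the example runs without CUDA; the exception type and message the real host check raises are not shown in this review and are assumed here.

```python
def fake_scalar_check(x, flag):
    """Stand-in for the compiled kernel: rejects wrongly typed scalars."""
    # bool is a subclass of int in Python, so exclude it explicitly for x.
    if not isinstance(x, int) or isinstance(x, bool):
        raise TypeError(f"x: expected int32, got {type(x).__name__}")
    if not isinstance(flag, bool):
        raise TypeError(f"flag: expected bool, got {type(flag).__name__}")
    return 0


def run_cases(fn, cases):
    """Call fn on every case and collect the error text (or None) for each."""
    results = []
    for args in cases:
        try:
            fn(*args)
            results.append((args, None))
        except Exception as exc:  # broad on purpose: report each repro, do not abort
            results.append((args, f"{type(exc).__name__}: {exc}"))
    return results


# The two calls from the repro script, now exercised independently.
for args, error in run_cases(fake_scalar_check, [(1.0, True), (1, 2.5)]):
    print(args, "->", error)
```

Because run_all.py counts a non-zero exit code as a successful repro, a script restructured this way would still need to exit non-zero when any case raised.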

maint/host_checks/04_dtype_mismatch.py (1)

1-19: LGTM – clear repro script for dtype mismatch.

The script correctly demonstrates a dtype mismatch by providing float32 tensor a when float16 is expected. The print(fn.get_host_source()) call is helpful for debugging the generated host code. Note that this script requires CUDA to be available (handled by build_matmul_kernel in common.py).

maint/host_checks/02_pointer_type_error.py (1)

1-22: LGTM!

The repro script is well-structured and correctly demonstrates passing an incorrect type (int instead of tensor) to trigger the expected pointer-type assertion error. The docstring clearly describes the expected behavior.

maint/host_checks/05_shape_mismatch.py (1)

1-19: LGTM!

The script correctly reproduces a shape mismatch scenario by constructing tensor a with dimension K+1 instead of the expected K. The implementation follows the established pattern for host-check scripts.

maint/host_checks/09_null_data_pointer.py (1)

1-25: LGTM!

The script appropriately documents the distinction between passing Python None versus a true DLTensor with NULL data, and correctly reproduces the intended class of pointer validation errors. The detailed docstring adds valuable context.

maint/host_checks/01_num_args_mismatch.py (1)

1-21: LGTM!

The script correctly reproduces an argument count mismatch by omitting the second input tensor. The docstring and inline comments clearly explain the expected behavior and that the error occurs at the adapter level before host entry.

maint/host_checks/03_ndim_mismatch.py (1)

1-19: LGTM!

The script correctly reproduces an ndim mismatch by constructing tensor a with shape (M, K, 1) (rank 3) instead of the expected (M, K) (rank 2). Clear and follows the established pattern.

maint/host_checks/08_device_id_mismatch.py (2)

7-12: Good handling of multi-GPU requirement.

The script properly checks for CUDA availability and device count before proceeding, using the [SKIP] pattern that run_all.py recognizes. This ensures graceful handling on single-GPU systems.

17-21: LGTM!

The device ID mismatch is correctly reproduced by placing tensor a on cuda:0 and tensor b on cuda:1. This will trigger the expected host-side device validation error.

maint/host_checks/common.py (1)

6-32: Matmul prim and tiling pattern look consistent (LGTM)

The make_matmul_prim construction (tiled A_shared/B_shared, C_local fragment, T.Pipelined over ko, and T.gemm followed by a final T.copy into C) is coherent and matches the typical tilelang GEMM pattern. No functional issues stand out here.

examples/quickstart.py

This commit refactors the tensor_null_test function by adding a with_bias parameter and removing the ptr_null_test function, which was previously unused. The run_test function is updated to reflect these changes, streamlining the testing process for tensor operations.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

testing/python/jit/test_tilelang_jit_nullptr.py (1)
23-25: Align B tensor shape annotation with its runtime layout

B is declared as T.Tensor((K, N), dtype) (Line 23) but instantiated in PyTorch with shape (N, K) (Line 53) and tiled as (block_N, block_K), i.e. (N, K). This only works silently here because N == K; for general shapes it becomes misleading and could confuse host-side checks.

Recommend updating the annotation to match actual usage:
-            B: T.Tensor((K, N), dtype),
+            B: T.Tensor((N, K), dtype),
so the prim_func signature, tiling, and test inputs stay consistent.

Also applies to: 53-57
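A toy illustration of why the mismatch goes unnoticed only when N == K. The `check_shape` helper is hypothetical and merely models a shape comparison of the kind a host-side check performs; it is not the generated host code.

```python
def check_shape(name, declared, actual):
    """Raise if a runtime shape differs from the declared one."""
    if tuple(actual) != tuple(declared):
        raise ValueError(
            f"{name}: expected shape {tuple(declared)}, got {tuple(actual)}")


def b_matches(N, K):
    """B is declared as (K, N) but the test allocates it as (N, K)."""
    try:
        check_shape("B", declared=(K, N), actual=(N, K))
        return True
    except ValueError:
        return False


print(b_matches(1024, 1024))  # square case: the two layouts coincide
print(b_matches(1024, 512))   # non-square case: the declared shape is wrong
```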

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e2cbf27 and b45bb37.

📒 Files selected for processing (1)

testing/python/jit/test_tilelang_jit_nullptr.py (2 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-11-14T07:56:11.098Z

Learnt from: lucifer1004
Repo: tile-ai/tilelang PR: 1256
File: testing/python/jit/test_tilelang_jit_gemm_nvrtc.py:55-115
Timestamp: 2025-11-14T07:56:11.098Z
Learning: In `testing/python/jit/test_tilelang_jit_gemm_nvrtc.py`, the global function `tilelang_callback_cuda_postproc` registered via `tvm.register_global_func(..., override=True)` is intentionally not restored after the test completes, as the persistent behavior is expected.

Applied to files:

testing/python/jit/test_tilelang_jit_nullptr.py

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)

GitHub Check: Test for Python 3.12 with CUDA-12.8 (on self-hosted-nvidia)
GitHub Check: Test for Python 3.12 with ROCm-6.3 (on self-hosted-amd)
GitHub Check: Test for Python 3.12 with Metal (on macos-latest)

🔇 Additional comments (1)

testing/python/jit/test_tilelang_jit_nullptr.py (1)

10-18: JIT wrapper signature and with_bias flag are consistent with the kernel body

Capturing with_bias in the closure and specializing via tensor_null_test(..., with_bias=False) matches the intended nullptr repro; the parameter ordering and defaults look good.

coderabbitai bot reviewed Nov 27, 2025


LeiWang1999 added 3 commits November 27, 2025 20:18

lint fix

a00e6d4

coderabbitai bot reviewed Nov 27, 2025


LeiWang1999 added 3 commits November 28, 2025 02:16

lint fix

8178381

lint fix

e2cbf27

coderabbitai bot reviewed Nov 27, 2025


examples/quickstart.py: two review threads, both outdated and resolved

LeiWang1999 added 3 commits November 28, 2025 03:25

lint fix

b45bb37

fix

e79d06b

LeiWang1999 merged commit 1e92d11 into tile-ai:main Nov 27, 2025
1 of 2 checks passed

coderabbitai bot reviewed Nov 27, 2025


coderabbitai bot mentioned this pull request Dec 3, 2025

[Language] Tilelang LazyJIT Experimental Version #1337

Open
