[Feat] Add tilelang T.assume support and assume injection for buffer shapes#787

Merged: LeiWang1999 merged 7 commits into tile-ai:main from kurisu6912:kurisu-bug-0902-patch-1 on Sep 5, 2025

kurisu6912 (Collaborator) commented Sep 5, 2025

Summary by CodeRabbit

  • New Features

    • Runtime buffer-shape validation is injected into generated TL/TIR code.
    • Public API added to enable the new validation pass.
  • Chores

    • Pipeline now runs the validation pass after legalization and before simplification.
    • Transform wrapper made resilient when a backend symbol is unavailable.
    • Updated third-party TVM submodule pointer.

coderabbitai bot (Contributor) commented Sep 5, 2025

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

Walkthrough

Inserts a TL/TIR pass InjectAssumes that rewrites PrimFunc bodies to add deduplicated runtime buffer-shape checks as tilelang_assume AttrStmt(s); registers and exposes the pass, integrates it into LowerAndLegalize, adds a Python wrapper, and updates the 3rdparty/tvm submodule pointer.

Changes

| Cohort / File(s) | Summary |
| TIR pass: InjectAssumes (src/transform/inject_assumes.cc) | Adds AssumeInjector (subclass of tvm::tir::StmtExprMutator) with Substitute entry; collects non-constant buffer shape expressions from DeclBuffer, Block, buffer_map, alloc_buffers, and match_buffers; deduplicates shapes (structural hash/eq); simplifies GT(shape, 0) with arith::Analyzer; emits AttrStmt(tilelang_assume, cond, message) per unique shape; registers pass tl.InjectAssumes and reflection tl.transform.InjectAssumes. |
| Engine pipeline integration (tilelang/engine/phase.py) | Calls tilelang.transform.InjectAssumes()(mod) in LowerAndLegalize after FrontendLegalize (inserted before later simplifications). |
| Python API wrapper & robustness (tilelang/transform/__init__.py) | Adds def InjectAssumes() forwarding to _ffi_api.InjectAssumes() (returns tvm.transform.Pass); makes LowerHopperIntrin wrapper tolerant of missing _ffi_api.LowerHopperIntrin by falling back to an identity pass. |
| Submodule pointer update (3rdparty/tvm) | Advances the 3rdparty/tvm submodule commit pointer; no other repository source changes. |
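
The "tolerant of a missing backend symbol" behavior in the table can be pictured with a small, self-contained sketch. This is not the code from the PR: `_ffi_api` below is a stand-in namespace, `identity_pass` and `make_pass` are hypothetical helpers, and a "module" is modeled as a plain list, whereas the real wrappers return a tvm.transform.Pass.

```python
from types import SimpleNamespace

# Stand-in for the FFI module (assumption). It exposes InjectAssumes but
# deliberately lacks LowerHopperIntrin, modeling a build without that symbol.
_ffi_api = SimpleNamespace(InjectAssumes=lambda: (lambda mod: mod + ["assumes"]))


def identity_pass(mod):
    """Hypothetical no-op pass: return the module unchanged."""
    return mod


def make_pass(name):
    """Look up a pass constructor by name; fall back to the identity pass."""
    ctor = getattr(_ffi_api, name, None)
    return ctor() if ctor is not None else identity_pass


mod = ["legalized"]
mod = make_pass("InjectAssumes")(mod)      # symbol present: pass runs
mod = make_pass("LowerHopperIntrin")(mod)  # symbol missing: identity fallback
print(mod)  # ['legalized', 'assumes']
```

The point of the `getattr(..., None)` lookup is that the pipeline keeps working on builds where a backend-specific pass was never registered, at the cost of silently skipping it.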

Sequence Diagram(s)

sequenceDiagram
    autonumber
    participant Engine as LowerAndLegalize
    participant TL as tilelang.transform
    participant Pass as tl.InjectAssumes
    participant Func as PrimFunc

    Engine->>TL: FrontendLegalize(mod)
    Engine->>TL: InjectAssumes()(mod)   %% new insertion
    TL->>Pass: apply to PrimFunc(s)
    Pass->>Func: rewrite body (insert AttrStmt tilelang_assume checks)
    Func-->>TL: transformed PrimFunc(s)
    Engine->>TL: continue remaining passes

sequenceDiagram
    autonumber
    participant Pass as InjectAssumes
    participant Mut as AssumeInjector
    participant Func as PrimFunc
    participant Analyzer as arith::Analyzer

    Pass->>Mut: Substitute(Func)
    Mut->>Mut: collect shapes from DeclBuffer / Block / buffer_map / alloc_buffers / match_buffers
    Mut->>Mut: deduplicate shapes (structural hash/eq) and map buffers→shape
    Mut->>Analyzer: simplify GT(shape, 0)
    alt non-trivial condition
        Mut->>Func: attach AttrStmt(tilelang_assume, cond, "buffers: ...")
    else trivial/no shape
        Mut-->>Func: no assume emitted
    end
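
The collect, deduplicate, and emit steps in the diagram above can be sketched in plain Python. This is a toy model under stated assumptions, not the C++ pass: shape expressions are modeled as strings or nested tuples, tuple equality stands in for structural equality, `collect_assumes` is a hypothetical name, and the "buffers: ..." message format is only loosely modeled on the diagram.

```python
from collections import defaultdict


def collect_assumes(buffers):
    """Return one (condition, message) pair per unique symbolic shape extent.

    `buffers` maps a buffer name to its shape. Constant (int) extents are
    skipped, since a constant needs no assumption to be known positive.
    """
    buckets = defaultdict(list)  # hash -> list of [expr, buffer names]
    order = []                   # unique entries in first-seen order
    for name, shape in buffers.items():
        for extent in shape:
            if isinstance(extent, int):
                continue
            bucket = buckets[hash(extent)]  # same list object, not a copy
            entry = next((e for e in bucket if e[0] == extent), None)
            if entry is None:
                entry = [extent, []]
                bucket.append(entry)
                order.append(entry)
            if name not in entry[1]:
                entry[1].append(name)
    return [((">", expr, 0), "buffers: " + ", ".join(names))
            for expr, names in order]


shapes = {"A": ["m", "k"], "B": ["k", "n"], "C": ["m", "n"],
          "D": [128, ("*", "m", 2)]}
for cond, msg in collect_assumes(shapes):
    print(cond, msg)
```

Six symbolic extents appear across the four buffers, but only four assumptions are produced, because `m`, `k`, and `n` are each shared by two buffers.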

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

I hop through nodes both wide and small,
I gather shapes and note them all,
I stitch an assume where checks belong,
A tiny guard, concise and strong.
🥕


📥 Commits

Reviewing files that changed from the base of the PR and between 872b303 and aefee7e.

📒 Files selected for processing (1)
  • 3rdparty/tvm (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • 3rdparty/tvm

github-actions bot commented Sep 5, 2025

👋 Hi! Thank you for contributing to the TileLang project.

Please remember to run bash format.sh in the root directory of the project to ensure your changes are properly linted and formatted. This will help ensure your contribution passes the format check.

We appreciate you taking this step! Our team will review your contribution, and we look forward to your awesome work!

🚀

gemini-code-assist bot (Contributor) left a comment

Summary of Changes

Hello @kurisu6912, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a new optimization pass aimed at accelerating the TVM prover by programmatically injecting assumptions about buffer dimensions into the intermediate representation. By making these constraints explicit, the prover can operate more efficiently. Furthermore, the changes include the addition of debugging tools that allow developers to save and inspect the IR at different points in the compilation pipeline, which will be valuable for understanding and troubleshooting transformations.

Highlights

  • New InjectAssumes Pass: Implemented a C++ pass (InjectAssumes) that automatically inserts assertions into the IR, ensuring buffer shapes are positive. This is intended to provide explicit constraints for the TVM prover.
  • Compilation Pipeline Integration: The InjectAssumes pass has been integrated into the LowerAndLegalize phase of the TileLang compilation pipeline, executing after frontend legalization.
  • Enhanced Debugging Capabilities: Added functionality to dump intermediate IR modules to a 'debug' directory at various stages of the compilation process, facilitating easier inspection and debugging.

gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request introduces a new InjectAssumes TIR transform pass to speed up the TVM prover by adding assertions that buffer shapes are positive. The changes include the C++ implementation of the pass, its Python bindings, and its integration into the compilation pipeline. My review focuses on correctness and best practices. I've identified a logic bug in the new C++ pass that could lead to incorrect behavior, an inefficiency in the same file, and some leftover debugging code in the Python pipeline that should be removed. I've also suggested an improvement to a docstring for better clarity.

Comment on lines 63 to 70
  Stmt VisitStmt_(const BlockNode* op) final {
    auto body = VisitStmt(op->body);
    AssertCreator c;
    if(root_node) {
      for(auto item: f->buffer_map) {
        c.addBuffer(item.second);
      }
    }

high

The root_node flag is intended to process function parameters only for the outermost block. However, it's never set to false, causing f->buffer_map to be processed for every BlockNode in the function body, which is incorrect and inefficient. You should capture the root_node state, set it to false before recursion, and then use the captured state.

  Stmt VisitStmt_(const BlockNode* op) final {
    bool is_root = root_node;
    if (is_root) root_node = false;
    auto body = VisitStmt(op->body);
    AssertCreator c;
    if(is_root) {
      for(auto item: f->buffer_map) {
        c.addBuffer(item.second);
      }
    }
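
The capture-then-clear pattern in the suggestion above is independent of TVM, so it can be checked with a toy visitor. This is an illustrative sketch only: `AssumeVisitor`, the dict-based block tree, and the `visited` log are hypothetical stand-ins for the C++ mutator, and `params` plays the role of f->buffer_map.

```python
class AssumeVisitor:
    """Toy visitor over nested blocks (dicts with 'name', 'allocs', 'children')."""

    def __init__(self, params):
        self.params = params   # stand-in for f->buffer_map
        self.root_node = True
        self.visited = []      # (block name, buffers handled at that block)

    def visit_block(self, block):
        is_root = self.root_node  # capture the flag before recursing
        self.root_node = False    # nested blocks must not act as root
        for child in block["children"]:
            self.visit_block(child)
        # Function parameters are handled once, at the outermost block only.
        buffers = list(self.params) if is_root else []
        buffers += block["allocs"]
        self.visited.append((block["name"], buffers))


tree = {"name": "root", "allocs": ["S"],
        "children": [{"name": "inner", "allocs": ["L"], "children": []}]}
visitor = AssumeVisitor(["A", "B"])
visitor.visit_block(tree)
print(visitor.visited)  # [('inner', ['L']), ('root', ['A', 'B', 'S'])]
```

If the flag were never cleared, the inner block would also pick up `A` and `B`, which is the duplication the review comment describes.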

Comment on lines 65 to 66
debug_path = Path('debug')
debug_path.mkdir(exist_ok=True)

high

This code unconditionally creates a debug directory in the current working directory. This is a side effect that can be problematic (e.g., if write permissions are not available) and is generally not desirable in library code. This, along with the various debug_path.joinpath(...).write_text(...) calls (e.g., on lines 103, 106, 145, 147, 211), appears to be debugging code that should be removed before merging.

Comment on lines 31 to 38
      auto bucket = buckets[h];
      auto it = std::find_if(bucket.begin(), bucket.end(), [&](auto y) {
        return se(e, y, true);
      });
      if(it == bucket.end()) {
        exprs.push_back(e);
        buckets[h].push_back(e);
      }

medium

Accessing the buckets map with buckets[h] creates a copy of the std::vector<tvm::PrimExpr>, which is inefficient. You should use a reference (auto&) to avoid this unnecessary copy. This also allows you to simplify the code by using the reference to push back the new element.

Suggested change
-      auto bucket = buckets[h];
-      auto it = std::find_if(bucket.begin(), bucket.end(), [&](auto y) {
-        return se(e, y, true);
-      });
-      if(it == bucket.end()) {
-        exprs.push_back(e);
-        buckets[h].push_back(e);
-      }
+      auto& bucket = buckets[h];
+      auto it = std::find_if(bucket.begin(), bucket.end(), [&](auto y) {
+        return se(e, y, true);
+      });
+      if(it == bucket.end()) {
+        exprs.push_back(e);
+        bucket.push_back(e);
+      }

Comment on lines +82 to +88
    """Inject Assumes

    Returns:
    -------
    fpass : tvm.transform.Pass
        The result pass
    """

medium

The docstring is a bit sparse and the Returns: section is inconsistent with other functions in this file. It would be helpful to explain what is being injected and why, and to align with the existing docstring format.

Suggested change
-    """Inject Assumes
-    Returns:
-    -------
-    fpass : tvm.transform.Pass
-        The result pass
-    """
+    """Inject assertions that buffer shapes are positive to speed up the prover.
+    Returns
+    -------
+    fpass : tvm.transform.Pass
+        The result pass
+    """

coderabbitai bot (Contributor) left a comment

Actionable comments posted: 3

🧹 Nitpick comments (6)
src/transform/inject_assumes.cc (3)

31-38: Avoid copying buckets when deduplicating shapes.

This copies the vector; use a reference.

-      auto bucket = buckets[h];
+      auto& bucket = buckets[h];

48-51: Make zero-constant dtype-stable.

Comparing with literal 0 may upcast to int32 and mismatch 64-bit shape dtypes. Consider GT(expr, make_const(expr.dtype(), 0)) (or Cast(expr->dtype, 0)).


96-101: Pass lambda should take const refs to avoid copies (align with other passes).

Not critical, but matches surrounding style and avoids unnecessary copies.

-  auto pass_func = [=](PrimFunc f, IRModule m, PassContext ctx) {
+  auto pass_func = [=](PrimFunc f, const IRModule& m, const PassContext& ctx) {
tilelang/transform/__init__.py (1)

89-89: Optional: silence type checkers for _ffi_api dynamic attr.

Other wrappers use # type: ignore; add it for consistency.

-    return _ffi_api.InjectAssumes()
+    return _ffi_api.InjectAssumes()  # type: ignore
tilelang/engine/phase.py (2)

65-67: Gate debug directory creation; avoid side effects on import.

Creating debug/ at import time can be undesirable in sandboxes and breaks read-only environments.

-debug_path = Path('debug')
-debug_path.mkdir(exist_ok=True)
+import os
+DEBUG_DUMP = bool(int(os.getenv("TL_DEBUG_IR", "0")))
+debug_path = Path(os.getenv("TL_DEBUG_DIR", "debug")) if DEBUG_DUMP else None
+if DEBUG_DUMP:
+    debug_path.mkdir(parents=True, exist_ok=True)

103-106: Guard IR dumps behind a flag to reduce I/O and noise.

Unconditional writes can slow runs and pollute working dirs. Use the DEBUG_DUMP gate.

-debug_path.joinpath('LowerTileOp.0.py').write_text(mod.script(show_meta=True))
+if DEBUG_DUMP:
+    debug_path.joinpath('LowerTileOp.0.py').write_text(mod.script(show_meta=True))
@@
-debug_path.joinpath('LowerTileOp.1.py').write_text(mod.script(show_meta=True))
+if DEBUG_DUMP:
+    debug_path.joinpath('LowerTileOp.1.py').write_text(mod.script(show_meta=True))
@@
-debug_path.joinpath('InjectFenceProxy.0.py').write_text(mod.script(show_meta=True))
+if DEBUG_DUMP:
+    debug_path.joinpath('InjectFenceProxy.0.py').write_text(mod.script(show_meta=True))
@@
-debug_path.joinpath('InjectFenceProxy.1.py').write_text(mod.script(show_meta=True))
+if DEBUG_DUMP:
+    debug_path.joinpath('InjectFenceProxy.1.py').write_text(mod.script(show_meta=True))
@@
-debug_path.joinpath('PersistThreadblock.1.py').write_text(mod.script(show_meta=True))
+if DEBUG_DUMP:
+    debug_path.joinpath('PersistThreadblock.1.py').write_text(mod.script(show_meta=True))

Also applies to: 145-147, 211-211
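
The two suggestions above (gate the directory creation, gate each dump) can be folded into one helper so that call sites stay one line each. A minimal sketch, assuming the TL_DEBUG_IR and TL_DEBUG_DIR environment variable names proposed in the review; they are not confirmed TileLang settings, and `dump_ir` is a hypothetical helper.

```python
import os
from pathlib import Path


def dump_ir(stage, text, env=os.environ):
    """Write `text` to <TL_DEBUG_DIR>/<stage>.py only when TL_DEBUG_IR=1.

    Returns the written path, or None when dumping is disabled.
    """
    if env.get("TL_DEBUG_IR", "0") != "1":
        return None  # default: no directory created, no file written
    out_dir = Path(env.get("TL_DEBUG_DIR", "debug"))
    out_dir.mkdir(parents=True, exist_ok=True)
    path = out_dir / f"{stage}.py"
    path.write_text(text)
    return path


# With dumping disabled, the call has no side effects on the filesystem.
print(dump_ir("LowerTileOp.0", "# IR text", env={}))  # None
```

A call site would then read `dump_ir('LowerTileOp.0', mod.script(show_meta=True))`, and nothing is created at import time because the directory is made lazily on the first enabled dump.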

📥 Commits

Reviewing files that changed from the base of the PR and between 6e0c350 and cd5100a.

📒 Files selected for processing (3)
  • src/transform/inject_assumes.cc (1 hunks)
  • tilelang/engine/phase.py (5 hunks)
  • tilelang/transform/__init__.py (1 hunks)
🧰 Additional context used
🧬 Code graph analysis (3)
tilelang/transform/__init__.py (1)
src/transform/inject_assumes.cc (2)
  • InjectAssumes (96-101)
  • InjectAssumes (96-96)
src/transform/inject_assumes.cc (1)
tilelang/transform/__init__.py (1)
  • InjectAssumes (81-89)
tilelang/engine/phase.py (4)
src/transform/inject_assumes.cc (2)
  • InjectAssumes (96-101)
  • InjectAssumes (96-96)
tilelang/transform/__init__.py (3)
  • InjectAssumes (81-89)
  • LowerTileOp (49-57)
  • InjectFenceProxy (228-236)
src/transform/lower_tile_op.cc (2)
  • LowerTileOp (572-577)
  • LowerTileOp (572-572)
src/transform/inject_fence_proxy.cc (3)
  • InjectFenceProxy (184-184)
  • InjectFenceProxy (189-195)
  • InjectFenceProxy (189-189)
🔇 Additional comments (3)
src/transform/inject_assumes.cc (1)

53-55: Use tvm::String for AssertStmt message (StringImm is wrong type).

Second parameter to AssertStmt is tvm::String, not a PrimExpr.

-      auto msg = StringImm("Invalid Buffer Shape: buffer shape should be greater than 0");
-      return AssertStmt(simplified, msg, body);
+      return AssertStmt(
+          simplified,
+          tvm::String("Invalid Buffer Shape: buffer shape should be greater than 0"),
+          body);

Likely an incorrect or invalid review comment.

tilelang/transform/__init__.py (1)

81-90: Wrapper looks good; expose via FFI is consistent.

tilelang/engine/phase.py (1)

95-97: Good placement of InjectAssumes in the pipeline.

Running after FrontendLegalize and before Simplify maximizes simplification opportunities.

Comment on lines 2 to 9
#include "tvm/arith/analyzer.h"
#include "tvm/ir/expr.h"
#include "tvm/ir/transform.h"
#include "tvm/node/structural_hash.h"
#include "tvm/tir/expr.h"
#include "tvm/tir/stmt.h"
#include "tvm/tir/stmt_functor.h"
#include "tvm/tir/transform.h"

⚠️ Potential issue

Add missing headers to prevent build failures.

std::unordered_map, std::vector, and std::find_if require STL headers; StructuralEqual needs its header too.

Apply:

 #include "tvm/arith/analyzer.h"
+#include <algorithm>
+#include <unordered_map>
+#include <vector>
+#include "tvm/node/structural_equal.h"
 #include "tvm/ir/expr.h"
 #include "tvm/ir/transform.h"
 #include "tvm/node/structural_hash.h"

Comment on lines 63 to 71
  Stmt VisitStmt_(const BlockNode* op) final {
    auto body = VisitStmt(op->body);
    AssertCreator c;
    if(root_node) {
      for(auto item: f->buffer_map) {
        c.addBuffer(item.second);
      }
    }
    for(auto item: op->alloc_buffers) {

⚠️ Potential issue

Root-block detection is broken; toggle before recursion and only once.

root_node is never flipped and you recurse before toggling, so every nested Block acts like root and duplicates buffer_map asserts.

-  Stmt VisitStmt_(const BlockNode* op) final {
-    auto body = VisitStmt(op->body);
-    AssertCreator c;
-    if(root_node) {
+  Stmt VisitStmt_(const BlockNode* op) final {
+    bool was_root = root_node;
+    root_node = false;  // ensure children are not treated as root
+    AssertCreator c;
+    if (was_root) {
       for(auto item: f->buffer_map) {
         c.addBuffer(item.second);
       }
     }
+    auto body = VisitStmt(op->body);

kurisu6912 changed the title from "Add InjectAssumes pass to speedup tvm prover" to "Add tilelang T.assume support and assume injection for buffer shapes" Sep 5, 2025
kurisu6912 changed the title from "Add tilelang T.assume support and assume injection for buffer shapes" to "[Feat] Add tilelang T.assume support and assume injection for buffer shapes" Sep 5, 2025
LeiWang1999 merged commit e5b61e9 into tile-ai:main Sep 5, 2025
6 checks passed
RubiaCx pushed a commit to RubiaCx/tilelang that referenced this pull request Nov 24, 2025
…shapes (tile-ai#787)

* Add InjectAssumes pass to speedup tvm prover

* Fix lint errors

* remove debug statements

* [Feat] add assume attr and assume support in tilelang

* Add convertion from tir.assume to tilelang assume

* [Fix] Add missing With constraint in IRMutator

* Fix typo in ir mutator