[MatmulLoopPipeline] Populate `LoadOp` mask to `PrefetchOp` by whitneywhtsang · Pull Request #4030 · intel/intel-xpu-backend-for-triton

whitneywhtsang · 2025-04-27T23:40:36Z

This PR enhances MatmulLoopPipeline so that each PrefetchOp it creates carries the mask of its associated LoadOp.
Benchmark CI:
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/14697631543
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/14716472373
(No performance regressions.)

Note: this change comes partially from #3634.
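
For readers unfamiliar with the idea, here is a minimal toy sketch in Python of what "populating the mask" means. It is not the pass's actual C++/MLIR code. The classes `LoadOp` and `PrefetchOp` and the helpers `make_prefetch` and `touched_addresses` are hypothetical stand-ins invented for illustration, and the sketch assumes the mask is the same for the prefetched iteration as for the current one.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class LoadOp:
    """Toy stand-in for a masked load: one pointer and one mask bit per lane."""
    ptrs: List[int]
    mask: Optional[List[bool]] = None  # None means every lane is active


@dataclass
class PrefetchOp:
    """Toy stand-in for a prefetch: a cache hint that produces no value."""
    ptrs: List[int]
    mask: Optional[List[bool]] = None


def make_prefetch(load: LoadOp, stride: int, distance: int) -> PrefetchOp:
    """Build a prefetch for the addresses `load` would touch `distance`
    iterations ahead, carrying over the load's mask (assumed unchanged)."""
    future_ptrs = [p + stride * distance for p in load.ptrs]
    mask = None if load.mask is None else list(load.mask)
    return PrefetchOp(future_ptrs, mask)


def touched_addresses(op) -> List[int]:
    """Return the addresses the op actually accesses once the mask is applied."""
    if op.mask is None:
        return list(op.ptrs)
    return [p for p, active in zip(op.ptrs, op.mask) if active]


# Hypothetical example: lanes 2 and 3 are masked off in the load.
load = LoadOp(ptrs=[0, 4, 8, 12], mask=[True, True, False, False])
prefetch = make_prefetch(load, stride=16, distance=2)
print(touched_addresses(prefetch))  # [32, 36]
```

In this sketch, a prefetch built without the mask would touch all four future addresses, including the two lanes the load itself never reads.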

Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>


[MatmulLoopPipeline] Populate LoadOp mask to PrefetchOp

440dd9a

Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>

whitneywhtsang self-assigned this Apr 27, 2025

whitneywhtsang marked this pull request as ready for review April 28, 2025 00:18

whitneywhtsang requested review from a team, alexbaden, chengjunlu and etiotto April 28, 2025 00:21

chengjunlu approved these changes Apr 28, 2025


mfrancepillois approved these changes Apr 28, 2025


etiotto reviewed Apr 28, 2025


Comment thread test/TritonIntelGPU/loop-pipeline.mlir Outdated

etiotto reviewed Apr 28, 2025


Comment thread test/TritonIntelGPU/loop-pipeline.mlir Outdated

whitneywhtsang added 2 commits April 28, 2025 16:15

Merge remote-tracking branch 'origin/main' into whitneywhtsang/prefetch_pipeline

b5422ca

address review comments

2550d8f

Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>

whitneywhtsang requested a review from etiotto April 28, 2025 16:45

etiotto approved these changes Apr 28, 2025


whitneywhtsang merged commit d4699e1 into main Apr 28, 2025
10 checks passed

whitneywhtsang deleted the whitneywhtsang/prefetch_pipeline branch April 28, 2025 21:44

etiotto linked an issue May 2, 2025 that may be closed by this pull request

[Performance] Enable prefetching for tt.load with tensor of pointer #3484

Closed

whitneywhtsang merged 3 commits into main from whitneywhtsang/prefetch_pipeline
