[CUTLASS][Cherry-pick] Introduce several features of cutlass profiler #15573

masahi · 2023-08-16T05:31:25Z

#14275 was sent and merged to unity for no good reason, so I'm sending this to main now.

Also the PR makes the conv2d profiling time 4x slower by expanding the search space over the output alignments. In most cases (profile_all_alignments = False, the default), we just want to pick the largest-possible alignment. To prevent the conv2d profiling time from blowing up in the default path, this PR adds a fix on top of #14275. Due to this difference, expect merge conflict on the next unity + main merge.

@junrushao @spectrometerHBH @Hzfengsy @jwfromm

- allow Conv2d using different alignment factors for input and epilogue, which can influence performance - store the profiler cache on disk, reducing CUTLASS profiler overhead across different runs - use the same set of default tile configurations as CUTLASS for sm80 https://github.com/NVIDIA/cutlass/blob/master/tools/library/scripts/generator.py#L1881

tvm-bot · 2023-08-16T05:31:28Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

No users to tag found in teams: cutlass, cherry-pick _{See #10317 for details}

_{Generated by tvm-bot}

Hzfengsy · 2023-08-17T00:20:40Z

Due to this difference, expect merge conflict on the next unity + main merge.

Could you please also patch it to unity branch to address the potential conflict?

spectrometerHBH and others added 2 commits August 16, 2023 14:21

skip profiling all conv2d output alignments when possible

5f8403b

Hzfengsy approved these changes Aug 17, 2023

View reviewed changes

Hzfengsy merged commit 8afa6d2 into apache:main Aug 17, 2023

masahi mentioned this pull request Aug 17, 2023

[Unity][CUTLASS][Cherry-pick] Skip profiling all conv2d output alignments when possible #15583

Merged

ysh329 mentioned this pull request Oct 18, 2023

[Release] v0.14.0 Release Candidate Notes #15948

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CUTLASS][Cherry-pick] Introduce several features of cutlass profiler #15573

[CUTLASS][Cherry-pick] Introduce several features of cutlass profiler #15573

Uh oh!

masahi commented Aug 16, 2023 •

edited

Loading

Uh oh!

tvm-bot commented Aug 16, 2023

Uh oh!

Hzfengsy commented Aug 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[CUTLASS][Cherry-pick] Introduce several features of cutlass profiler #15573

[CUTLASS][Cherry-pick] Introduce several features of cutlass profiler #15573

Uh oh!

Conversation

masahi commented Aug 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tvm-bot commented Aug 16, 2023

Uh oh!

Hzfengsy commented Aug 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

masahi commented Aug 16, 2023 •

edited

Loading