[Unity][DLight] Use less shared memory for gemv #15482

Hzfengsy · 2023-08-04T06:30:11Z

This PR fixes the issue of the GEMV rule uses too much shared memory on llama-70B model.

May have perf regression w/o #15471. (Actually not sure)

This PR fixes the issue of the GEMV rule uses too much shared memory on llama-70B model.

tvm-bot · 2023-08-04T06:30:14Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @quic-sanirudh _{See #10317 for details}

_{Generated by tvm-bot}

[Unity][DLight] Use less shared memory for gemv

f6a3d33

This PR fixes the issue of the GEMV rule uses too much shared memory on llama-70B model.

github-actions bot requested a review from cyx-6 August 4, 2023 06:30

junrushao approved these changes Aug 4, 2023

View reviewed changes

junrushao merged commit a8218b3 into apache:unity Aug 4, 2023

Hzfengsy deleted the dlight_gemv_for_large_workload branch November 5, 2023 09:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Unity][DLight] Use less shared memory for gemv #15482

[Unity][DLight] Use less shared memory for gemv #15482

Uh oh!

Hzfengsy commented Aug 4, 2023

Uh oh!

tvm-bot commented Aug 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Unity][DLight] Use less shared memory for gemv #15482

[Unity][DLight] Use less shared memory for gemv #15482

Uh oh!

Conversation

Hzfengsy commented Aug 4, 2023

Uh oh!

tvm-bot commented Aug 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants