Skip to content

Conversation

@Hzfengsy
Copy link
Member

@Hzfengsy Hzfengsy commented Aug 4, 2023

This PR fixes the issue of the GEMV rule uses too much shared memory on llama-70B model.

May have perf regression w/o #15471. (Actually not sure)

cc @cyx-6

This PR fixes the issue of the GEMV rule uses too much shared memory on
llama-70B model.
@tvm-bot
Copy link
Collaborator

tvm-bot commented Aug 4, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@github-actions github-actions bot requested a review from cyx-6 August 4, 2023 06:30
@junrushao junrushao merged commit a8218b3 into apache:unity Aug 4, 2023
@Hzfengsy Hzfengsy deleted the dlight_gemv_for_large_workload branch November 5, 2023 09:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants