[DLIGHT][GPU] Add OpenCL dequant matmul schedule #17187

krishnaraj36 · 2024-07-22T09:52:44Z

Enhanced the GPU matmul schedule for the OpenCL Android and Windows backends.
This gives a nearly 2x prefill performance gain for Llama-2-7B.

| Model | Target device | Earlier prefill perf | Optimized prefill perf |
|---|---|---|---|
| Llama-2-7B-chat-hf | Snapdragon® 8 Gen 3 | 27 tok/sec | 50 tok/sec |
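
For readers who want to check the "2x" claim against the reported numbers, here is a minimal sketch of the arithmetic. The two throughput values come from the table above; the prompt length of 512 tokens is a hypothetical value chosen only for illustration and is not from the PR.

```python
# Prefill throughput figures reported in the table above (tokens per second).
baseline_tok_per_sec = 27.0
optimized_tok_per_sec = 50.0

# Speedup is the ratio of optimized to baseline throughput.
speedup = optimized_tok_per_sec / baseline_tok_per_sec

# Hypothetical prompt length, used only to turn throughput into latency.
prompt_tokens = 512

# Time to prefill the whole prompt at each throughput.
baseline_seconds = prompt_tokens / baseline_tok_per_sec
optimized_seconds = prompt_tokens / optimized_tok_per_sec

print(f"speedup: {speedup:.2f}x")
print(
    f"prefill time for {prompt_tokens} tokens: "
    f"{baseline_seconds:.1f} s -> {optimized_seconds:.1f} s"
)
```

The ratio works out to about 1.85x, which the description rounds to 2x.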

krishnaraj36 · 2024-07-22T09:57:07Z

@tqchen @Hzfengsy, can you please take a look at this PR?
@srkreddy1238

Update matmul.py

1cc21a6

tqchen approved these changes Jul 23, 2024

tqchen merged commit 50d1c97 into apache:main Jul 23, 2024

ysh329 mentioned this pull request Oct 16, 2024

[Release] v0.18.0 Release Candidate Notes #17468

Closed

kurisu6912 mentioned this pull request Sep 5, 2025

kurisu add assume attr patch 1 tile-ai/tvm#8

Closed
