Introduce outer reduction for metal #17058

Hzfengsy · 2024-06-03T09:17:50Z

q4f16_0 may improve the performance of metal. To be specific, Llama-3 8B on M1 Pro:

Introduce outer reduction for metal

b43c030

Hzfengsy force-pushed the low_batch_gemv branch from b39fbbe to b43c030 Compare June 3, 2024 11:55

tqchen approved these changes Jun 4, 2024

View reviewed changes

tqchen merged commit 1c05902 into apache:main Jun 4, 2024

ysh329 mentioned this pull request Jul 20, 2024

[Release] v0.17.0 Release Candidate Notes #17178

Closed

kurisu6912 mentioned this pull request Sep 5, 2025

kurisu add assume attr patch 1 tile-ai/tvm#8

Closed

Provide feedback