Update the Triton softmax micro-bench. by chengjunlu · Pull Request #1207 · intel/intel-xpu-backend-for-triton

chengjunlu · 2024-05-29T02:29:32Z

Modify the softmax kernel for better performance on N < 1024 cases.
Use the synchronize submitting by default for the benchmark.
Align the tile configuration of the XeTLA kernel and Triton kernel.

These changes are needed for #1179 to run llama kernels on simulator with performance traces. They are ported from corresponding JGS files with no modifications. --------- Signed-off-by: Gregory Shimansky <gregory.shimansky@intel.com>

chengjunlu requested review from whitneywhtsang and yudongsi May 29, 2024 02:29

chengjunlu force-pushed the chengjun/llvm-target-softmax-microbench branch from f8f2184 to 3546f31 Compare May 29, 2024 03:07

whitneywhtsang approved these changes May 30, 2024

View reviewed changes

Comment thread benchmarks/xetla_benchmark/fused_softmax.py Outdated

Comment thread benchmarks/xetla_benchmark/fused_softmax.py Outdated

yudongsi approved these changes May 30, 2024

View reviewed changes

chengjunlu force-pushed the chengjun/llvm-target-softmax-microbench branch from 3546f31 to eea9219 Compare May 30, 2024 05:28

chengjunlu merged commit ceadb1b into llvm-target May 30, 2024

Update the softmax Triton kernel for N<1024 benchmark.

eea9219

chengjunlu deleted the chengjun/llvm-target-softmax-microbench branch May 31, 2024 04:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the Triton softmax micro-bench.#1207

Update the Triton softmax micro-bench.#1207
chengjunlu merged 1 commit into
llvm-targetfrom
chengjun/llvm-target-softmax-microbench

chengjunlu commented May 29, 2024

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chengjunlu commented May 29, 2024

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants