Skip to content

CUDA: faster softmax via shared memory + fp16 math#4742

Merged
JohannesGaessler merged 5 commits intoggml-org:masterfrom
JohannesGaessler:cuda-faster-softmax
Jan 9, 2024
Merged

CUDA: faster softmax via shared memory + fp16 math#4742
JohannesGaessler merged 5 commits intoggml-org:masterfrom
JohannesGaessler:cuda-faster-softmax

Commits