Skip to content
/ Tensile Public
forked from ROCm/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

License

Notifications You must be signed in to change notification settings

rkamd/Tensile

This branch is 276 commits behind ROCm/Tensile:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

7a10361 · Sep 28, 2023
Jun 14, 2022
Feb 10, 2023
Sep 27, 2023
Sep 27, 2023
Jun 10, 2022
Mar 1, 2023
Jul 10, 2023
Aug 1, 2023
Nov 11, 2022
May 19, 2020
Nov 12, 2020
Oct 2, 2021
Aug 8, 2019
Sep 27, 2023
Jul 10, 2023
May 19, 2020
May 3, 2021
Jul 10, 2023
Sep 27, 2023
Jul 10, 2023
Feb 17, 2023
Jul 10, 2023
Feb 17, 2023

Repository files navigation

Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems (such as batched GEMM), and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as backend library to rocBLAS. Tensile acts as the performance backbone for a wide variety of 'compute' applications running on AMD GPUs.

See Tensile Wiki for documentation.

About

Stretching GPU performance for GEMMs and tensor contractions.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 49.5%
  • C++ 30.5%
  • Assembly 15.7%
  • TeX 1.5%
  • CMake 1.2%
  • Shell 1.1%
  • Other 0.5%