You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
?GEADD has virtually no utility over ?AXPBY, which is itself unoptimizable (i.e. the equivalent loops, optimized by a compiler, will perform at least as well in all cases).
Both appleblas_?geadd and cublas?geam support transposition, which is useful, in part because transposition is nontrivial to optimize.
cblas_?omatcopy, which is inspired by MKL, has transpose but does not support the generality of accumulation of the former.
It would be nice to have a GEAM-style routine in OpenBLAS.
The text was updated successfully, but these errors were encountered:
I don't know if it is actually pointless, but it is the variant of GEADD that was originally implemented by ATLAS. (Fun with names of non-standardized BLAS extensions, I guess the most feature-rich GEADD is the one in IBM's ESSL ? Also patches welcome...)
None of the codes available to me seem to use it, neither does anything public on github AFAICT, there never was any complaint about the lack of transpositions here (or about the lack of GEADD in the reference implementation either), so
maybe it might as well not exist ? :)
?GEADD
has virtually no utility over?AXPBY
, which is itself unoptimizable (i.e. the equivalent loops, optimized by a compiler, will perform at least as well in all cases).Both
appleblas_?geadd
andcublas?geam
support transposition, which is useful, in part because transposition is nontrivial to optimize.cblas_?omatcopy
, which is inspired by MKL, has transpose but does not support the generality of accumulation of the former.It would be nice to have a GEAM-style routine in OpenBLAS.
The text was updated successfully, but these errors were encountered: