Question on numpy style addition in pycuda gpuarray matrix #329

SuperbTUM · 2021-12-07T15:08:39Z

SuperbTUM
Dec 7, 2021

I found out when implementing matrix addition with numpy style +, there is a significant loss in precision and moreover, a few elements returned wrong answers, but once I flattened the matrix to vector and did vector addition, everything went right. This issue happens after completing two sets of (1000, 1000) * (1000, 1000) matrix multiplication with cublas Sgemm (data type is float32 and I think this will return a correct result, at least I tried) and add them in elementwise style with a simple symbol +. I checked all the intermediate results by transferring the results from device to host and comparing them with numpy.allclose().

inducer · 2021-12-08T19:21:59Z

inducer
Dec 8, 2021
Maintainer

For now, you need to make sure that the strides match on arrays that you add together. PyCUDA should probably do this for you behind the scenes. PRs welcome!

#330

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on numpy style addition in pycuda gpuarray matrix #329

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Question on numpy style addition in pycuda gpuarray matrix #329

SuperbTUM Dec 7, 2021

Replies: 1 comment

inducer Dec 8, 2021 Maintainer

SuperbTUM
Dec 7, 2021

inducer
Dec 8, 2021
Maintainer