Fix rocblas_bfloat16 conversions and add a __device__ __host__ rocblas_abs() function by leekillough · Pull Request #678 · ROCm/rocBLAS

leekillough · 2019-08-30T17:05:54Z

I will have to build and run the tests manually.

I'm not 100% sure this fixes it, but it should.

leekillough · 2019-08-30T21:42:56Z

There were three problems addressed:

The rocblas_bfloat16 type should be allowed to be implicitly converted to float, not just explicitly converted to float or double. There is no loss in the conversion from rocblas_bfloat16 to float, so it can be allowed implicitly. If converted to double, it will be implicitly converted to float first.
The std::abs overload for rocblas_bfloat16 was buggy, using the integer value of data & 0x7fff converted to rocblas_bfloat16 as the return value, instead of the absolute value of the BF16 value.
The std::abs function was being called from __device__ functions, which isn't generally allowed. As a result, when trying to call std::abs from __device__ functions with float or double arguments, the compiler only saw the rocblas_bfloat16 overload of std::abs as a viable __device__ overload candidate function. It then tried to convert the float or double argument to rocblas_bfloat16, but it couldn't, since rocblas_bfloat16's constructor is explicit (a conversion to rocblas_bfloat16 from float or double is lossy, and thus should not be allowed implicitly).
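To make the second problem concrete, here is a minimal sketch in plain C++. It assumes a hypothetical `bf16_bits` struct that stores the upper 16 bits of an IEEE-754 float, and hypothetical helpers `bf16_from_float`, `bf16_to_float`, `abs_buggy` and `abs_fixed`. None of these names come from rocBLAS, and the float-to-BF16 conversion here truncates for brevity, which may differ from the library's rounding.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical stand-in for rocblas_bfloat16: the upper 16 bits of a float.
struct bf16_bits
{
    std::uint16_t data;
};

// float -> BF16 by truncation (an assumption made to keep the sketch short).
inline bf16_bits bf16_from_float(float f)
{
    std::uint32_t bits;
    std::memcpy(&bits, &f, sizeof(bits));
    return bf16_bits{static_cast<std::uint16_t>(bits >> 16)};
}

// BF16 -> float is exact: the 16 stored bits become the high half of the float.
inline float bf16_to_float(bf16_bits a)
{
    std::uint32_t bits = static_cast<std::uint32_t>(a.data) << 16;
    float         f;
    std::memcpy(&f, &bits, sizeof(f));
    return f;
}

// Buggy pattern: the masked bits are treated as an integer value and then
// converted numerically, so the result is unrelated to |a|.
inline bf16_bits abs_buggy(bf16_bits a)
{
    return bf16_from_float(static_cast<float>(a.data & 0x7fff));
}

// Intended behaviour: clear the sign bit and keep the result as the bit pattern.
inline bf16_bits abs_fixed(bf16_bits a)
{
    return bf16_bits{static_cast<std::uint16_t>(a.data & 0x7fff)};
}
```

With an input of -2.0 (bit pattern 0xC000), `abs_fixed` yields the pattern 0x4000, which is 2.0, whereas `abs_buggy` converts the integer 0x4000 = 16384 and returns 16384.0.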

The fix:

Allow rocblas_bfloat16 to be implicitly converted to float, without loss. The compiler then handles conversion to double automatically, first converting to float and then to double.
Remove the std::abs overloads for rocblas_bfloat16.
Add a rocblas_abs function to be callable from __device__ or __host__ functions, and put the rocblas_bfloat16 overload there.
Change ROT*g functions to use rocblas_abs instead of std::abs.
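The shape of the fix can be sketched roughly as follows. This is not the code from the PR: `bf16_sketch`, `abs_sketch`, `float_to_bf16_bits` and the `SKETCH_HD` macro are hypothetical names, the float-to-BF16 conversion truncates for brevity, and the sketch has only been reasoned about as plain host C++ (the macro expands to `__device__ __host__` only under a HIP or CUDA compiler).

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical macro: host/device attributes under HIP or CUDA, nothing otherwise.
#if defined(__HIPCC__) || defined(__CUDACC__)
#define SKETCH_HD __device__ __host__
#else
#define SKETCH_HD
#endif

// float -> BF16 bits by truncation (assumption; the library may round instead).
SKETCH_HD inline std::uint16_t float_to_bf16_bits(float f)
{
    std::uint32_t bits;
    std::memcpy(&bits, &f, sizeof(bits));
    return static_cast<std::uint16_t>(bits >> 16);
}

// Hypothetical stand-in for rocblas_bfloat16.
struct bf16_sketch
{
    std::uint16_t data;

    bf16_sketch() = default;

    // Lossy direction: float -> BF16 stays explicit.
    explicit SKETCH_HD bf16_sketch(float f)
        : data(float_to_bf16_bits(f))
    {
    }

    // Lossless direction: BF16 -> float is implicit; double follows via float.
    SKETCH_HD operator float() const
    {
        std::uint32_t bits = static_cast<std::uint32_t>(data) << 16;
        float         f;
        std::memcpy(&f, &bits, sizeof(f));
        return f;
    }
};

// Generic absolute value, usable from host or device code.
template <typename T>
SKETCH_HD inline T abs_sketch(T x)
{
    return x < 0 ? -x : x;
}

// BF16 overload: clear the sign bit directly on the stored bits.
SKETCH_HD inline bf16_sketch abs_sketch(bf16_sketch x)
{
    x.data &= 0x7fff;
    return x;
}
```

Because the BF16 overload lives beside a generic template rather than inside `std::abs`, a call with a float or double argument selects the template with an exact match and never needs the explicit constructor.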

AlexBrownAMD

Looks good!

Replace explicit conversion of bloat16 to float and double with impli…

de55d7f

…cit conversion of bfloat16 to float

leekillough requested review from amcamd and wbgilmartin August 30, 2019 17:05

leekillough changed the title ~~Replace explicit conversion of bloat16 to float and double with impicit conversion of bloat16 to float~~ Replace explicit conversion of bloat16 to float and double with implicit conversion of bloat16 to float Aug 30, 2019

leekillough requested a review from AlexBrownAMD August 30, 2019 17:27

Lee Killough added 2 commits August 30, 2019 13:31

Fix std::abs for rocblas_bfloat16

79b052d

Change to using rocblas_abs instead of std::abs for when __device__ a…

0126b9b

…nd __host__ are both required

leekillough changed the title ~~Replace explicit conversion of bloat16 to float and double with implicit conversion of bloat16 to float~~ Fix rocblas_bfloat16 conversions and add a __device__ __host__ rocblas_abs() function Aug 30, 2019

AlexBrownAMD approved these changes Aug 30, 2019

leekillough merged commit 253fbf6 into ROCm:develop Aug 31, 2019

leekillough mentioned this pull request Sep 1, 2019

Bfloat16 ROCm/Tensile#652

Merged

leekillough deleted the bfloat16_to_float branch December 27, 2019 00:29

mlse-lib-jenkins pushed a commit that referenced this pull request Jun 4, 2021

Dot opt take2 (#678)

4984cc5

* extend range of single block and inc1 * add testing for better case coverage

leekillough merged 3 commits into ROCm:develop from leekillough:bfloat16_to_float
