[backend][fp8] Float8E4M3FNUZ -> Float8E4M3FN for NVIDIA PTX #4596
Jokeren merged 1 commit into triton-lang:main
Conversation
Co-authored-by: acollins3 <acollins@nvidia.com>
Hi @chsigg, AMD uses Float8E4M3FNUZ (bias == 8) instead of Float8E4M3FN (bias == 7). Will it be a problem if we modify the code directly over TritonIR?
@acollins3 tried to keep this generic in openxla#8 (comment), where it is not backend-specific yet. I hope we can arrange for both fp8 types to coexist. I'm not sure what you mean by 'modifying the code directly over TritonIR'.
Because I see we are overriding the existing fp8 type rather than adding a new one (I would guess adding a new one is more robust). Note that the NaN encodings differ: Float8E4M3FN uses all-ones exponent and mantissa bits (0x7F/0xFF) to represent NaN, while Float8E4M3FNUZ uses 0b1000_0000 (0x80). I am not sure whether this will be a problem; I will share this information with the relevant people to check against. Thank you for the fast response.
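A minimal sketch (pure Python, no Triton/MLIR dependencies, helper name is hypothetical) decoding the same byte under both e4m3 variants, to make the bias and NaN differences discussed above concrete:

```python
def decode_e4m3(byte, *, fnuz):
    """Decode an 8-bit e4m3 value; fnuz=True uses bias 8 and the 0x80 NaN."""
    sign = -1.0 if (byte >> 7) & 1 else 1.0
    exp = (byte >> 3) & 0xF
    man = byte & 0x7
    bias = 8 if fnuz else 7

    if fnuz:
        if byte == 0x80:                 # FNUZ: single NaN, no negative zero
            return float("nan")
    elif exp == 0xF and man == 0x7:      # FN: exponent/mantissa all ones is NaN
        return float("nan")

    if exp == 0:                         # subnormal (or zero)
        return sign * (man / 8.0) * 2.0 ** (1 - bias)
    return sign * (1.0 + man / 8.0) * 2.0 ** (exp - bias)


if __name__ == "__main__":
    for b in (0x7F, 0x80, 0x40):
        print(f"0x{b:02x}: FN={decode_e4m3(b, fnuz=False)}  "
              f"FNUZ={decode_e4m3(b, fnuz=True)}")
    # 0x7f: FN=nan   FNUZ=240.0  (FNUZ max finite; FN max finite is 448 at 0x7e)
    # 0x80: FN=-0.0  FNUZ=nan
    # 0x40: FN=2.0   FNUZ=1.0    (bias 7 vs bias 8)
```

So the same bit pattern can mean a different value, NaN, or negative zero depending on which type the backend declares, which is why mixing the two types silently is risky.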
Fix MLIR type used for e4m3 fp8 type in NVIDIA PTX codegen.
#3681
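As a quick cross-check of why the PTX codegen must use Float8E4M3FN (a sketch outside this PR, assuming the ml_dtypes package is available; it ships NumPy dtypes for both e4m3 variants):

```python
import numpy as np
import ml_dtypes

# Reinterpret the same raw bytes under both e4m3 variants.
bits = np.array([0x7F, 0x80, 0x40], dtype=np.uint8)

as_fn = bits.view(ml_dtypes.float8_e4m3fn).astype(np.float32)
as_fnuz = bits.view(ml_dtypes.float8_e4m3fnuz).astype(np.float32)

print("bits :", [hex(b) for b in bits])
print("FN   :", as_fn)    # [ nan  -0.   2.]
print("FNUZ :", as_fnuz)  # [240.  nan   1.]
```

NVIDIA PTX e4m3 follows the FN (OCP) encoding, so declaring the element type as Float8E4M3FNUZ in the MLIR pipeline would misinterpret the NaN pattern and shift every value by one binade, as shown above.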