You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[DataType] Initial support of fp8 (e4m3/e5m2) (#14863)
Recently NVIDIA announced official support of the fp8 data type: e4m3 and e5m2, the first one has 4 bits for exponent and 3 bits for mantissa while the second one has 5 bits for exponent and 2 bits for mantissa, and NVIDIA encourages using e4m3 for forward and e5m2 (larger dynamic range) for backward. Currently, TVM has no support for these data types, as the first step to support fp8, this PR adds new type codes for `e4m3_float8` and `e5m2_float8`, and implement legalization passes `FP8ComputeLegalize` and `FP8StorageLegalize` so that we can use them for backends that do not have native fp8 support.
0 commit comments