
[Format] HALF precision FLOAT Logical type #317

Closed
asfimport opened this issue Oct 28, 2016 · 3 comments

Comments


asfimport commented Oct 28, 2016

Reporter: Julien Le Dem / @julienledem
Assignee: Anja Boskovic / @anjakefala

Related issues:

PRs and other links:

Note: This issue was originally created as PARQUET-758. Please see the migration documentation for further details.


Gabor Szadovszky / @gszadovszky:
Hey everyone, who is interested in the half-float type,

When I reviewed the format change, it seemed obvious to me to use the "2-byte IEEE little-endian format". Since then, I've come across another approach to encoding 2-byte FP numbers: bfloat16. Since neither Java nor C++ supports 2-byte FP numbers natively, we will probably need to convert the encoded numbers to float, and for bfloat16 that conversion would be more performant.
It might be worth adding bfloat16 to the format as well and adding implementations for it in the same round. WDYT?
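To illustrate the performance point above, here is a minimal sketch (not Parquet code; the function names are hypothetical) of widening both 16-bit formats to a standard float: bfloat16 is simply the top 16 bits of an IEEE binary32, so widening is a single shift, whereas IEEE binary16 needs its exponent re-biased and separate handling for subnormals and inf/NaN.

```python
import struct

def bfloat16_bits_to_float(bits: int) -> float:
    # bfloat16 is the high 16 bits of an IEEE-754 binary32,
    # so widening is a single 16-bit left shift.
    return struct.unpack('>f', struct.pack('>I', bits << 16))[0]

def float16_bits_to_float(bits: int) -> float:
    # IEEE binary16 has a 5-bit exponent (bias 15) and 10-bit mantissa;
    # widening must re-bias the exponent and special-case
    # subnormals, infinities, and NaNs.
    sign = (bits >> 15) & 0x1
    exp = (bits >> 10) & 0x1F
    frac = bits & 0x3FF
    if exp == 0:
        value = (frac / 1024.0) * 2.0 ** -14                  # subnormal or zero
    elif exp == 0x1F:
        value = float('inf') if frac == 0 else float('nan')   # inf / NaN
    else:
        value = (1.0 + frac / 1024.0) * 2.0 ** (exp - 15)     # normal number
    return -value if sign else value
```

The shift-only bfloat16 path is what makes it cheaper on platforms without native 2-byte FP support; the binary16 path shown here is the branchy conversion that would otherwise be needed (or delegated to a library).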


Anja Boskovic / @anjakefala:
Hi Gabor!

I would support a proposal for implementing bfloat16, maybe even as a canonical extension type in Arrow.

However, I have some hesitancy about including it in this round of implementations. I think it should be considered separately.

  1. My understanding is that the implementations have already begun (I have messaged the parties working on them so that appropriate tickets get created).
  2. It would prolong the format review and the implementations.
  3. Part of that prolonging is that I foresee additional back-and-forth debating why bfloat16: why not tensorfloat? Why not add both?

And my experience has been that these conversations take a really long time for the Parquet community. It could easily add months to this process.

Float16, being an IEEE standard, has a simplicity to its inclusion.

So, I guess my takeaway is that I support us opening a separate format PR for bfloat16 inclusion, and having that proceed separately from the work of including, and implementing, IEEE Float16.


Gabor Szadovszky / @gszadovszky:
Thanks for your reply, @anjakefala!

I mentioned bfloat16 only because of the ease of converting it back and forth to a Java/C++ float, a conversion we will probably need to implement for IEEE Float16 as well. But I agree: we should not block the format release with additional discussions on this separate topic.
