Fix New Parquet Reader reading array<struct<type:string>>#10849
Fix New Parquet Reader reading array<struct<type:string>>#10849zhenxiao wants to merge 1 commit intoprestodb:masterfrom
Conversation
|
@zhenxiao Thank you for your commit and for catching this issue! I've spent some time investigating the root cause of the problem and realised that the issue is a little bit broader than parsing It's my fault, I missed one of the backward-compatibility rules from parquet spec. This means that: must be parsed as Whereas must be parsed as I've implemented another fix, it covers all affected cases. Additionally, I had to refactor parquet writer used for tests, so that it could be used for writing arrays with and without middle group level. Thanks again for catching this issue. Will really appreciate if you review the fix #10852. Hope it solves all backward-compatibility issues! |
Hi @dain @electrum @nezihyigitbasi @kgalieva
found a bug when running New Parquet reader for our production traffic (seems like the only bug)
when reading array<structtype:string>, columnIO is primitive for the inside struct
With this fix, all of our highly nested Parquet queries are good