Add checkpoint for ORC UNION type#14532
Conversation
|
Thank you for your pull request and welcome to the Trino community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. Continue to work with us on the review and improvements in this PR, and submit the signed CLA to cla@trino.io. Photos, scans, or digitally-signed PDF files are all suitable. Processing may take a few days. The CLA needs to be on file before we merge your changes. For more information, see https://github.com/trinodb/cla |
|
Hi @phd3 Could you review it? |
electrum
left a comment
There was a problem hiding this comment.
The code looks fine to me, but would be good to have a test
@baohe-zhangis is it correct to say that - we're able to reproduce this by writing file bigger than one rowgroup in a stripe in a unit test? Given that this is an omission from the original implementation, may be okay to merge without a special test (i.e. we don't have such tests for other types) |
@phd3 We can't do that in a unit test as right now trino orc doesn't support write in union type. [EDITED by phd3: discussed offline about adding a product test] |
9b07615 to
1c08ccc
Compare
|
Thank you for your pull request and welcome to the Trino community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. Continue to work with us on the review and improvements in this PR, and submit the signed CLA to cla@trino.io. Photos, scans, or digitally-signed PDF files are all suitable. Processing may take a few days. The CLA needs to be on file before we merge your changes. For more information, see https://github.com/trinodb/cla |
|
I added a unit test in hive product test. Without this fix, the test will fail and throw an exception: |
There was a problem hiding this comment.
Thanks @baohe-zhang , great to see this being reproduced.
2022-11-06T15:50:09.2852993Z tests | 2022-11-06 21:35:09 INFO: Test io.trino.tests.product.hive.TestReadUniontype.testReadOrcUniontypeWithCheckpoint took 42.67s
2022-11-06T15:50:09.9087769Z tests | 2022-11-06 21:35:09 INFO: SUCCESS / io.trino.tests.product.hive.TestReadUniontype.testReadOrcUniontypeWithCheckpoint (Groups: smoke) took 43.3 seconds
2022-11-06T15:50:11.0978592Z tests | 2022-11-06 21:35:11 INFO: [141 of 561] io.trino.tests.product.hive.TestReadUniontype.testReadUniontype [ORC] (Groups: smoke)
The test takes ~43s on the CI. I'm bit on the fence whether it "deserves" this kind of time - given it was an omission originally. This is ~1% of overall module time - less than many other tests. So I'm leaning towards including it. cc @findepi @electrum
|
@cla-bot check |
|
Thank you for your pull request and welcome to the Trino community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. Continue to work with us on the review and improvements in this PR, and submit the signed CLA to cla@trino.io. Photos, scans, or digitally-signed PDF files are all suitable. Processing may take a few days. The CLA needs to be on file before we merge your changes. For more information, see https://github.com/trinodb/cla |
|
The cla-bot has been summoned, and re-checked this pull request! |
|
cc @martint for processing @baohe-zhang 's CLA |
Co-Authored-By: Jithesh T Rajan <jirajan@linkedin.com>
1c08ccc to
f96a015
Compare
|
Thank you for your pull request and welcome to the Trino community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. Continue to work with us on the review and improvements in this PR, and submit the signed CLA to cla@trino.io. Photos, scans, or digitally-signed PDF files are all suitable. Processing may take a few days. The CLA needs to be on file before we merge your changes. For more information, see https://github.com/trinodb/cla |
We never received the email with the signed CLA. @baohe-zhang, can you please resubmit? |
|
Hi @martint, I resubmitted my cla. |
|
@cla-bot check |
|
The cla-bot has been summoned, and re-checked this pull request! |
Description
Non-technical explanation
Release notes
() This is not user-visible or docs only and no release notes are required.
() Release notes are required, please propose a release note for me.
(x) Release notes are required, with the following suggested text: