Add support OrcReader based on DwrfReader#10194
Add support OrcReader based on DwrfReader#10194wypb wants to merge 1 commit intofacebookincubator:mainfrom
Conversation
✅ Deploy Preview for meta-velox canceled.
|
78ab47f to
d5906fc
Compare
|
@kagamiori has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
|
Hi @wypb, there are a few unit test failing internally. An example failure is I wonder whether we need to update the |
7372895 to
d0c48b2
Compare
|
Hi @kagamiori Thank you for you review. The |
|
@kagamiori has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
|
@kagamiori merged this pull request in 1a50a8a. |
Wow, I didn't even know we hard-code "fbcode" in our code...Thank you for looking into it and fix the unit test! |
|
Conbench analyzed the 1 benchmark run on commit There were no benchmark performance regressions. 🎉 The full Conbench report has more details. |
Summary: This is a follow-up PR of #10194 to address test failures in the out-of-source build: ``` E20240728 Exceptions.h:67] Line: velox/velox/common/file/File.cpp:112, Function:LocalReadFile, Expression: No such file or directory: cmake-build-debug/velox/velox/dwio/orc/test/examples/TestStringDictionary.testRowIndex.orc, Source: RUNTIME, ErrorCode: FILE_NOT_FOUND E20240728 Exceptions.h:67] Line: velox/velox/common/file/File.cpp:112, Function:LocalReadFile, Expression: No such file or directory: cmake-build-debug/velox/velox/dwio/orc/test/examples/complextypes_iceberg.orc, Source: RUNTIME, ErrorCode: FILE_NOT_FOUND E20240728 Exceptions.h:67] Line: velox/velox/common/file/File.cpp:112, Function:LocalReadFile, Expression: No such file or directory: cmake-build-debug/velox/velox/dwio/orc/test/examples/orc_index_int_string.orc, Source: RUNTIME, ErrorCode: FILE_NOT_FOUND E20240728 Exceptions.h:67] Line: velox/velox/common/file/File.cpp:112, Function:LocalReadFile, Expression: No such file or directory: cmake-build-debug/velox/velox/dwio/orc/test/examples/TestOrcFile.testDate1900.orc, Source: RUNTIME, ErrorCode: FILE_NOT_FOUND E20240728 Exceptions.h:67] Line: velox/velox/common/file/File.cpp:112, Function:LocalReadFile, Expression: No such file or directory: cmake-build-debug/velox/velox/dwio/orc/test/examples/orc_all_type.orc, Source: RUNTIME, ErrorCode: FILE_NOT_FOUND ``` Pull Request resolved: #10588 Reviewed By: pedroerp Differential Revision: D60452930 Pulled By: Yuhta fbshipit-source-id: adeb972bd89ed0696639ab8b3379549f3138abe3
The ORC file format is used by many companies. Currently, the DWRF Reader in Velox can be used to read ORC files. This PR implements
OrcReaderFactorybased onDwrfReaderand registers it inHiveConnectorFactory#initialize(). In this way, we can get aReaderthat can read the ORC file format throughdwio::common::getReaderFactory(FileFormat::ORC)->createReader(..).CC: @Yuhta