[Test][Fix](parquet-reader) Add parquet decoder unit tests and fix bugs by these tests. #49182
run buildall
07893b1 to 24b730d
run buildall
24b730d to 292172b
run buildall
292172b to 0d34f41
run buildall
0d34f41 to d64b5ce
run buildall
TPC-H: Total hot run time: 34235 ms
TPC-DS: Total hot run time: 193185 ms
ClickBench: Total hot run time: 31.76 s
d64b5ce to ab6b680
run buildall
TPC-H: Total hot run time: 32271 ms
TPC-DS: Total hot run time: 192303 ms
ClickBench: Total hot run time: 31.22 s
ab6b680 to 19f8160
run buildall
TPC-H: Total hot run time: 32117 ms
TPC-DS: Total hot run time: 191648 ms
ClickBench: Total hot run time: 31.31 s
BE UT Coverage Report: Increment line coverage, Increment coverage report
19f8160 to edbbd65
run buildall
TPC-H: Total hot run time: 31946 ms
TPC-DS: Total hot run time: 192623 ms
ClickBench: Total hot run time: 31.11 s
BE UT Coverage Report: Increment line coverage, Increment coverage report
LGTM
PR approved by at least one committer and no changes requested.
PR approved by anyone and no changes requested.
What problem does this PR solve?
Release note
Added Parquet decoder unit tests and fixed bugs identified through these tests.
In addition to the unit tests, the main changes include:
Modification of the Decoder::set_data() return type
Changed the return type of Decoder::set_data() from void to Status so that the unit tests can check whether the operation succeeded and capture any error it reports.
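As a rough illustration of why a Status return helps in tests, here is a minimal sketch. It does not reproduce the Doris code: SketchStatus is a simplified stand-in for the real Status class, and SketchFixedWidthDecoder, its constructor argument, and its validation rule are hypothetical names and behavior chosen only for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>

// Simplified stand-in for a status type (assumption: NOT the real Doris Status).
class SketchStatus {
public:
    static SketchStatus OK() { return SketchStatus(true, ""); }
    static SketchStatus Corruption(std::string msg) {
        return SketchStatus(false, std::move(msg));
    }
    bool ok() const { return _ok; }
    const std::string& message() const { return _msg; }

private:
    SketchStatus(bool ok, std::string msg) : _ok(ok), _msg(std::move(msg)) {}
    bool _ok;
    std::string _msg;
};

// Hypothetical decoder for fixed-width values, used only to show the pattern.
class SketchFixedWidthDecoder {
public:
    explicit SketchFixedWidthDecoder(size_t type_length) : _type_length(type_length) {}

    // With a void return, a malformed buffer could only be detected later
    // (or not at all). Returning a status lets the caller check it directly.
    SketchStatus set_data(const uint8_t* data, size_t size) {
        if (data == nullptr && size != 0) {
            return SketchStatus::Corruption("null data with non-zero size");
        }
        if (_type_length == 0 || size % _type_length != 0) {
            return SketchStatus::Corruption("data size is not a multiple of the type length");
        }
        _data = data;
        _size = size;
        return SketchStatus::OK();
    }

    size_t num_values() const { return _size / _type_length; }

private:
    size_t _type_length;
    const uint8_t* _data = nullptr;
    size_t _size = 0;
};

// Example of the kind of check a unit test can now make.
inline void sketch_check_set_data() {
    SketchFixedWidthDecoder decoder(4);
    uint8_t buffer[8] = {0};
    assert(decoder.set_data(buffer, 8).ok());   // two complete 4-byte values
    assert(!decoder.set_data(buffer, 7).ok());  // truncated buffer is rejected
}
```

A test framework assertion on `status.ok()` plays the same role as the `assert` calls in `sketch_check_set_data()`.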
Separation of header and implementation files
Moved the definitions of non-template functions from header files (e.g., delta_bit_pack_decoder.h, fix_length_plain_decoder.h) to the corresponding .cpp files. This resolves the link errors caused by duplicate symbols when building the unit tests.
The change also tidies the code structure, improves compilation efficiency, and avoids potential link conflicts.
Implementation of the tparquet::Encoding::BIT_PACKED branch in level_decoder.cpp
Bug fixes
BoolRLEDecoder::skip_values() was not functioning correctly.
FixLengthDictDecoder::read_dict_values_to_column() and FixLengthDictDecoder::convert_dict_column_to_string_column() were not functioning correctly.
Future work
Add more test cases for invalid scenarios in these unit tests.
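For readers unfamiliar with the BIT_PACKED level encoding mentioned above, the following is a minimal sketch of how such levels can be unpacked. It assumes the layout of the deprecated Parquet BIT_PACKED encoding, where fixed-width values are stored back to back starting from the most significant bit of each byte. The function name, signature, and error handling are hypothetical and are not the code in level_decoder.cpp.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Unpacks `num_values` levels of `bit_width` bits each from `data`.
// Values are read back to back, from the most significant bit of each
// byte to the least significant bit.
// Returns false if the arguments are invalid or the buffer is too short.
inline bool sketch_decode_bit_packed_levels(const uint8_t* data, size_t size,
                                            int bit_width, size_t num_values,
                                            std::vector<uint16_t>* out) {
    if (out == nullptr || bit_width < 0 || bit_width > 16) {
        return false;
    }
    out->clear();
    if (bit_width == 0) {
        // A zero bit width means every level is 0 and no bytes are consumed.
        out->assign(num_values, 0);
        return true;
    }
    // Guard against overflow when computing the total number of bits.
    if (num_values > SIZE_MAX / 16) {
        return false;
    }
    const size_t total_bits = num_values * static_cast<size_t>(bit_width);
    const size_t needed_bytes = (total_bits + 7) / 8;
    if (data == nullptr || size < needed_bytes) {
        return false;
    }
    out->reserve(num_values);
    size_t bit_pos = 0;  // absolute bit offset into the buffer
    for (size_t i = 0; i < num_values; ++i) {
        uint16_t value = 0;
        for (int b = 0; b < bit_width; ++b, ++bit_pos) {
            const uint8_t byte = data[bit_pos / 8];
            const uint8_t bit = (byte >> (7 - bit_pos % 8)) & 1;
            value = static_cast<uint16_t>((value << 1) | bit);
        }
        out->push_back(value);
    }
    return true;
}
```

With a bit width of 3, the three bytes 0x05, 0x39, 0x77 unpack to the levels 0 through 7. A buffer shorter than the required number of bytes makes the function return false, which is the kind of invalid scenario the future-work item refers to.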
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)