-
Notifications
You must be signed in to change notification settings - Fork 2.9k
[Python] support BucketByteBuffer and BucketUUID #2836
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
It looks like the following matches the expected output(which I guess is different than the Java output): |
|
I guess python is doing the 'right' thing. The question for me is: will Java start doing the right thing or maintain the wrong thing for backwards compatibility. Python should, I think, be consistent w/ Java. |
|
@rymurr Unfortunately, I think you're right that matching Java is more
important than being correct according to spec. We may want to wait on
this until the java community decides if they are going to fix this or not.
Seems like they're trying to decide how likely existing users are to have
been impacted.
…On Tue, Jul 20, 2021, 2:06 AM Ryan Murray ***@***.***> wrote:
I guess python is doing the 'right' thing. The question for me is: will
Java start doing the right thing or maintain the wrong thing for backwards
compatibility. Python should, I think, be consistent w/ Java.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#2836 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAMETHZDOQENBDO5ZRB3JCLTYU4ANANCNFSM5ARSWF7A>
.
|
|
Elevating the 'wrong' behavior to be 'the standard' will be hard for non-Java languages. Waiting for Java bucketing version to be fixed seems reasonable though. This avoids correctness issues -- no support for bucketing in Python, means no bugs at all. FWIW, in Trino, bucketing did not have the bug (that's how we found #2837), so 'correctly' bucketed data can be out there too. |
I agree. I think the last thing we want to do is start duplicating guava bugs in other languages. It is hard and unnecessary. Shall we wait for the guava fix to be merged and propagated to Iceberg then? |
|
Waiting seems like the best option here. |
Per @rdblue 's #2837 (comment) i posted a proposed fix #2849. |
findepi
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
code lgtm
is there any place in tests where hash values could be asserted?
* [Python] support BucketByteBuffer and BucketUUID * Add additional unit tests for bucket hash methods.
* [Python] support BucketByteBuffer and BucketUUID * Add additional unit tests for bucket hash methods.
Follow up #2689 (comment)