Fix an edge case with Round function when the scaled number exceeds Double.MAX_VALUE #16876

cshao239 · 2023-04-04T14:57:07Z

Description

Fix an edge case with Round function when the scaled number exceeds Double.MAX_VALUE

This issue was caused by PR #14620.

Before #14620, select round(123.4, 10000000) incorrectly returned 0.0.

After #14620, select round(123.4, 10000000) failed with the error "input is infinite or NaN".

With this fix, it correctly returns 123.4.

In general, this PR fixes an edge case for round(a, b) when a * 10^b exceeds Double.MAX_VALUE.
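
To make the overflow concrete, here is a minimal sketch of a rescale-based rounding routine. The class and method names (NaiveRound, naiveRound) are hypothetical and this is not the code in MathFunctions.java; it only illustrates how a scale factor of 10^b that overflows to Infinity can turn the result into 0.0.

```java
final class NaiveRound {
    // Hypothetical rescale-based rounding, for illustration only.
    static double naiveRound(double num, long decimals) {
        // Math.pow overflows to Infinity once 10^decimals exceeds Double.MAX_VALUE.
        double factor = Math.pow(10, decimals);
        // Math.round(Infinity) saturates to Long.MAX_VALUE, and
        // Long.MAX_VALUE / Infinity evaluates to 0.0.
        return Math.round(num * factor) / factor;
    }

    public static void main(String[] args) {
        System.out.println(Math.pow(10, 10_000_000));      // Infinity
        System.out.println(naiveRound(123.4, 10_000_000)); // 0.0
        System.out.println(naiveRound(123.456, 2));        // 123.46
    }
}
```

The same overflow happens with a small b when a is already close to Double.MAX_VALUE, since a * 10^b is then infinite as well.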

Additional context and related issues

Release notes

(x) This is not user-visible or docs only and no release notes are required.
( ) Release notes are required, please propose a release note for me.
( ) Release notes are required, with the following suggested text:

cla-bot · 2023-04-04T14:57:09Z

Thank you for your pull request and welcome to our community. We could not parse the GitHub identity of the following contributors: Chenren Shao.
This is most likely caused by a git client misconfiguration; please make sure to:

Check whether your git client is configured with an email to sign commits: git config --list | grep email
If not, set it up using: git config --global user.email [email protected]
Make sure that the git commit email is configured in your GitHub account settings; see https://github.com/settings/emails

pettyjamesm · 2023-04-11T16:21:51Z

core/trino-main/src/main/java/io/trino/operator/scalar/MathFunctions.java

~~We probably need to use Math.toIntExact (and catch ArithmeticException, converting it to TrinoException if decimals > Integer.MAX_VALUE)~~

Never mind, the parameter is annotated @SqlType(StandardTypes.INTEGER) so really we ~~should just change decimals to an int parameter instead of long.~~ Never mind again, it looks like these are always passed as long on purpose, maybe for code generation simplicity (cc: @martint who might know). I would still suggest using toIntExact so that we fail instead of overflowing if someone ever passes a value larger than Integer.MAX_VALUE.
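
A minimal sketch of the toIntExact suggestion, under stated assumptions: the helper name checkedDecimals is hypothetical, and IllegalArgumentException stands in for TrinoException so that the example compiles without Trino on the classpath.

```java
final class DecimalsCheck {
    // Hypothetical helper: narrow the long argument to int, failing loudly
    // instead of silently overflowing.
    static int checkedDecimals(long decimals) {
        try {
            return Math.toIntExact(decimals);
        }
        catch (ArithmeticException e) {
            // Stand-in for converting to TrinoException in the real code base.
            throw new IllegalArgumentException("decimals out of int range: " + decimals, e);
        }
    }
}
```

With this shape, checkedDecimals(2) returns 2, while checkedDecimals(Integer.MAX_VALUE + 1L) throws rather than wrapping around to a negative int.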

pettyjamesm · 2023-04-11T16:49:43Z

core/trino-main/src/main/java/io/trino/operator/scalar/MathFunctions.java

Having tested this locally, I can say that the performance of this approach is terrible. The unit test examples added take over 5 seconds to perform each rounding operation.

I wonder if there's something better we can do when handling high precision rounding. Note that BigDecimal.valueOf(double) is implemented as:

    public static BigDecimal valueOf(double val) {
        // <irrelevant comment elided>
        return new BigDecimal(Double.toString(val));
    }

where Double.toString internally is already doing a lot of the same work we're trying to accomplish in our rounding routine in order to produce the string output which is then just parsed back in. There might be a more efficient approach we can take from the implementation there to get a correct result without such a large performance hit.

cshao239 · 2023-04-11

Yes, I totally agree. I tested the performance locally as well. I think the performance cost is the main reason we use the rescaling approach for the other cases, even though the BigDecimal approach should work for any input. I am open to other ideas for handling this edge case with better performance than the BigDecimal approach.
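
For reference, a BigDecimal-based rounding routine of the kind discussed above could look like the sketch below. The class name and the choice of RoundingMode.HALF_UP are assumptions for illustration, not the code from this PR. It is correct for any finite input, but setScale has to materialize the requested number of decimal digits, which is presumably where the time goes for a very large decimals value.

```java
import java.math.BigDecimal;
import java.math.RoundingMode;

final class BigDecimalRound {
    // Hypothetical BigDecimal-based rounding, for illustration only.
    static double round(double num, int decimals) {
        if (Double.isNaN(num) || Double.isInfinite(num)) {
            return num; // BigDecimal cannot represent NaN or infinities
        }
        // BigDecimal.valueOf goes through Double.toString, as quoted above.
        return BigDecimal.valueOf(num)
                .setScale(decimals, RoundingMode.HALF_UP)
                .doubleValue();
    }
}
```

Because no double multiplication is involved, round(Double.MAX_VALUE, 2) cannot overflow here and simply returns Double.MAX_VALUE.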

martint

10000000 is not a reasonable input for this function. A double can't have more than ~17 significant digits, so it doesn't make sense to round to anything finer than that. We should fix it either by short-circuiting the computation when the number of digits is larger than what would produce any visible effect, or by failing on such input.

cshao239 · 2023-04-12T14:29:14Z

> 10000000 is not a reasonable input for this function.

True, but this issue exists for more reasonable inputs too. For round(a, b), if a is in the neighborhood of Double.MAX_VALUE, say Double.MAX_VALUE - 1, and b = 2, which is a reasonable input, then a * 10^b exceeds Double.MAX_VALUE and it will break.

martint · 2023-04-12T15:15:43Z

Double.MAX_VALUE is a large integer number with 17 significant digits and many zeros after, so it doesn't matter what the value of b is. The round operation should be a no-op. This is one of the cases where we could short-circuit the computation.
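
The short-circuit idea can be sketched as follows. This is a hedged illustration rather than the merged change: the class name, the exact guard condition, and the restriction to decimals >= 0 are all assumptions. The idea is that once the rescaled magnitude is infinite or too large for Math.round to represent, the input has no fractional digits left at that scale, so it is returned unchanged.

```java
final class ShortCircuitRound {
    // Hypothetical sketch; assumes decimals >= 0.
    static double round(double num, long decimals) {
        if (Double.isNaN(num) || Double.isInfinite(num)) {
            return num;
        }
        double factor = Math.pow(10, decimals);
        double rescaled = Math.abs(num) * factor;
        // If rescaling overflows, or exceeds what Math.round can return,
        // rounding cannot change the value: treat it as a no-op.
        if (Double.isInfinite(rescaled) || rescaled > Long.MAX_VALUE) {
            return num;
        }
        // Round the magnitude, then restore the sign of the input.
        return Math.copySign(Math.round(rescaled) / factor, num);
    }
}
```

Under these assumptions, round(123.4, 10000000) and round(Double.MAX_VALUE, 2) both return their first argument, and the cost stays at a handful of floating-point operations.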

cshao239 · 2023-04-13T17:03:48Z

That makes sense. I have updated it to return the original value directly and added more reasonable unit tests.

cshao239 · 2023-04-18T16:18:42Z

cc: @martint - is this what you had in mind instead?

cshao239 · 2023-04-26T20:32:51Z

@martint Hi, Martin. Can you merge if you don't have other concerns?

cshao239 · 2023-04-28T13:30:20Z

@pettyjamesm @findepi can you guys take a look as well?

martint · 2023-05-01T15:44:15Z

Thanks for the fix, @cshao239 !

cshao239 requested a review from findepi April 4, 2023 14:57

cshao239 marked this pull request as draft April 4, 2023 15:05

cshao239 force-pushed the round-max branch from e98e0f3 to 11965be Compare April 4, 2023 15:12

cla-bot bot added the cla-signed label Apr 4, 2023

cshao239 marked this pull request as ready for review April 4, 2023 15:51

cshao239 requested a review from martint April 10, 2023 13:49

cshao239 force-pushed the round-max branch from 11965be to e5bad67 Compare April 10, 2023 23:40

pettyjamesm reviewed Apr 11, 2023

cshao239 force-pushed the round-max branch from e5bad67 to dff6834 Compare April 11, 2023 17:00

martint requested changes Apr 11, 2023

Commit ffef5a0: Fix an edge case with Round function when the scaled number exceeds Double.MAX_VALUE

cshao239 force-pushed the round-max branch from dff6834 to ffef5a0 Compare April 13, 2023 17:02

martint approved these changes Apr 27, 2023

martint merged commit fc26d6d into trinodb:master May 1, 2023

github-actions bot added this to the 416 milestone May 1, 2023

colebow mentioned this pull request May 3, 2023

Add Trino 416 release notes #17328

Merged

cshao239 deleted the round-max branch December 28, 2023 18:25
