Optimize performance of `math::trunc` (~2.5x faster) #12909

tlm365 · 2024-10-13T13:14:01Z

Rationale for this change

Same idea as #12881. Using the unary/binary functions allow faster processing (most likely auto-vectorized code) by avoiding branching on nulls.

What changes are included in this PR?

Apply unary and binary
Add benchmark

Are these changes tested?

Existing testcases.

Are there any user-facing changes?

No.

**BENCHMARK RESULT

simonvandel · 2024-10-13T13:49:19Z

datafusion/functions/src/math/trunc.rs

+                let num_array = num.as_primitive::<Float32Type>();
+                let precision_array = precision.as_primitive::<Int64Type>();
+                let result: PrimitiveArray<Float32Type> =
+                    arrow_arith::arity::binary(num_array, precision_array, |x, y| {


Is this the same function as this? https://docs.rs/arrow/latest/arrow/compute/fn.binary.html

If so you could remove the new dependency (although it is in the dependency closure anyway already)

@simonvandel Thanks so much for reviewing

Is this the same function as this? https://docs.rs/arrow/latest/arrow/compute/fn.binary.html

Oh, yes it is. Nice!

Signed-off-by: Tai Le Manh <[email protected]>

alamb

Thanks @tlm365 and @simonvandel

findepi · 2024-10-15T14:56:40Z

datafusion/functions/src/math/trunc.rs

-            ColumnarValue::Scalar(Int64(Some(0))) => Ok(Arc::new(
-                make_function_scalar_inputs!(num, "num", Float64Array, { f64::trunc }),
-            ) as ArrayRef),
-            ColumnarValue::Array(precision) => Ok(Arc::new(make_function_inputs2!(


I've broadened #12923 to cover this too

alamb · 2024-10-16T14:19:04Z

I merged up from main to resolve a conflict on this branch

alamb · 2024-10-16T17:13:19Z

🚀 thanks again

github-actions bot added the functions label Oct 13, 2024

simonvandel reviewed Oct 13, 2024

View reviewed changes

Optimize performance of math::trunc

7e6f820

Signed-off-by: Tai Le Manh <[email protected]>

tlm365 force-pushed the optimize-trunc branch from 27c8635 to 7e6f820 Compare October 13, 2024 14:15

tlm365 changed the title ~~Optimize performance of math::trunc (~2.5x faster)~~ Optimize performance of math::trunc (~2.5x faster) Oct 13, 2024

tlm365 marked this pull request as ready for review October 13, 2024 18:06

alamb approved these changes Oct 15, 2024

View reviewed changes

findepi mentioned this pull request Oct 15, 2024

Revise and remove usages of make_function_scalar_inputs, make_function_scalar_inputs_return_type, make_function_inputs2 #12923

Closed

findepi reviewed Oct 15, 2024

View reviewed changes

Merge remote-tracking branch 'apache/main' into optimize-trunc

c5f70b0

buraksenn mentioned this pull request Oct 16, 2024

Removed last usages of scalar_inputs, scalar_input_types and inputs2 to use arrow unary/binary for performance #12972

Merged

alamb merged commit caeabc1 into apache:main Oct 16, 2024
26 checks passed

tlm365 deleted the optimize-trunc branch November 10, 2024 15:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize performance of `math::trunc` (~2.5x faster) #12909

Optimize performance of `math::trunc` (~2.5x faster) #12909

tlm365 commented Oct 13, 2024 •

edited

Loading

simonvandel Oct 13, 2024

tlm365 Oct 13, 2024

alamb left a comment

findepi Oct 15, 2024

alamb commented Oct 16, 2024

alamb commented Oct 16, 2024

Optimize performance of math::trunc (~2.5x faster) #12909

Optimize performance of math::trunc (~2.5x faster) #12909

Conversation

tlm365 commented Oct 13, 2024 • edited Loading

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

simonvandel Oct 13, 2024

Choose a reason for hiding this comment

tlm365 Oct 13, 2024

Choose a reason for hiding this comment

alamb left a comment

Choose a reason for hiding this comment

findepi Oct 15, 2024

Choose a reason for hiding this comment

alamb commented Oct 16, 2024

alamb commented Oct 16, 2024

Optimize performance of `math::trunc` (~2.5x faster) #12909

Optimize performance of `math::trunc` (~2.5x faster) #12909

tlm365 commented Oct 13, 2024 •

edited

Loading