GH-43719: [C++] Clarify the way SIMD-enabled agg kernels come from the same code in different compilation units #43720

felipecrv · 2024-08-15T23:13:52Z

Rationale for this change

More than once I've been confused about how the SimdLevel template parameters on these kernel classes affect dispatching of kernels based on SIMD support detection at runtime [1] given that nothing in the code changes based on the parameters.

What matters is the compilation unit in which the templates are instantiated. Different compilation units get different compilation parameters. The SimdLevel parameters don't really affect the code that gets generated (!), they only serve as a way to avoid duplication of symbols in the compiled objects.

This PR organizes the code to make this more explicit.

[1] #7871 (comment)

What changes are included in this PR?

Introduction of aggregate_basic-inl.h
Moving of the impls in aggregate_basic-inl.h to an anonymous namespace
Grouping of code based on the function they implement (Sum, Mean, and MinMax)

Are these changes tested?

By the compilation process, existing tests, and benchmarks.

GitHub Issue: [C++] Clarify the way aggregation kernels are generated from the same code in different compilation units #43719

github-actions · 2024-08-15T23:15:35Z

⚠️ GitHub issue #43719 has been automatically assigned in GitHub to PR creator.

felipecrv · 2024-08-15T23:16:45Z

@kou where would I change CMake code to skip the installation of -inl.h headers?

felipecrv · 2024-08-15T23:35:57Z

I checked binary size impact: arrow-acero-aggregate-benchmark only grew by 120 bytes.

kou · 2024-08-15T23:50:47Z

Could you add internal to file name like aggregate_basic_common_internal.h?
See also:

arrow/cpp/cmake_modules/BuildUtils.cmake

Lines 956 to 958 in 2767dc5

    
               if(HEADER_BASENAME MATCHES "internal") 
        
                 continue() 
        
               endif()

felipecrv · 2024-08-16T00:07:12Z

Could you add internal to file name like aggregate_basic_common_internal.h? See also:

arrow/cpp/cmake_modules/BuildUtils.cmake

Lines 956 to 958 in 2767dc5

if(HEADER_BASENAME MATCHES "internal")

continue()

endif()

I ~~need~~prefer to end it with -inl.h to follow the pattern found in many C++ codebases [1]. I can rename it to aggregate_basic_internal-inl.h.

[1] https://github.com/facebook/folly/blob/main/folly/channels/ChannelProcessor-inl.h

felipecrv · 2024-08-16T00:08:07Z

I thought the rule only matched on internal.h but having internal anywhere in the file name will do. Thanks.

felipecrv · 2024-08-16T00:09:18Z

@ursabot please benchmark command=cpp-micro --suite-filter=arrow-acero-aggregate-benchmark --benchmark-filter=MinMaxKernel* --iterations=3

ursabot · 2024-08-16T00:09:23Z

Benchmark runs are scheduled for commit ff39f8482bd772d70d337349fe0c9eb43566a337. Watch https://buildkite.com/apache-arrow and https://conbench.ursa.dev for updates. A comment will be posted here when the runs are complete.

mapleFU · 2024-08-16T02:30:54Z

Thanks for the changes!

conbench-apache-arrow · 2024-08-16T03:55:16Z

Thanks for your patience. Conbench analyzed the 0 benchmarking runs that have been run so far on PR commit ff39f8482bd772d70d337349fe0c9eb43566a337.

None of the specified runs were found on the Conbench server.

The full Conbench report has more details.

mapleFU · 2024-08-16T08:52:42Z

Seems lint failed

felipecrv · 2024-08-16T14:49:52Z

Seems lint failed

It's the -inl.h file prefix. I need to find a way of allowing - in this case and this case only in the linter.

austin3dickey · 2024-08-16T16:16:43Z

Hey @felipecrv, I recently made changes to the @ursabot and forgot to update the help message, sorry! You don't need to add the --iterations=3 anymore (it will default to 6 repetitions now). That's why the benchmark builds were failing.

felipecrv · 2024-08-16T16:49:34Z

@ursabot please benchmark command=cpp-micro --suite-filter=arrow-acero-aggregate-benchmark --benchmark-filter=MinMaxKernel*

ursabot · 2024-08-16T16:49:40Z

Benchmark runs are scheduled for commit a36a8d04f618b6ee1fe62e1756cea880e6996435. Watch https://buildkite.com/apache-arrow and https://conbench.ursa.dev for updates. A comment will be posted here when the runs are complete.

conbench-apache-arrow · 2024-08-16T17:16:35Z

Thanks for your patience. Conbench analyzed the 3 benchmarking runs that have been run so far on PR commit a36a8d04f618b6ee1fe62e1756cea880e6996435.

There weren't enough matching historic benchmark results to make a call on whether there were regressions.

The full Conbench report has more details.

felipecrv · 2024-08-16T18:54:13Z

@austin3dickey can I run something on my machine to test these benchmark filters? I'm confused about why they don't match any benchmark here.

EDIT: I misunderstood the message. There aren't baselines to compare, but the benchmarks ran.

austin3dickey · 2024-08-16T19:07:06Z

Yeah @felipecrv, it looks like they ran. I just noticed we don't run this suite on the main branch commits right now, which is why there's no baseline. I'm not exactly sure why we don't; there might have been a segfault or something at some point in the past. I can try to kick off a manual run of this suite on the default branch for you and post a comparison here.

austin3dickey · 2024-08-16T19:17:43Z

can I run something on my machine to test these benchmark filters?

Yes, you can install archery and do archery benchmark run --suite-filter arrow-acero-aggregate-benchmark --benchmark-filter MinMaxKernel*

austin3dickey · 2024-08-16T19:40:25Z

Okay, I ran a similar run on the latest main commit for the three machine types we ran above. Here are the comparisons:

There isn't a z-score comparison because we don't have a distribution of baseline results to compare to; just the one. But you can look at the percent change column.

pitrou

Sorry for the delay @felipecrv . This looks good to me, thanks!

pitrou · 2024-09-03T13:34:12Z

@github-actions crossbow submit -g cpp

github-actions · 2024-09-03T13:36:45Z

Revision: 447fbc9

Submitted crossbow builds: ursacomputing/crossbow @ actions-7e561de588

Task	Status
test-alpine-linux-cpp
test-build-cpp-fuzz
test-conda-cpp
test-conda-cpp-valgrind
test-cuda-cpp
test-debian-12-cpp-amd64
test-debian-12-cpp-i386
test-fedora-39-cpp
test-ubuntu-20.04-cpp
test-ubuntu-20.04-cpp-bundled
test-ubuntu-20.04-cpp-minimal-with-formats
test-ubuntu-20.04-cpp-thread-sanitizer
test-ubuntu-22.04-cpp
test-ubuntu-22.04-cpp-20
test-ubuntu-22.04-cpp-emscripten
test-ubuntu-22.04-cpp-no-threading
test-ubuntu-24.04-cpp
test-ubuntu-24.04-cpp-gcc-13-bundled
test-ubuntu-24.04-cpp-gcc-14

conbench-apache-arrow · 2024-09-04T12:53:04Z

After merging your PR, Conbench analyzed the 4 benchmarking runs that have been run so far on merge-commit 6ce2af7.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details.

…rom the same code in different compilation units (apache#43720) ### Rationale for this change More than once I've been confused about how the `SimdLevel` template parameters on these kernel classes affect dispatching of kernels based on SIMD support detection at runtime [1] given that nothing in the code changes based on the parameters. What matters is the compilation unit in which the templates are instantiated. Different compilation units get different compilation parameters. The SimdLevel parameters don't really affect the code that gets generated (!), they only serve as a way to avoid duplication of symbols in the compiled objects. This PR organizes the code to make this more explicit. [1] apache#7871 (comment) ### What changes are included in this PR? - Introduction of aggregate_basic-inl.h - Moving of the impls in `aggregate_basic-inl.h` to an anonymous namespace - Grouping of code based on the function they implement (`Sum`, `Mean`, and `MinMax`) ### Are these changes tested? By the compilation process, existing tests, and benchmarks. * GitHub Issue: apache#43719 Lead-authored-by: Felipe Oliveira Carvalho <felipekde@gmail.com> Co-authored-by: Antoine Pitrou <antoine@python.org> Signed-off-by: Antoine Pitrou <antoine@python.org>

github-actions bot added Component: C++ awaiting committer review Awaiting committer review labels Aug 15, 2024

felipecrv mentioned this pull request Aug 15, 2024

[C++] compute: Why AddMinMaxAvx512AggKernels add AVX2 in some cases #43687

Closed

felipecrv changed the title ~~Avx~~ GH-43719: [C++] Clarify the way aggregation kernels are generated from the same code in different compilation units Aug 15, 2024

felipecrv changed the title ~~GH-43719: [C++] Clarify the way aggregation kernels are generated from the same code in different compilation units~~ GH-43719: [C++] Clarify the way SIMD-enabled agg kernels come from the same code in different compilation units Aug 15, 2024

apache deleted a comment from github-actions bot Aug 15, 2024

felipecrv requested a review from pitrou August 16, 2024 15:18

felipecrv force-pushed the avx branch 2 times, most recently from eb64fde to 2c9a588 Compare August 16, 2024 18:51

felipecrv force-pushed the avx branch from 2c9a588 to b235cb6 Compare August 16, 2024 21:42

felipecrv requested a review from pitrou August 22, 2024 22:51

github-actions bot added awaiting change review Awaiting change review and removed awaiting changes Awaiting changes labels Aug 22, 2024

felipecrv and others added 16 commits September 3, 2024 15:28

Rename aggregate_basic_internal.h to aggregate_basic_internal.inc.cc

728169c

Use aggregate_basic_internal.inc.cc from specific compilation units

233fa48

Rename aggregate_basic_internal.inc.cc to aggregate_basic-inl.h

79b01d5

Organize the kernels

5fb32ce

Use Native instead of Default as non-SIMD suffix

dd6287d

Please the linter

cacd070

Rename -inl.h to -inl.cc

7aca534

Allow -inl.{cc,h} filenames

3359c5b

Remove now unecessary extra includes

2d8abc7

Undo s/Default/Native

ec34acd

Rename -inl.cc to .inc

ae52c5d

Undo lint_cpp_cli.py changes

8a30725

fixup! Undo s/Default/Native

b6f4af4

Rename .inc to .inc.cc

2030b8a

NOLINT the .cc inclusions

370b6a3

Fix lint + make more definitions anonymous/static

447fbc9

pitrou force-pushed the avx branch from ee5a9fe to 447fbc9 Compare September 3, 2024 13:33

pitrou approved these changes Sep 3, 2024

View reviewed changes

pitrou merged commit 6ce2af7 into apache:main Sep 3, 2024

pitrou removed the awaiting change review Awaiting change review label Sep 3, 2024

pitrou mentioned this pull request Sep 3, 2024

[C++] Clarify the way aggregation kernels are generated from the same code in different compilation units #43719

Closed

felipecrv deleted the avx branch September 3, 2024 14:35

GH-43719: [C++] Clarify the way SIMD-enabled agg kernels come from the same code in different compilation units #43720

GH-43719: [C++] Clarify the way SIMD-enabled agg kernels come from the same code in different compilation units #43720

Uh oh!

Conversation

felipecrv commented Aug 15, 2024 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Uh oh!

github-actions bot commented Aug 15, 2024

Uh oh!

felipecrv commented Aug 15, 2024

Uh oh!

felipecrv commented Aug 15, 2024

Uh oh!

kou commented Aug 15, 2024

Uh oh!

felipecrv commented Aug 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

felipecrv commented Aug 16, 2024

Uh oh!

felipecrv commented Aug 16, 2024

Uh oh!

ursabot commented Aug 16, 2024

Uh oh!

mapleFU commented Aug 16, 2024

Uh oh!

conbench-apache-arrow bot commented Aug 16, 2024

Uh oh!

mapleFU commented Aug 16, 2024

Uh oh!

felipecrv commented Aug 16, 2024

Uh oh!

austin3dickey commented Aug 16, 2024

Uh oh!

felipecrv commented Aug 16, 2024

Uh oh!

ursabot commented Aug 16, 2024

Uh oh!

conbench-apache-arrow bot commented Aug 16, 2024

Uh oh!

felipecrv commented Aug 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

austin3dickey commented Aug 16, 2024

Uh oh!

austin3dickey commented Aug 16, 2024

Uh oh!

austin3dickey commented Aug 16, 2024

Uh oh!

pitrou left a comment

Choose a reason for hiding this comment

Uh oh!

pitrou commented Sep 3, 2024

Uh oh!

github-actions bot commented Sep 3, 2024

Uh oh!

conbench-apache-arrow bot commented Sep 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

felipecrv commented Aug 15, 2024 •

edited by github-actions bot

Loading

felipecrv commented Aug 16, 2024 •

edited

Loading

felipecrv commented Aug 16, 2024 •

edited

Loading