Optimization in Numeric Terms Aggregation query for Large Bucket Counts #18702

vinaykpud · 2025-07-08T18:39:30Z

Description

In bucket aggregations, data node sends topN bucket requested to the coordinator. The contract here is to return the buckets sorted by key but topN on the basis of value.
If the number of requested top-N buckets exceeds or close to the maximum bucket ordinal, making the use of a PriorityQueue for top-N selection inefficient or redundant. So we made following modifications:

use quickselect for topN if the requested size is greater than the 20% of the total buckets.
If the requested size is greater than the bucket size then return all the bucket.

Benchmarking test results :

#18703 (comment)

Related Issues

Resolves #18703
Related #18650

Check List

Functionality includes testing.
API changes companion pull request created, if applicable.
Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

github-actions · 2025-07-08T20:07:32Z

❌ Gradle check result for 74295ec: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Rishabh Maurya <[email protected]> (cherry picked from commit 130d890)

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

github-actions · 2025-07-11T18:26:20Z

❌ Gradle check result for 9f7c12d: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

github-actions · 2025-07-11T19:56:02Z

❕ Gradle check result for e124eb1: UNSTABLE

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

codecov · 2025-07-11T19:56:26Z

Codecov Report

❌ Patch coverage is 89.47368% with 10 lines in your changes missing coverage. Please review.
✅ Project coverage is 72.90%. Comparing base (c01ff89) to head (68e77e1).
⚠️ Report is 9 commits behind head on main.

Files with missing lines	Patch %	Lines
...egations/bucket/terms/BucketSelectionStrategy.java	92.95%	2 Missing and 3 partials ⚠️
...regations/bucket/terms/NumericTermsAggregator.java	76.47%	4 Missing ⚠️
...va/org/opensearch/search/DefaultSearchContext.java	80.00%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##               main   #18702      +/-   ##
============================================
+ Coverage     72.89%   72.90%   +0.01%     
- Complexity    69318    69339      +21     
============================================
  Files          5642     5643       +1     
  Lines        318636   318712      +76     
  Branches      46107    46112       +5     
============================================
+ Hits         232254   232348      +94     
- Misses        67540    67569      +29     
+ Partials      18842    18795      -47

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2025-07-14T21:46:48Z

❌ Gradle check result for 66ccdaa: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-07-14T23:21:56Z

❌ Gradle check result for ced0d1b: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

github-actions · 2025-07-15T00:54:00Z

❌ Gradle check result for 13c663b: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

github-actions · 2025-08-07T08:28:15Z

✅ Gradle check result for adda83c: SUCCESS

server/src/main/java/org/opensearch/search/DefaultSearchContext.java

github-actions · 2025-08-07T17:14:01Z

❌ Gradle check result for 0d3ecb4: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-08-07T17:54:19Z

❌ Gradle check result for 0d3ecb4: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

jainankitk

Approach is similar to terms aggregation. Can you answer the questions in #18732 (review) ?

server/src/main/java/org/opensearch/search/SearchService.java

github-actions · 2025-08-07T19:18:20Z

✅ Gradle check result for 68e77e1: SUCCESS

vinaykpud · 2025-08-07T19:53:12Z

@jainankitk

Approach is similar to terms aggregation. Can you answer the questions in #18732 (review) ?

Sure,

I am curious how we arrived at 20% as the right threshold for choosing between pq approach vs quickselect?

Compared with different threshold starting from 10% and analyzed for which value QuickSelects performs better. I have added the results here:
#18702 (comment)

Does this have any memory usage implications when the size is above 20% of value count?

There is no much difference in the memory usage is observed for the size above 20%. When we use quickSelect we create an array with size equal to the bucketOrdinal and copy all buckets to it to perform the topN selection. Bellow link has JVM metrics comparison.

#18703 (comment)

…ts (#18702) * optimize num agg using quick select for topN when applicable Signed-off-by: Rishabh Maurya <[email protected]> (cherry picked from commit 130d890) * Updated the numeric term aggregation logic to select topN Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Updated the algorithm selection logic Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added a feature flag for the implementation Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added profile debug information Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * use priority queue method for significant terms Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Refactored the selection strategy Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added tests case with proper assertions Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added cluster settings for selection strategy Signed-off-by: Vinay Krishna Pudyodu <[email protected]> --------- Signed-off-by: Rishabh Maurya <[email protected]> Signed-off-by: Vinay Krishna Pudyodu <[email protected]> Co-authored-by: Rishabh Maurya <[email protected]> (cherry picked from commit 7db7a5a) Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…ts (#18702) (#18974) * optimize num agg using quick select for topN when applicable (cherry picked from commit 130d890) * Updated the numeric term aggregation logic to select topN * Updated the algorithm selection logic * Added a feature flag for the implementation * Added profile debug information * use priority queue method for significant terms * Refactored the selection strategy * Added tests case with proper assertions * Added cluster settings for selection strategy --------- (cherry picked from commit 7db7a5a) Signed-off-by: Rishabh Maurya <[email protected]> Signed-off-by: Vinay Krishna Pudyodu <[email protected]> Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Rishabh Maurya <[email protected]>

…ts (opensearch-project#18702) * optimize num agg using quick select for topN when applicable Signed-off-by: Rishabh Maurya <[email protected]> (cherry picked from commit 130d890) * Updated the numeric term aggregation logic to select topN Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Updated the algorithm selection logic Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added a feature flag for the implementation Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added profile debug information Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * use priority queue method for significant terms Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Refactored the selection strategy Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added tests case with proper assertions Signed-off-by: Vinay Krishna Pudyodu <[email protected]> * Added cluster settings for selection strategy Signed-off-by: Vinay Krishna Pudyodu <[email protected]> --------- Signed-off-by: Rishabh Maurya <[email protected]> Signed-off-by: Vinay Krishna Pudyodu <[email protected]> Co-authored-by: Rishabh Maurya <[email protected]>

github-actions bot added bug Something isn't working Search:Performance labels Jul 8, 2025

rishabhmaurya and others added 4 commits July 11, 2025 11:04

optimize num agg using quick select for topN when applicable

ee5276a

Signed-off-by: Rishabh Maurya <[email protected]> (cherry picked from commit 130d890)

Updated the numeric term aggregation logic to select topN

9a528c0

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

Add changelog

14738d8

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

Updated the algorithm selection logic

52f4c52

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

vinaykpud force-pushed the num-term-agg-opt branch from 3abbaaa to 52f4c52 Compare July 11, 2025 18:04

Updated the comment

9f7c12d

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

spotlessApply

e124eb1

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

opensearch-ci-bot mentioned this pull request Jul 11, 2025

[AUTOCUT] Gradle Check Flaky Test Report for RemoteStoreStatsIT #14310

Open

vinaykpud force-pushed the num-term-agg-opt branch 3 times, most recently from ff2e323 to ced0d1b Compare July 14, 2025 22:11

Updated tests

13c663b

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

vinaykpud force-pushed the num-term-agg-opt branch from ced0d1b to 13c663b Compare July 14, 2025 23:46

opensearch-ci-bot mentioned this pull request Jul 15, 2025

[AUTOCUT] Gradle Check Flaky Test Report for FullRollingRestartIT #18490

Closed

vinaykpud added 2 commits July 15, 2025 12:10

Added a feature flag for the implementation

45b50b7

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

Added profile debug information

f8a69e7

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

vinaykpud force-pushed the num-term-agg-opt branch from 1f361cc to f8a69e7 Compare July 15, 2025 19:10

vinaykpud marked this pull request as ready for review July 15, 2025 19:12

vinaykpud requested review from anasalkouz, andrross and ashking94 as code owners July 15, 2025 19:12

vinaykpud force-pushed the num-term-agg-opt branch from 51c0446 to 506c92e Compare August 7, 2025 06:42

Merge branch 'main' into num-term-agg-opt

adda83c

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

rishabhmaurya reviewed Aug 7, 2025

View reviewed changes

server/src/main/java/org/opensearch/search/DefaultSearchContext.java Outdated Show resolved Hide resolved

vinaykpud closed this Aug 7, 2025

vinaykpud reopened this Aug 7, 2025

Fixed nit pick

68e77e1

Signed-off-by: Vinay Krishna Pudyodu <[email protected]>

vinaykpud force-pushed the num-term-agg-opt branch from 0d3ecb4 to 68e77e1 Compare August 7, 2025 17:55

opensearch-ci-bot mentioned this pull request Aug 7, 2025

[AUTOCUT] Gradle Check Flaky Test Report for RemoteStoreReplicationSourceTests #16683

Open

jainankitk reviewed Aug 7, 2025

View reviewed changes

server/src/main/java/org/opensearch/search/SearchService.java Show resolved Hide resolved

rishabhmaurya approved these changes Aug 7, 2025

View reviewed changes

rishabhmaurya merged commit 7db7a5a into opensearch-project:main Aug 7, 2025
31 checks passed

vinaykpud added the backport 3.2 Backport to 3.2 branch label Aug 7, 2025

opensearch-trigger-bot bot mentioned this pull request Aug 7, 2025

[Backport 3.2] Optimization in Numeric Terms Aggregation query for Large Bucket Counts #18974

Merged

This was referenced Aug 5, 2025

[AUTOCUT] Gradle Check Flaky Test Report for SearchRestCancellationIT #14311

Open

[AUTOCUT] Gradle Check Flaky Test Report for RethrottleTests #17937

Open

[AUTOCUT] Gradle Check Flaky Test Report for MetadataIndexTemplateServiceTests #19058

Open

BrewTestBot mentioned this pull request Aug 20, 2025

opensearch 3.2.0 Homebrew/homebrew-core#234146

Merged

vinaykpud mentioned this pull request Sep 2, 2025

Adding tests for high cardinality numeric term aggregation opensearch-project/opensearch-benchmark-workloads#690

Merged

5 tasks

opensearch-ci-bot mentioned this pull request Oct 22, 2025

[AUTOCUT] Gradle Check Flaky Test Report for EhcacheDiskCacheManagerTests #19722

Open

Optimization in Numeric Terms Aggregation query for Large Bucket Counts #18702

Optimization in Numeric Terms Aggregation query for Large Bucket Counts #18702

Uh oh!

Conversation

vinaykpud commented Jul 8, 2025 • edited by rishabhmaurya Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Benchmarking test results :

Related Issues

Check List

Uh oh!

github-actions bot commented Jul 8, 2025

Uh oh!

github-actions bot commented Jul 11, 2025

Uh oh!

github-actions bot commented Jul 11, 2025

Uh oh!

codecov bot commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Jul 14, 2025

Uh oh!

github-actions bot commented Jul 14, 2025

Uh oh!

github-actions bot commented Jul 15, 2025

Uh oh!

github-actions bot commented Aug 7, 2025

Uh oh!

Uh oh!

github-actions bot commented Aug 7, 2025

Uh oh!

github-actions bot commented Aug 7, 2025

Uh oh!

jainankitk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Aug 7, 2025

Uh oh!

Uh oh!

vinaykpud commented Aug 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vinaykpud commented Jul 8, 2025 •

edited by rishabhmaurya

Loading

codecov bot commented Jul 11, 2025 •

edited

Loading

vinaykpud commented Aug 7, 2025 •

edited

Loading