-
Notifications
You must be signed in to change notification settings - Fork 2.3k
[Bug] Star tree indexing - kahan summation fix #18462
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] Star tree indexing - kahan summation fix #18462
Conversation
|
❌ Gradle check result for 60144ea: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
4595246 to
a151898
Compare
Signed-off-by: bharath-techie <[email protected]>
|
❌ Gradle check result for 5165fe5: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
❌ Gradle check result for 5165fe5: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #18462 +/- ##
=========================================
Coverage 72.66% 72.67%
- Complexity 67858 67893 +35
=========================================
Files 5521 5522 +1
Lines 312541 312574 +33
Branches 45364 45377 +13
=========================================
+ Hits 227113 227148 +35
- Misses 66903 66921 +18
+ Partials 18525 18505 -20 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
I did one round of benchmarks on nyc_taxis. Indexing has very slight to no regression. Search - Overall sum : |
...va/org/opensearch/index/compositeindex/datacube/startree/aggregators/SumValueAggregator.java
Show resolved
Hide resolved
.../opensearch/index/compositeindex/datacube/startree/builder/AbstractDocumentsFileManager.java
Outdated
Show resolved
Hide resolved
asimmahmood1
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Any thoughts about adding this minor improvement to account for values that larger than current sum? https://en.wikipedia.org/wiki/Kahan_summation_algorithm#Further_enhancements
Signed-off-by: bharath-techie <[email protected]>
Signed-off-by: bharath-techie <[email protected]>
Since we're targeting this for 3.1 and code freeze is today , will create an issue to track this enhancement. |
) * Star tree kahan summation fix Signed-off-by: bharath-techie <[email protected]>
) * Star tree kahan summation fix Signed-off-by: bharath-techie <[email protected]>Signed-off-by: TJ Neuenfeldt <[email protected]>
) * Star tree kahan summation fix Signed-off-by: bharath-techie <[email protected]>
) * Star tree kahan summation fix Signed-off-by: bharath-techie <[email protected]>

Description
Star tree's SumValueAggregator today stores kahan summation state which gets reused and we found an edge case where the children node's value gets set in the kahan summation that gets used in the parent nodes which results in incorrect results.
As part of this PR removing the state from the SumValueAggregator and using
CompensatedSumas theSumAggregatortype.Related Issues
#18463
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.