[BUG FIX] Using an excessively large reindex slice can lead to a JVM OutOfMemoryError on coordinator #18964
97c1aa7 to
f42bf6f
Compare
❌ Gradle check result for f42bf6f: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
f42bf6f to
955ccc2
Compare
❌ Gradle check result for 955ccc2: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
955ccc2 to
a8be685
Compare
Signed-off-by: sunqijun.jun <[email protected]>
Signed-off-by: sunqijun.jun <[email protected]>
Signed-off-by: sunqijun.jun <[email protected]>
d1db8cb to
ab6e2e4
Compare
❌ Gradle check result for ab6e2e4: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
❌ Gradle check result for ab6e2e4: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@             Coverage Diff              @@
##               main    #18964     +/-   ##
============================================
+ Coverage     72.82%    72.94%   +0.11%
- Complexity    69677     69772     +95
============================================
  Files          5658      5658
  Lines        320099    320108      +9
  Branches      46348     46350      +2
============================================
+ Hits         233110    233491    +381
+ Misses        68088     67749    -339
+ Partials      18901     18868     -33
☔ View full report in Codecov by Sentry.
❌ Gradle check result for 2c7897f: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
@ankitkala I apologize for taking up your time, but I’m genuinely unsure who else to ask for a review. As I noticed you’ve modified the reindex code in the past, I’d be extremely grateful if you could review it at your convenience. Thank you so much for your help.
Signed-off-by: kkewwei <[email protected]>
❕ Gradle check result for 4bfd6ec: UNSTABLE Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.
Signed-off-by: kkewwei <[email protected]>
❌ Gradle check result for bbf2503: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?
Signed-off-by: kkewwei <[email protected]>
kkewwei
left a comment
LGTM
…OutOfMemoryError on coordinator (opensearch-project#18964)
* bugfix for too much slices cause jvm oom
Signed-off-by: sunqijun.jun <[email protected]>
* add changelogs
Signed-off-by: sunqijun.jun <[email protected]>
* fix spotlessApply
Signed-off-by: sunqijun.jun <[email protected]>
---------
Signed-off-by: sunqijun.jun <[email protected]>
Signed-off-by: kkewwei <[email protected]>
Co-authored-by: sunqijun.jun <[email protected]>
Co-authored-by: kkewwei <[email protected]>

…OutOfMemoryError on coordinator (opensearch-project#18964)
* bugfix for too much slices cause jvm oom
Signed-off-by: sunqijun.jun <[email protected]>
* add changelogs
Signed-off-by: sunqijun.jun <[email protected]>
* fix spotlessApply
Signed-off-by: sunqijun.jun <[email protected]>
---------
Signed-off-by: sunqijun.jun <[email protected]>
Signed-off-by: kkewwei <[email protected]>
Co-authored-by: sunqijun.jun <[email protected]>
Co-authored-by: kkewwei <[email protected]>
Signed-off-by: Ankit Jain <[email protected]>
Description
Currently, OpenSearch validates search slice parameters in the wrong place. The slice count is checked during execution on each shard, so an invalid value is rejected only when a shard throws an exception. This is a relatively slow way to fail.
This PR therefore moves the validation of slice parameters to the coordinator node so that problems are detected earlier.
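To illustrate the idea of failing fast on the coordinator, here is a minimal, self-contained sketch. It is not the code from this PR: the class `SliceCheckSketch`, the methods `validateSlices` and `planSubRequests`, and the limit passed as `maxSlices` are all hypothetical names chosen for illustration. The point is only that the slice count is checked before any per-slice objects are allocated, so an excessively large value is rejected without consuming heap.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; none of these names come from the OpenSearch code base.
final class SliceCheckSketch {

    /** Rejects an out-of-range slice count before any per-slice work is created. */
    static int validateSlices(int requestedSlices, int maxSlices) {
        if (requestedSlices < 1) {
            throw new IllegalArgumentException("slices must be at least 1 but was [" + requestedSlices + "]");
        }
        if (requestedSlices > maxSlices) {
            throw new IllegalArgumentException(
                "slices [" + requestedSlices + "] exceeds the allowed maximum [" + maxSlices + "]");
        }
        return requestedSlices;
    }

    /** Stand-in for the coordinator fan-out: one sub-request descriptor per slice. */
    static List<String> planSubRequests(int requestedSlices, int maxSlices) {
        // Validate first, so a huge value never reaches the allocation below.
        int slices = validateSlices(requestedSlices, maxSlices);
        List<String> plan = new ArrayList<>(slices);
        for (int id = 0; id < slices; id++) {
            plan.add("slice " + id + " of " + slices);
        }
        return plan;
    }
}
```

With an assumed limit of 1024, `SliceCheckSketch.planSubRequests(4, 1024)` returns four descriptors, while `SliceCheckSketch.planSubRequests(Integer.MAX_VALUE, 1024)` throws `IllegalArgumentException` immediately instead of attempting the allocation.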
Related Issues
Resolves #18963
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.