Group By on JSON Data Causes Primary To Crash #1265
Comments
Additional information: I removed the ORDER BY clause from the query and it started working. Does this mean there could be bad data somewhere that is causing the crash? Hopefully the crash dump above points you in the right direction. Thank you.
Could you provide the remote indexes with their configs, along with the master index config, so that we can reproduce this crash locally and investigate?
Unfortunately, I am not authorized to upload our indexes. I did try to recreate the issue in our UAT environment by processing the catalog's data there, but was unable to reproduce it. For now, moving the sorting into the application will have to do. I will close the ticket, as nothing further can be diagnosed or provided.
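For anyone needing the same stopgap, here is a minimal sketch of sorting grouped rows in the application instead of using ORDER BY. It assumes the rows have already been fetched as a list of dicts; the function name `sort_grouped_rows` and the column name `facet` are hypothetical and not taken from the actual query.

```python
from operator import itemgetter


def sort_grouped_rows(rows, key, descending=False):
    """Sort already-fetched GROUP BY rows on the client side.

    Rows whose sort value is missing or None are placed last, so a
    document without the JSON key cannot break the comparison.
    """
    present = [r for r in rows if r.get(key) is not None]
    missing = [r for r in rows if r.get(key) is None]
    present.sort(key=itemgetter(key), reverse=descending)
    return present + missing


# Hypothetical rows, shaped like a grouped result set.
rows = [
    {"facet": "b", "count": 2},
    {"facet": None, "count": 1},
    {"facet": "a", "count": 5},
]
sorted_rows = sort_grouped_rows(rows, "facet")
```

The trade-off is that every group has to be transferred to the client before sorting, so this only suits result sets of modest size.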
Could you check each local index behind the distributed RewardTranslationIndexEn with indextool? Could you also check whether the crash still persists on the recent 6.2.0 release in your UAT environment?
I ran that previously as well, without issue. Here is the output for one of the four nodes; each one passed successfully: `indextool -c /etc/manticoresearch/manticore.conf --check RewardTranslationIndexEn` reported `using config file '/etc/manticoresearch/manticore.conf'...`
Unfortunately, as for updating our UAT environment, we can't recreate the issue there even after importing the data, so updating to the latest dev build won't help us here. The issue is difficult to recreate; otherwise I would have created a pared-down index for you to test against.
I will make one general observation. The only issues we have had in the past were with queries that filter on JSON data, as in this instance. They were usually minor in nature, but they caused Manticore to crash, and we always worked around them. The most recent one was a crash when a JSON object key started with a number; prefixing the object key with an underscore '_' fixed it, e.g. "category_facets.catalog_6407._48938dhd3872d".
In the current instance, however, the values are readable ASCII strings, so there is nothing special in the data being sorted on. I even forced the JSON key/array data structures to match sequentially, in case some optimization in the background was causing an out-of-bounds access. Maybe the issue lives in the code that takes the JSON data and sorts on it. Perhaps some optimization makes assumptions when accessing the JSON structure? I just don't know. If you have any other questions, please let me know. In any case, we look forward to your next release.
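The underscore workaround for keys that start with a number can be applied to the JSON before indexing. The following is only a sketch of that idea, assuming the documents are available as parsed Python dicts; `prefix_numeric_keys` is a hypothetical helper and not part of Manticore.

```python
def prefix_numeric_keys(value, prefix="_"):
    """Return a copy of a parsed JSON value in which every object key
    that begins with a digit is prefixed (by default with '_').

    Nested objects and arrays are processed recursively; scalar values
    are returned unchanged.
    """
    if isinstance(value, dict):
        return {
            (prefix + k if k[:1].isdigit() else k): prefix_numeric_keys(v, prefix)
            for k, v in value.items()
        }
    if isinstance(value, list):
        return [prefix_numeric_keys(v, prefix) for v in value]
    return value


# Hypothetical document using the key from the example above.
doc = {"category_facets": {"catalog_6407": {"48938dhd3872d": 1}}}
cleaned = prefix_numeric_keys(doc)
```

Queries then have to address the renamed key, as in the "category_facets.catalog_6407._48938dhd3872d" path above.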
The issue with JSON numeric keys was fixed in 6dd3964 and is part of the 6.2.0 release. However, you need to reindex your data after the package upgrade for the fix to take effect. As for the crash you have now reported, I also tried but was unable to recreate it on simplified data. Please reopen the issue if you see more crashes or have a reproducible example that you could share.
We tried upgrading our production instances to 6.2.0 on Friday evening, and our crashes got worse. The query itself was no longer causing a crash, but the search instances were running out of memory about once an hour. Here are some server logs. We also see nothing in the search logs, because the kernel kills the process before a core dump can be triggered. The log below is for a single node. If you can think of anything that might explain this, let us know; for now we have had to roll back to Manticore 6.0.4.
Thank You,
Does it happen while the SELECT query from the original post is being executed? In any case, the easiest way to solve this issue seems to be for you to upload your index files, config, and the query that causes the problem to our write-only S3, so that we can reproduce and inspect it locally.
When performing a group by query against JSON structured data (which is treated like a facet) across 4 distributed nodes, the primary server making the call crashes when grouping the results from the nodes. The individual nodes successfully return their data to the primary server. The last query in the trace attached below is the query that causes the Manticore instance to crash.
The primary server does not host any local indexes.
Environment:
Manticore 6.0.4 1a3a4ea@230314
Ubuntu 20.04
Clients connect over the MySQL protocol, which is set to port 9312
Agents listen on port 9315