-
-
Notifications
You must be signed in to change notification settings - Fork 510
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Setting max_threads_per_query = 12 leads to 99.9 CPU load for two threads on 16 core box #1631
Comments
Please show the following at this moment:
|
I have observed a similar failure. Workers would go to 100%, the connection to the client would break (the client receives no response). They were processing sphinx protocol requests querying a text field "all_childs" which can contain the words "child_1", "child_2", ... up to "child_18". These were the hanging queries I had to
RT indices were present, but the queries ran against a non-RT index.
These unspecific queries match a good portion of 230k documents. Other, more specific queries did not crash. After reading this issue, I set Hardware: AMD Ryzen 9 5950X 16-Core Processor, 128 GB RAM
searchd: |
Here's some data |
@Korkman if you can stably reproduce it by running on of the |
could you try to use head of the dev version as it has fixes of CPU limit during FT queries ? |
@tomatolog @sanikolaev @tomatolog Would a workaround be possible in 6.2.12 or can this only be fixed with the release of 6.2.13? |
you could set |
Thanks. I'm closing this issue then. @starinacool feel free to reopen in case it doesn't work for you in the dev version or the upcoming release. |
Describe the bug
2 of 16 worker threads go 99.9 CPU time when I try to change max_threads_per_query from 10 to 12 on a 16 core box. Even after removing all workload from the server these two threads keep consuming 99.9 CPU.
Server cannot be stoped with systemctl stop manticore. Only kill -9 helps.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
All worker threads working normaly.
Describe the environment:
Manticore 6.2.12 dc5144d@230822 (columnar 2.2.4 5aec342@230822) (secondary 2.2.4 5aec342@230822)
Messages from log files:
[Sun Nov 26 06:31:34.042 2023] [634140] caught SIGTERM, shutting down
[Sun Nov 26 06:31:39.550 2023] [634140] WARNING: still 2 alive tasks during shutdown, after 5.508 sec
[Sun Nov 26 06:31:39.701 2023] [634153] rt: table listing_finished: ramchunk saved in 0.150 sec
Additional context
Config:
optimize_cutoff = 8
max_threads_per_query = 10
access_doclists=mmap
access_hitlists=mmap
network_timeout = 20
client_timeout = 300
seamless_rotate = 1
unlink_old = 1
max_packet_size = 64M
max_filter_values = 65535
listen_backlog = 255
max_batch_queries = 32
subtree_docs_cache = 16M
subtree_hits_cache = 32M
binlog_flush = 2
binlog_max_log_size = 128M
expansion_limit = 100
query_log_format = sphinxql
collation_server = utf8_general_ci
collation_libc_locale = ru_RU.UTF-8
query_log_min_msec = 200
predicted_time_costs = doc=64, hit=48, skip=2048, match=64
The text was updated successfully, but these errors were encountered: