Crash indexer on morphology line in config #395
Comments
Changing the line to morphology = libstemmer_nl, lemmatize_en has the same result (crash) |
Hi. Can you provide more details on how to reproduce the crash? I can't reproduce it like this:
|
It seems more complicated than my first post. I just uploaded the config (please replace the PHP vars manually) + the DB data (run sql) + stopwords. If I remove any of the lines: stopwords = sphinx_stopwords.txt the indexer crashes. |
From what I see it's the exact |
Is there a workaround to use this morphology without a crash? |
➤ Aleksey N. Vinogradov commented: try this one:
|
Seems to work, not getting the crash anymore. |
➤ Aleksey N. Vinogradov commented: Actually, it doesn't. The produced index has no full-text part; only full-scan filtering by 'title' is possible. |
@cappadaan you have to fix your config/sources anyway |
I already have this. This config is just a completely stripped one to show the crash. Adding an ID does not fix this. |
Well, ok. Let's split the issues then.
|
Is this a final patch? Will it be included in the next version, so my setup does not break when updating? |
➤ Aleksey N. Vinogradov commented: If @adriannuta or you confirm that it fixes the problem, then yes; I see no reason to change anything in it. |
I'm using yum, so I have no idea how to check if this patch works |
well, ok, I've just pushed 59d94ce with the fix. |
That should most probably fix github #395
I still have no idea how to use this. Can you be more specific? |
The current 'dev' repo is just builds from the master branch. You can pick up the latest rpm with this fix from that repo. |
Got it working, thx! I can confirm the patch works. I just indexed the full Sphinx 3.2.1 config without a crash. |
In the indexer it says "Copyright (c) 2017-2020, Manticore Software LTD (http://manticoresearch.com)" should be "Copyright (c) 2017-2020, Manticore Software LTD (https://manticoresearch.com)" |
➤ Aleksey N. Vinogradov commented: Good point! Thank you! |
Got all sorts of new, unknown warnings (never saw these in Sphinx) while trying to start searchd: [13:42.085] [25205] using config file '/home/bla.conf' (22783 chars)... and [Wed Aug 19 19:18:30.435 2020] [25217] WARNING: internal error: non-empty queue on a rotation cycle start, got 1 elements |
➤ Aleksey N. Vinogradov commented: The first is a warning for you. We support TCP Fast Open. If you enable it, master-agent communication will improve, and so will connections via HTTP if the client supports it. Everything is ready on our side; it only needs to be enabled at the system level. So you're welcome to use it, or you may silently ignore the warning: it is an opportunity, not an error. The second, I guess, is just a matter of the way you wrote the config. We have a dedicated index 'type = template', which has no source and no files and is just a container to inherit from (it can also be used to generate snippets, since they need no real index). You use a kind of 'template_index' instead, which is a usual plain index, but incomplete (no data source, no storage). It works in general, but this warning will always be issued for it, since the daemon has no idea whether it is intentionally incomplete or just a 'brick' to build another index from. |
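For readers following the 'type = template' suggestion above, here is a minimal sketch of what such a config might look like. The index names (base_template, products), the source name (products_src) and the path are hypothetical and not taken from the reporter's config; the morphology and stopwords values are the ones quoted in this thread. The child index is assumed to inherit every setting from the template, so it sets its own type explicitly.

```
# Hypothetical shared settings: no source, no files, just a container to inherit from
index base_template
{
    type       = template
    morphology = libstemmer_dutch, lemmatize_en
    stopwords  = sphinx_stopwords.txt
}

# Hypothetical real index: inherits the settings above and adds data source and storage
index products : base_template
{
    type   = plain
    source = products_src                # assumed to be defined elsewhere in the config
    path   = /var/lib/manticore/products # illustrative location
}
```

With a layout along these lines the daemon can tell that the parent is intentionally incomplete, which is the situation the second warning is about.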
I'm closing the issue as the crash is fixed. Feel free to reopen if it makes sense. |
Manticore Search version: 3.5.0
OS: CentOS 7
I am trying to move from Sphinx to Manticore.
This specific line in my Sphinx (3.2.1) config causes a crash on the Manticore indexer:
morphology = libstemmer_dutch, lemmatize_en
Report
*** Oops, indexer crashed! Please send the following report to developers.
Manticore 3.5.0 1d34c49@200722 release
-------------- report begins here ---------------
Current document: docid=23196, hits=1135108
Current batch: minid=225047, maxid=225211
Hit pool start: docid=0, hit=0
-------------- backtrace begins here ---------------
Program compiled with 4.8.5
Configured with flags: Configured by CMake with these definitions: -DCMAKE_BUILD_TYPE=RelWithDebInfo -DDISTR_BUILD=rhel7 -DUSE_SSL=ON -DDL_UNIXODBC=1 -DUNIXODBC_LIB=libodbc.so.2 -DDL_EXPAT=1 -DEXPAT_LIB=libexpat.so.1 -DUSE_LIBICONV=1 -DDL_MYSQL=1 -DMYSQL_LIB=libmysqlclient.so.18 -DDL_PGSQL=1 -DPGSQL_LIB=libpq.so.5 -DLOCALDATADIR=/var/data -DFULL_SHARE_DIR=/usr/share/manticore -DUSE_RE2=1 -DUSE_ICU=1 -DUSE_BISON=ON -DUSE_FLEX=ON -DUSE_SYSLOG=1 -DWITH_EXPAT=1 -DWITH_ICONV=ON -DWITH_MYSQL=1 -DWITH_ODBC=ON -DWITH_PGSQL=1 -DWITH_RE2=1 -DWITH_STEMMER=1 -DWITH_ZLIB=ON -DGALERA_SONAME=libgalera_manticore.so.31 -DSYSCONFDIR=/etc/manticoresearch
Host OS is Linux runner-fa6cab46-project-3858465-concurrent-0 4.19.78-coreos #1 SMP Mon Oct 14 22:56:39 -00 2019 x86_64 x86_64 x86_64 GNU/Linux
Stack bottom = 0x0, thread stack size = 0x20000
Trying system backtrace:
begin of system symbols:
indexer(_Z12sphBacktraceib+0x90)[0x614e80]
indexer(_Z7sigsegvi+0xa2)[0x55c1c2]
/lib64/libpthread.so.0(+0xf630)[0x7fd0794a7630]
/lib64/libc.so.6(+0x13ee07)[0x7fd0783cbe07]
indexer(_ZN14CSphHitBuilder7cidxHitEP16CSphAggregateHit+0x37e)[0x5792ae]
indexer(_ZN13CSphIndex_VLN5BuildERKN3sph8Vector_TIP10CSphSourceNS0_13DefaultCopy_TIS3_EENS0_14DefaultRelimitENS0_16DefaultStorage_TIS3_EEEEii+0x22e4)[0x5bd1a4]
indexer(_Z7DoIndexRK17CSphConfigSectionPKcRK15CSphOrderedHashIS_10CSphString15CSphStrHashFuncLi256EEbP8_IO_FILE+0x15cb)[0x56303b]
indexer(main+0x13b7)[0x55a787]
/lib64/libc.so.6(__libc_start_main+0xf5)[0x7fd0782af555]
indexer[0x55c05f]