-
-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Search has issues with words adjacent to puncutation characters #2095
Comments
I also tried same ting in my dev instance I had the same issue. But if you pull the changes and update the version to BookStack v0.29.3. |
Updating the title to be more generic in the interest of merging down some issues. Related to #1037 |
I’ve just realized that searching word parts which are combined through hyphens, doesn't work either. Example: Searching for An english language example would be |
@ssddanbrown we are running into something similar. Running BookStack v21.05.4 for a science fiction authors book series. One of her races are called |
As part of #3043 I've made a change to auto-convert any search terms, that would experience this issue, into exact match terms instead which will run a direct, although less efficient, content match. Doesn't directly solve this but should provide a much better user-experience in such situations. Will be part of the next feature release. |
Hello @ssddanbrown, |
I can confirm this issue is still present on 23.10.2. I also just verified on the demo instance (currently 23.10.4) and hyphenated words are not correctly found. For instance, the pages prod-linode-sparkjet or dev-internal-sparklebike on the demo instance cannot be found if the last term (i.e. sparkjet or sparklebike) is used to search. |
Facing the same issue with hyphenated words, which are pretty common in german text.
@ssddanbrown Any reason to exclude |
This changes indexing so that a.b now indexes as "a", "b" AND "a.b" instead of just the first two, for periods and hypens, so terms containing those characters can be searched within. Adds hypens as a delimiter - #2095
Really it was because they felt more part of a term rather than something to split them by, but I can see the issue that would result. I spent some time on this today to change up the indexing a bit via #5488. There will still be gaps and limitations in search due to the nature of the trying to keep content indexed, using prefix matching, and the use of custom tokenization, but this should solve some of the most common issues here reported about hyphenated words. The mentioned changes will be part of the next feature release. Thanks all for your input! |
for example: it wont find "house" in "big-house"
but it will find "big"
this is the case for book or page titles and for page content
tested with BookStack version v0.29.0
The text was updated successfully, but these errors were encountered: