Skip to content

Commit 5206256

Browse files
iorixxxjrodewig
authored andcommitted
[DOCS] Correct DFI docs regarding stop word removal (#53836)
The documentation of DFI should recommend *not* to [remove stop words][1], since DFI is good at scoring queries that contain common terms: `the wall`, `the sun`, `the who`, etc. [1]:https://lucene.apache.org/core/8_1_1/core/org/apache/lucene/search/similarities/DFISimilarity.html
1 parent 7e25563 commit 5206256

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

docs/reference/index-modules/similarity.asciidoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -124,7 +124,7 @@ This similarity has the following options:
124124
{lucene-core-javadoc}/org/apache/lucene/search/similarities/IndependenceSaturated.html[`saturated`],
125125
{lucene-core-javadoc}/org/apache/lucene/search/similarities/IndependenceChiSquared.html[`chisquared`].
126126

127-
When using this similarity, it is highly recommended to remove stop words to get
127+
When using this similarity, it is highly recommended *not* to remove stop words to get
128128
good relevance. Also beware that terms whose frequency is less than the expected
129129
frequency will get a score equal to 0.
130130

0 commit comments

Comments
 (0)