@@ -309,6 +309,7 @@ void Tesseract::SearchWords(PointerVector<WERD_RES>* words) {
                 word_certainty);
         word->best_choice->print();
       }
+      word->best_choice->set_certainty(word_certainty);
theraysmith
Author
Contributor
       // Discard words that are impossibly bad, but allow a bit more for
       // dictionary words, and keep bad words in non-space-delimited langs.
       if (word_certainty >= RecodeBeamSearch::kMinCertainty ||
@@ -324,7 +325,6 @@ void Tesseract::SearchWords(PointerVector<WERD_RES>* words) {
         // It is a dud.
         word->SetupFake(lstm_recognizer_->GetUnicharset());
       }
-      word->best_choice->set_certainty(word_certainty);
     }
   }
 }
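The two hunks move a single statement: `set_certainty(word_certainty)` now runs before the accept/reject branch instead of after it. The sketch below is a minimal illustration of what that reordering can change, not Tesseract's actual code. `Choice`, `Word`, both constants, and the simplified one-term rejection test are hypothetical stand-ins, and it assumes that `SetupFake` replaces `best_choice` wholesale.

```cpp
#include <string>

// Hypothetical stand-ins for illustration only; these are not Tesseract's types.
struct Choice {
  std::string text;
  float certainty;
};

constexpr float kMinCertainty = -20.0f;   // assumed threshold, not the real constant
constexpr float kFakeCertainty = -99.0f;  // made-up value carried by a fake choice

struct Word {
  Choice best_choice{"word", 0.0f};
  // Assumption: like WERD_RES::SetupFake, this discards the current best_choice.
  void SetupFake() { best_choice = Choice{"", kFakeCertainty}; }
};

// Ordering before the commit: the certainty is stored after the branch,
// so it also lands on the fake choice of a rejected word.
void StoreAfterBranch(Word* word, float word_certainty) {
  if (word_certainty < kMinCertainty) word->SetupFake();  // "It is a dud."
  word->best_choice.certainty = word_certainty;
}

// Ordering after the commit: the certainty is stored before the branch,
// so a rejected word keeps whatever SetupFake() assigned.
void StoreBeforeBranch(Word* word, float word_certainty) {
  word->best_choice.certainty = word_certainty;
  if (word_certainty < kMinCertainty) word->SetupFake();
}

// Returns true when the two orderings leave different certainties on a dud.
bool OrderingsDifferForDud() {
  Word before_patch, after_patch;
  StoreAfterBranch(&before_patch, -25.0f);
  StoreBeforeBranch(&after_patch, -25.0f);
  return before_patch.best_choice.certainty != after_patch.best_choice.certainty;
}
```

Under these assumptions the two orderings agree for accepted words and differ only for rejected ones. Whether that difference relates to the behavior reported in the comment that follows is not something the sketch can establish.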
This change in ccmain/linerec.cpp has a negative effect on both execution time and accuracy in my tests. With my example image, the time needed for
tesseract 481659978_08_0081.tif test -l deu+eng --oem 1
increases from 36 s to 58 s (user time). The recognized text shows several new errors, especially Kiindigungs- instead of Kündigungs- and Uster instead of öster-,