I've been getting very poor results with jpn_vert, but when training to improve it I'm getting a million errors (below).
I've checked all the existing issues on this here and tried all the suggestions, but I'm still stuck.
Trying this:
gmake training MODEL_NAME=jpn_vert_1 START_MODEL=jpn_vert FINETUNE_TYPE=Impact
I get endless errors like this:
Image data/jpn_vert_1-ground-truth/seg-017-0001.lstmf not trainable
Image too small to scale!! (2x48 vs min width of 3)
Line cannot be recognized!!
[...]
I don't understand this error, as none of my image files for training have these dimensions. They are all vertical strips of Japanese, generally around 120 x 3200 px, all PNG files at 600 dpi. Here is a ZIP (small) of my ground-truth folder.
EDIT: From what I've read elsewhere on the site, it sounds like tesstrain scales every training image to a height of 48 px. Is that correct? If so, how should I train vertical Japanese? Rotate everything 90°? Cutting the vertical images by hand would be a big pain. I know there are hOCR tools for this, but I have checked the hOCR output and the bounding boxes cut through characters all the time (which I assume is why the OCR is giving poor results), so AFAICT that is not a viable solution either.
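If the 48 px scaling guess is right, the numbers in the error message would follow from my image sizes. Below is a rough sketch of that arithmetic, not tesstrain's actual code: the 48 px target height and the minimum width of 3 are taken from the error message, while `scaled_width`, `is_wide_enough` and the rounding rule are my own assumptions for illustration.

```python
# Assumption: each line image is scaled to a fixed height of 48 px with the
# aspect ratio preserved, and scaled widths below 3 px are rejected. Both
# numbers come from the error message; the rounding rule is a guess.
TARGET_HEIGHT = 48
MIN_WIDTH = 3


def scaled_width(width_px, height_px, target_height=TARGET_HEIGHT):
    """Width after scaling an image to target_height, keeping aspect ratio."""
    return round(width_px * target_height / height_px)


def is_wide_enough(width_px, height_px):
    """True if the scaled image would pass the assumed minimum-width check."""
    return scaled_width(width_px, height_px) >= MIN_WIDTH


vertical = scaled_width(120, 3200)   # a typical strip as I have it now
rotated = scaled_width(3200, 120)    # the same strip rotated by 90 degrees

print(f"vertical strip 120x3200 -> {vertical}x{TARGET_HEIGHT}")
print(f"rotated strip 3200x120 -> {rotated}x{TARGET_HEIGHT}")
```

Under these assumptions a 120 x 3200 strip comes out at 2x48, which matches the "2x48 vs min width of 3" in the log, while the rotated strip would come out at 1280x48. That only shows rotation would get past the size check; whether a model trained on rotated lines recognizes vertical text well is a separate question.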
How can I proceed?
My system: macOS Monterey v12.7.6.