Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aditional languages (rus, ukr) doesn't choosing in OCR language list #612

Open
deimjons opened this issue Apr 20, 2024 · 1 comment
Open

Comments

@deimjons
Copy link

Description
Hello.
I am using a custom docker image with Russian and Ukrainian language packages for tesseract installed, following instructions: https://docs.papermerge.io/3.0/setup/add-ocr-langs/
Dockerfile:

FROM papermerge/papermerge:3.1

# add Ukrainian and Russian OCR languages
RUN apt install tesseract-ocr-rus tesseract-ocr-ukr

Info:

  • Papermerge Version [e.g. 3.1]
    In the container are added languages:
# docker exec -it papermerge bash
# tesseract --list-langs
List of available languages in "/usr/share/tesseract-ocr/5/tessdata/" (11):
deu
eng
fra
ita
nld
osd
por
ron
rus
spa
ukr

I see these languages in OCR languages list but can not choose and Run OCR

Screenshot 2024-04-19 at 18 45 39
@deimjons deimjons added the bug Something isn't working label Apr 20, 2024
@ciur
Copy link
Owner

ciur commented Apr 21, 2024

@ciur ciur added missing-language missing-ocr-language and removed bug Something isn't working labels Apr 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants