Special requirements for Hindi and Arabic OCR #239
Comments
Hmm, can you try again using tess-two version 8.0.0? Hindi is working OK for me in both Tesseract and Cube modes on version 8.0.0.
Thanks for the reply. I tried version 8.0.0 but still have the same issue.
What's the error message that's printed to the device log when your app crashes?
Here is the error summary displayed on my Android phone. Screenshot 1 Screenshot 2 Thanks for your help.
I can't reproduce the error that you're seeing. Make sure you're using the correct training data file, from the 3.04.00 tag of the tessdata project. I get the following result for your input image when using the default settings (
Thanks for your reply. However, I still could not resolve the error. I have tried with the training data file from here. This page also says that "For Arabic and Hindi you need both the traineddata file and the cube data files." Thanks a lot for your help.
Yes, you need to install the hin.* files from https://github.com/tesseract-ocr/tessdata/tree/3.04.00. Thanks for reporting this issue. I've created a task (#240) for myself to improve the training data checking for Arabic and Hindi so developers get a clear error message rather than a crash when using the wrong training data files.
Thanks for your reply. I installed all hin.* files from the link you provided, but the app still crashes. Thanks for your help.
The problem is solved. Thanks a lot for your help.
Glad you were able to solve the problem!
Thanks for looking into this issue. After taking a second look at this, I want to make a note here for reference.
Special requirements for Hindi and Arabic OCR
Arabic and Hindi OCR requires the installation of all Cube data files when using
Hindi OCR also works using
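For reference, the kind of pre-flight check discussed in this thread could look roughly like the sketch below. It is not tess-two code: the class and method names (`TrainingDataCheck`, `hasTrainedData`, `hasCubeData`, `describeProblem`) are hypothetical. It also assumes that Cube data files follow the `<lang>.cube.*` naming seen in this thread (for example `ara.cube.*`). It only confirms that at least one such file is present. It does not verify that the full set is installed or that the files come from the 3.04.00 tag.

```java
import java.io.File;

/** Hypothetical helper: checks a tessdata directory before initializing OCR. */
public class TrainingDataCheck {

    /** True if <lang>.traineddata exists as a regular file in tessdataDir. */
    static boolean hasTrainedData(File tessdataDir, String lang) {
        return new File(tessdataDir, lang + ".traineddata").isFile();
    }

    /** True if at least one file named <lang>.cube.* exists in tessdataDir. */
    static boolean hasCubeData(File tessdataDir, String lang) {
        String prefix = lang + ".cube.";   // assumed naming, e.g. "hin.cube."
        String[] names = tessdataDir.list();
        if (names == null) {
            return false;                  // not a directory, or not readable
        }
        for (String name : names) {
            if (name.startsWith(prefix)) {
                return true;
            }
        }
        return false;
    }

    /** Returns null when nothing is missing, otherwise a readable message. */
    static String describeProblem(File tessdataDir, String lang, boolean needsCube) {
        if (!hasTrainedData(tessdataDir, lang)) {
            return "Missing " + lang + ".traineddata in " + tessdataDir;
        }
        if (needsCube && !hasCubeData(tessdataDir, lang)) {
            return "Missing " + lang + ".cube.* files in " + tessdataDir;
        }
        return null;
    }

    public static void main(String[] args) {
        // Usage: java TrainingDataCheck <tessdata-dir> <lang>
        File dir = new File(args.length > 0 ? args[0] : "tessdata");
        String lang = args.length > 1 ? args[1] : "hin";
        String problem = describeProblem(dir, lang, true);
        System.out.println(problem == null ? "Training data looks present." : problem);
    }
}
```

Calling `describeProblem` before initializing the OCR engine would let an app show a message about missing files instead of crashing. Whether Cube data is actually needed for a given language and engine mode is left to the caller through the `needsCube` flag.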
I am trying to build an Android app for Hindi OCR using tess-two. It works for many languages except Hindi. For Hindi, the app crashes as soon as I try to scan any Hindi text. I tried OEM_TESSERACT_ONLY, OEM_TESSERACT_CUBE_COMBINED, and OEM_CUBE_ONLY, as well as PSM_SINGLE_BLOCK, but the app still does not work. Please suggest a solution. Crash:
I included ara.cube.* and used OEM_TESSERACT_ONLY, but the app still crashes.
I also tried OEM_CUBE_ONLY.
Summary:
I am new to Tesseract and Android Studio. I am trying to build an Android app for OCR using tess-two. I got it working with the help of online resources, and it runs for many languages except Hindi. For Hindi, the app crashes right after opening.
Expected result:
Hindi language should also work along with all other languages.
Actual result:
The app crashes when I add the hin.traineddata file and change the language to Hindi.
Tess-two version:
tess-two:5.4.1
Android version:
7.1.2
Phone/device model:
Xiaomi Redmi 4
Phone/device architecture (armeabi, armeabi-v7a, x86, mips, arm64-v8a, x86_64, mips64):
Link to training data used:
https://github.com/tesseract-ocr/tessdata/tree/3.04.00
Link to image used as input: