-
Notifications
You must be signed in to change notification settings - Fork 211
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test on a paragraph #301
Comments
As long as you're providing the coordinates of the lines with a PAGE XML file, you can run your model on the page or paragraph image. |
Thanks a lot for your reply! |
@andbue |
As I said before, the image alone won't do it. You need a XML file structured according to the PAGE XML schema containing the line coordinates. How you achieve this is not the business of Calamari-OCR. Maybe the most simple way for you would be to just use Aletheia, this has been proposed here and over there PRImA-Research-Lab/PAGE-XML#30 (comment) as well. Please don't try to save time by stealing other people's time e.g. by @-ing all developers you can get hold of at once in a github issue and then just asking them to explain to you the most basic workings of their code. We're doing our best to provide you with documentation for calamari, it is even included in a GUI at OCR4all. The internet is full of blog posts and tutorials explaining how the different steps of text recognition with line based OCR engines (calamari, kraken, ocropus, tesseract...) work. Please, do some reading first and then ask questions directly related to the project, showing that you've read the docs and that you've tried at least some google search on the topic. |
Hello!
@ChWick @andbue
Please can I train calamari on lines of images and testing it on a paragraph?
Please how can I doing this?
Thanks in advance!!
The text was updated successfully, but these errors were encountered: