Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test on a paragraph #301

Closed
Tailor2019 opened this issue Jan 6, 2022 · 5 comments
Closed

Test on a paragraph #301

Tailor2019 opened this issue Jan 6, 2022 · 5 comments

Comments

@Tailor2019
Copy link

Hello!
@ChWick @andbue
Please can I train calamari on lines of images and testing it on a paragraph?
Please how can I doing this?
Thanks in advance!!

@andbue
Copy link
Member

andbue commented Jan 6, 2022

As long as you're providing the coordinates of the lines with a PAGE XML file, you can run your model on the page or paragraph image.

@Tailor2019
Copy link
Author

Tailor2019 commented Jan 6, 2022

Thanks a lot for your reply!
@andbue
But how can I prepare this PAGE XML file?
can you give me an example of this xml file? Does exist a tool that can help preparing these coordinates?
Thanks in advance!

@Tailor2019
Copy link
Author

@andbue
If I have like this paragraph
IMG002
:
How can I proceed for recognizing it with Calamari?
Thanks in advance!

@andbue
Copy link
Member

andbue commented Jan 7, 2022

As I said before, the image alone won't do it. You need a XML file structured according to the PAGE XML schema containing the line coordinates. How you achieve this is not the business of Calamari-OCR. Maybe the most simple way for you would be to just use Aletheia, this has been proposed here and over there PRImA-Research-Lab/PAGE-XML#30 (comment) as well.

Please don't try to save time by stealing other people's time e.g. by @-ing all developers you can get hold of at once in a github issue and then just asking them to explain to you the most basic workings of their code. We're doing our best to provide you with documentation for calamari, it is even included in a GUI at OCR4all. The internet is full of blog posts and tutorials explaining how the different steps of text recognition with line based OCR engines (calamari, kraken, ocropus, tesseract...) work. Please, do some reading first and then ask questions directly related to the project, showing that you've read the docs and that you've tried at least some google search on the topic.

@bertsky bertsky closed this as completed Oct 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants