-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"Load Result" with parsererror #326
Comments
Don't remember anything about OCR4all integration (request API), but I often see this error with valid PAGE-XML files when
(This is due to the parser from PRImA being not very robust, and not exposing the internal cause of error correctly.) |
Excuse the late reply, I somehow totally overlooked this issue. |
Except for the last point ( I don't have examples readily available, but it should be straightforward to construct some from your existing test cases. |
I'm not an XML schema expert so the following train of thought might be flawed but I'd be interested to know why the above mentioned cases wouldn't make the XML invalid?
|
You're right. Dangling IDREF should make the document invalid as of XML specification. I had based my judgement on the behaviour of the libxml2 implementation, which does not check IDREF.
Right again, my bad.
Again, you're spot on. Sorry for my sloppy nonsense! (I carried this misconception with me for quite some time...) |
I'll close this for now, feel free to reopen this @l0rn0r if the issue still persists and isn't caused by invalid PAGE XML (or if the invalid PAGE XML is produced by OCR4all). |
Hello
I'm running the OCR4all Docker container on my Ubuntu 20.04.
It works quite well but there is an error, when I tried to load a PageXML in LAREX.
When I have a page in the LAREX editor, which went through every OCR4all steps till recognition, I wanted to load an already existing PageXML of this page - to check if I could load a ground truth text for training - I get the error message:
"Couldn't retrieve annotations from file."
And in the console it says
"request:/file/upload/annotations - fail 'parsererror'"
which is indicated by
Larex/resources/js/viewer/communicator.js
, Line 17 - failed Post-request.The writing permissions of the data-folder on the server should be good (777).
The PageXML file is v2013-07-15.
Any hint for this problem?
Or any hint how to load ground truth from existing PageXMLs to train a new model?
The text was updated successfully, but these errors were encountered: