Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Segment command fails when trying to output as PageXML/ALTO #597

Closed
fattynoparents opened this issue Apr 28, 2024 · 5 comments
Closed

Segment command fails when trying to output as PageXML/ALTO #597

fattynoparents opened this issue Apr 28, 2024 · 5 comments

Comments

@fattynoparents
Copy link

When trying to use the segment command with output as PageXML/ALTO:

$ kraken -a -i ... segment -bl
$ kraken -x -i ... segment -bl

I get the error:

Segmenting      [04/25/24 11:06:55] 
ERROR    Failed processing /home/user/images/2024.03.07/1.jpg: 
'BaselineLine' object   kraken.py:429  has no attribute 'cuts'
@fattynoparents
Copy link
Author

Version 5.2.4 gives me same error on Ubuntu 22.04 (WSL):

~$ kraken -x -i ~/images/test/79.jpg ~/images/test/79.xml segment -bl
scikit-learn version 1.2.2 is not supported. Minimum required version: 0.17. Maximum required version: 1.1.2. Disabling scikit-learn conversion API.
Torch version 2.1.2+cu121 has not been tested with coremltools. You may run into unexpected errors. Torch 2.0.0 is the most recent version that has been tested.
Loading ANN /home/user/.local/lib/python3.10/site-packages/kraken/blla.mlmodel  ✓
Segmenting      [05/14/24 15:28:58] ERROR    Failed processing /home/user/images/test/79.jpg: 'BaselineLine' object has no  attribute 'cuts'
kraken.py:431

@mittagessen
Copy link
Owner

mittagessen commented May 14, 2024 via email

@fattynoparents
Copy link
Author

I saw that you tagged the 5.2.5 release with my issue https://github.com/mittagessen/kraken/releases/tag/5.2.5
I'm now trying to run the code with a dev version 5.2.6.dev8 but get the following error:

(kraken) user@server:~/Documents/test$ kraken --version
kraken, version 5.2.6.dev8
(kraken) user@server:~/Documents/test$ kraken -x -i 7001.jpg 7001.xml segment -bl
Loading ANN /tmp/yes/envs/kraken/lib/python3.11/site-packages/kraken/blla.mlmodel       ✓
[07/03/24 14:21:12] ERROR  Failed processing 7001.jpg: 
kraken.py:431 /tmp/yes/envs/kraken/lib/python3.11/site-packages/pyarrow/../../.././libbrotlidec.so.1: undefined
                             symbol: BrotliSharedDictionaryDestroyInstance

@dstoekl
Copy link

dstoekl commented Jul 3, 2024

here is a workaround: https://stackoverflow.com/questions/55051431/linux-pyarrow-undefined-symbol

@mittagessen
Copy link
Owner

mittagessen commented Jul 3, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants