You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There are 6 xml and jpg named 01...06
Created PDF don't follow this but goes random, e.g.
Page 1 -> 04.jpg
Page 2 -> 03.jpg
Page 3 -> 01.jpg
Page 4 -> 06.jpg
Page 5 -> 02.jpg
Page 6 -> 05.jpg
Is there any convention how to name files?
Thank you!
The text was updated successfully, but these errors were encountered:
The tool processes the PAGE XML files of a directory without sorting the directory entries. Try ls -U to get the unsorted list of directory entries. So the page order in the PDF is not random, but can be unexpected. This happens especially if the PAGE XML were not created in a sorted order, for example when running parallel OCR processes.
I used a trick to get sorted entries. Run these commands in the directory with the (unsorted) PAGE XML files:
mkdir sorted
cd sorted
ln -s ../*.xml .
Then use the newly created directory sorted instead of the unsorted directory.
Of course it would be much better to fix the code and sort the directory entries there.
There are 6 xml and jpg named 01...06
Created PDF don't follow this but goes random, e.g.
Page 1 -> 04.jpg
Page 2 -> 03.jpg
Page 3 -> 01.jpg
Page 4 -> 06.jpg
Page 5 -> 02.jpg
Page 6 -> 05.jpg
Is there any convention how to name files?
Thank you!
The text was updated successfully, but these errors were encountered: