-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
📦 v2023-12-07 #400
📦 v2023-12-07 #400
Conversation
It looks like CI has problems with ocr-fileformat, maybe because of stricter tests. |
Yes, the problem is in textract2page. cc @rue-a.
|
Yeah, and I can reproduce locally, will preparare a PR after tech call |
See slub/textract2page#13 for a hackish fix. |
Now updating ocrd_fileformat to include UB-Mannheim/ocr-fileformat#171 which in turn includes slub/textract2page#13 to test the CI. |
Updates core to v2.59.1 which includes the workflow endpoint, additional features for chunking and additional output formats for
ocrd workspace list-page
; fixing the file naming in the bagger; and the filtering by file group forclone
,zip bag
etc.@stweil improved the
page2img
script in format-converters significantly.@mikegerber did some house cleaning work on dinglehopper and ocrd_calamari
ocrd_pagetopdf should now work properly on MacOS and supports the METS Server.
workflow-configuration contains additional XSLT to detect ID clashes and add missing confidence values, supports pretty printing XML in the CLIs and supports the METS Server.
tesseract is also updated to the latest state in master.
I will merge this tomorrow, let me know if I missed something.I forgot to click on "Create pull request". Will merge ASAP once the CI is fixed.