[feature request] Processor interface #58
Comments
That would be too generic IMO. Might even be something that core itself will provide in the future. We follow the pattern that processor names are indicative of their respective repository.
The reason we merged Guessing the input type from the fileGrp is not going to be robust:
We also have the important question of backwards compatibility. Thus I advise against any such changes.
I understand that the design decisions of naming processors and parameters might not be ideal, but they are at least consistent. Changing those now would break existing workflows without offering a functional improvement. @stweil I am curious: What are you using ocrd_fileformat for? IIUC you're mostly using tesseract and kraken directly at @UB-Mannheim, both of which support different output formats. I only use ocr(d)_fileformat for plain text output and very rarely to try out something with hOCR. For the bulk of conversions (PAGE to ALTO) we're using the processor of https://github.com/OCR-D/page-to-alto directly, which is much faster.
While using `ocrd-fileformat-transform`, I was thinking about some modifications which I'd like to discuss:

- Would `ocrd-transform` be sufficient as name instead of the current lengthy one?
- `from-to` is a strange parameter name. I see no technical need to inherit the specific needs of `ocr-fileformat` here and would prefer two parameters `from` and `to`. Are there use cases where `from` cannot be omitted? Technically it should be possible to derive it for all entries in the input file group, and typically it will simply be `PAGE`. So most users will only have to provide a `to` parameter or, if it defaults to `ALTO`, no parameter at all.
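To make the proposal concrete, here is a minimal Python sketch of how separate `from` and `to` parameters could be resolved while still accepting the existing combined `from-to` value. It is not the processor's actual code: the function name `resolve_conversion`, the lookup table `MIMETYPE_TO_FORMAT`, the lowercase format names, and the space-separated form of `from-to` are all assumptions made for illustration. Since the replies in this thread caution that guessing the input type is not robust, the sketch raises an error instead of guessing when the type is unknown.

```python
# Hypothetical lookup from a file's MIME type to a format name.
# The entries are assumptions for illustration, not taken from the processor.
MIMETYPE_TO_FORMAT = {
    "application/vnd.prima.page+xml": "page",
    "application/alto+xml": "alto",
    "text/vnd.hocr+html": "hocr",
}


def resolve_conversion(params, input_mimetype=None):
    """Return a (source, target) pair of format names.

    params: dict of processor parameters (hypothetical shape).
    input_mimetype: MIME type of the input file, used only when
    neither 'from-to' nor 'from' is given.
    """
    # Backwards compatibility: a combined value such as "page alto"
    # (assumed to be space-separated) takes precedence if present.
    if "from-to" in params:
        source, target = params["from-to"].split()
        return source, target

    # Proposed behaviour: 'to' defaults to ALTO.
    target = params.get("to", "alto")

    # 'from' is optional; derive it from the MIME type when omitted.
    source = params.get("from")
    if source is None:
        if input_mimetype not in MIMETYPE_TO_FORMAT:
            # Fail loudly rather than guess an input format.
            raise ValueError(
                f"cannot derive 'from' for MIME type {input_mimetype!r}"
            )
        source = MIMETYPE_TO_FORMAT[input_mimetype]
    return source, target
```

With this shape, a call with no parameters on a PAGE input would resolve to a PAGE-to-ALTO conversion, and existing workflows that pass `from-to` would keep working unchanged, which addresses the backwards-compatibility concern at the cost of supporting two parameter styles.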