Avoid to run OCR for digital PDF files #4

Open
mykolamelnykml opened this issue Nov 26, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

Collaborator

mykolamelnykml commented Nov 26, 2024

We need to detect whether a PDF page contains a text layer and skip the OCR call for that page.

Also add an option to force OCR on all pages. This is sometimes useful for scanned documents whose existing text layer was poorly recognized.
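
A minimal sketch of the decision logic described above, not the project's actual implementation. It assumes the text of each page has already been extracted by some PDF library (for example PyMuPDF's `page.get_text()`); the names `has_text_layer`, `pages_needing_ocr`, `force_ocr`, and `min_chars` are hypothetical and chosen only for illustration.

```python
from typing import Iterable, List


def has_text_layer(page_text: str, min_chars: int = 1) -> bool:
    """Return True if the extracted text holds at least `min_chars`
    non-whitespace characters (hypothetical heuristic)."""
    visible = sum(1 for ch in page_text if not ch.isspace())
    return visible >= min_chars


def pages_needing_ocr(
    page_texts: Iterable[str],
    force_ocr: bool = False,
    min_chars: int = 1,
) -> List[int]:
    """Return the zero-based indices of pages that should be sent to OCR.

    `page_texts` is assumed to hold the text already extracted from each
    page's text layer. With `force_ocr=True` every page is selected,
    regardless of whether it has a text layer.
    """
    return [
        index
        for index, text in enumerate(page_texts)
        if force_ocr or not has_text_layer(text, min_chars)
    ]


# Page 0 is digital; pages 1 and 2 have no usable text layer.
print(pages_needing_ocr(["Invoice 42", "  \n", ""]))
# The force option selects every page.
print(pages_needing_ocr(["Invoice 42", ""], force_ocr=True))
```

Raising `min_chars` would treat pages with only a few stray characters as scanned, which may help with the poorly recognized text layers mentioned above; the right threshold is an open question.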

mykolamelnykml converted this from a draft issue Nov 26, 2024
mykolamelnykml added the enhancement (New feature or request) label Nov 26, 2024
Projects
Status: Backlog