We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
When attempting to extract text from the attached simple PDF file there are some characters missing within the text.
To reproduce the problem simply call pdfStripper.getText() on the attached pdf file (Problematic.pdf)
The text is mostly returned ok, however the following issues are present in the returned text:
making time to reflect and review your -> making time to reect and review your If you find it easier -> If you nd it easier
PdfBox-Android version: [e.g. 2.0.27.0] It happens on all versions of Android SDK (I have tried several)
Problematic.pdf
The text was updated successfully, but these errors were encountered:
This is an unsolved problem https://issues.apache.org/jira/browse/PDFBOX-3248
In this file, the /ToUnicode file maps ligatures to 0 and uses the /ActualText feature in the content stream which PDFBox doesn't support.
Sorry, something went wrong.
Possible solution, that works with the linked file: https://issues.apache.org/jira/browse/PDFBOX-5868?focusedCommentId=17874189&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17874189
No branches or pull requests
When attempting to extract text from the attached simple PDF file there are some characters missing within the text.
To reproduce the problem simply call pdfStripper.getText() on the attached pdf file (Problematic.pdf)
The text is mostly returned ok, however the following issues are present in the returned text:
making time to reflect and review your -> making time to reect and review your
If you find it easier -> If you nd it easier
PdfBox-Android version: [e.g. 2.0.27.0]
It happens on all versions of Android SDK (I have tried several)
Problematic.pdf
The text was updated successfully, but these errors were encountered: