Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

free-text notes are not extracted #55

Open
rdiaz02 opened this issue Oct 18, 2020 · 0 comments
Open

free-text notes are not extracted #55

rdiaz02 opened this issue Oct 18, 2020 · 0 comments
Labels

Comments

@rdiaz02
Copy link
Contributor

rdiaz02 commented Oct 18, 2020

"Free-text" notes (what Okular calls "Inline notes" and "Typewriter notes" and Foxit calls "callout", "text box", "typewriter" notes) are not extracted. I suppose this cannot be fixed (pdftools cannot create these annotations; it can edit them, but will not list them. See politza/pdf-tools#438)?

As an example, I am attaching a pdf with an "Inline" and a "Typewriter" note. Both were created with Okular, and then I added more text using pdf-tools: ex-annot.pdf

This non-extraction also affects, of course, similar notes created with, for example, android apps for annotating PDFs.

Both org-noter (https://github.com/weirdNox/org-noter) and org-noter-pdftools (https://github.com/fuxialexander/org-pdftools) are affected.

For the record, these types of notes are not extracted either by Zotfile (http://zotfile.com/) using pdf.js (https://mozilla.github.io/pdf.js/).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants