Replies: 1 comment 5 replies
-
This is a no-can-do sorry. Detecting / Establishing natural reading order is an integral part of the package making it impossible to disable it. If it doesn't work in your case, it would instead be worthwhile to find and fix the problem. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, this is an amazing package. It's really good for processing PDF files. Massive thanks to the team! One BIG wish from me though is to have the natural reading order as an option. I have many PDF's where the natural reading order actually makes it worse (based on using
get_text()
from pymupdf) - it seems the automatic column identification isn't working for my documents. Would it be possible to add this as a feature / an option to theto_markdown()
function?Beta Was this translation helpful? Give feedback.
All reactions