0.15.12

@MthwRobinson MthwRobinson released this 13 Sep 14:39
8b7e5bb

Enhancements

  • Improve pdfminer element processing. Implemented splitting of pdfminer elements (groups of text chunks) into smaller bounding boxes (individual text lines). This prevents information from the object detection model from being lost and makes removal of duplicated pdfminer text more effective.
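
The idea behind this change can be illustrated with a small, self-contained sketch. It is not the library's actual implementation: `TextLine`, `TextGroup`, `split_into_lines`, and `fraction_inside` are hypothetical names, and the coordinates are made up. It shows why line-level boxes are easier to match against a region from an object detection model than one large group-level box.

```python
from dataclasses import dataclass
from typing import List, Tuple

BBox = Tuple[float, float, float, float]  # (x0, y0, x1, y1)


@dataclass
class TextLine:
    """Hypothetical stand-in for a single extracted text line."""
    text: str
    bbox: BBox


@dataclass
class TextGroup:
    """Hypothetical stand-in for a group of text lines extracted together."""
    lines: List[TextLine]

    @property
    def bbox(self) -> BBox:
        # The group box is the smallest box enclosing every line.
        x0s, y0s, x1s, y1s = zip(*(line.bbox for line in self.lines))
        return (min(x0s), min(y0s), max(x1s), max(y1s))


def split_into_lines(group: TextGroup) -> List[TextLine]:
    """Return one element per non-empty line, each keeping its own box."""
    return [
        TextLine(line.text.strip(), line.bbox)
        for line in group.lines
        if line.text.strip()
    ]


def area(box: BBox) -> float:
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])


def fraction_inside(inner: BBox, outer: BBox) -> float:
    """Fraction of `inner`'s area that lies within `outer`."""
    overlap = (
        max(inner[0], outer[0]),
        max(inner[1], outer[1]),
        min(inner[2], outer[2]),
        min(inner[3], outer[3]),
    )
    inner_area = area(inner)
    return area(overlap) / inner_area if inner_area else 0.0


# Made-up example: a title line and a body line extracted as one group.
group = TextGroup(
    lines=[
        TextLine("Title\n", (10.0, 100.0, 200.0, 120.0)),
        TextLine("Body text\n", (10.0, 60.0, 300.0, 80.0)),
    ]
)
# Made-up region that the detection model found around the title only.
detected_title_region: BBox = (0.0, 95.0, 250.0, 125.0)

group_overlap = fraction_inside(group.bbox, detected_title_region)
lines = split_into_lines(group)
line_overlaps = [fraction_inside(l.bbox, detected_title_region) for l in lines]
```

In this sketch the group-level box only partially overlaps the detected title region, so it is ambiguous whether it duplicates that region. After splitting, the title line falls entirely inside the region and the body line falls entirely outside it, so each line can be kept or dropped on its own.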