Skip to content

ReadingOrderDetector and tidying DLA project#123

Merged
EliotJones merged 1 commit intoUglyToad:masterfrom
BobLd:master
Jan 11, 2020
Merged

ReadingOrderDetector and tidying DLA project#123
EliotJones merged 1 commit intoUglyToad:masterfrom
BobLd:master

Conversation

@BobLd
Copy link
Collaborator

@BobLd BobLd commented Jan 10, 2020

  • Tidying the UglyToad.PdfPig.DocumentLayoutAnalysis project by creating folders
  • Adding TextBlock.ReadingOrder
  • Adding IReadingOrderDetector to gets the blocks in reading order and sets the TextBlock.ReadingOrder
  • Adding DefaultReadingOrderDetector that does nothing
  • Adding RenderingReadingOrderDetector that retrieve the blocks' reading order using rendering order (TextSequence) as suggested in Adding TextSequence number to each letter to determine if letters belong to the same Tj operation #61
  • Adding UnsupervisedReadingOrderDetector that retrieve the blocks' reading order using both (spatial) Allen’s interval relations and rendering order (TextSequence). References:
    • Section 5.1 of 'Unsupervised document structure analysis of digital scientific articles' by S. Klampfl, M. Granitzer, K. Jack, R. Kern
    • 'Document Understanding for a Broad Class of Documents' by L. Todoran, M. Worring, M. Aiello and C. Monz.

@codecov-io
Copy link

Codecov Report

Merging #123 into master will not change coverage.
The diff coverage is n/a.

Impacted file tree graph

@@           Coverage Diff           @@
##           master     #123   +/-   ##
=======================================
  Coverage   72.18%   72.18%           
=======================================
  Files         449      449           
  Lines       26849    26849           
=======================================
  Hits        19381    19381           
  Misses       7468     7468

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b4d917d...d5aed82. Read the comment docs.

@EliotJones EliotJones merged commit e7417be into UglyToad:master Jan 11, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants