Skip to content

apatawari/LabelExtractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LabelExtractor

Extracts the content from the labeled pdf.

Given the labeled PDF files, The files first gets converted to Text file and further labels are extracted from the pdf based certain known parameters. In this case, as its medical datasets, repetition and pattern is identified and based on that key value pair are extracted and are written in the Excel file.

About

Extracts the content from the labeled pdf.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages