Extracts content from labeled PDF files.
Given a set of labeled PDF files, each file is first converted to a text file, and the labels are then extracted from that text based on certain known parameters. Because the files are medical datasets, their content is repetitive: recurring patterns are identified in the text, key-value pairs are extracted based on those patterns, and the results are written to an Excel file.
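The extraction step can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes the PDF has already been converted to text, that each field appears on its own line as "Label: value", and it writes CSV (which Excel can open) in place of a native Excel file so that only the standard library is needed. The names `PAIR_RE`, `extract_pairs`, and `write_rows`, and the sample record, are hypothetical.

```python
import csv
import io
import re

# Assumed pattern: a label, a colon, then the value up to the end of the line.
# Real documents will likely need a pattern tuned to their own layout.
PAIR_RE = re.compile(
    r"^[ \t]*([A-Za-z][A-Za-z ]*?)[ \t]*:[ \t]*(.+?)[ \t]*$", re.MULTILINE
)


def extract_pairs(text):
    """Return a dict of label -> value for every 'Label: value' line in text."""
    return {label: value.strip() for label, value in PAIR_RE.findall(text)}


def write_rows(records, out):
    """Write one row per record; columns are all labels in first-seen order."""
    columns = []
    for record in records:
        for label in record:
            if label not in columns:
                columns.append(label)
    writer = csv.DictWriter(out, fieldnames=columns, restval="")
    writer.writeheader()
    writer.writerows(records)


# Made-up sample standing in for the text of one converted PDF.
sample_text = """Patient Name: Jane Doe
Age: 42
Diagnosis: Hypertension
"""

record = extract_pairs(sample_text)
buffer = io.StringIO()  # stand-in for an output file opened with newline=""
write_rows([record], buffer)
print(buffer.getvalue())
```

Each PDF produces one record, and the column set is the union of all labels seen, so a file that lacks a given label simply gets an empty cell in that column.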