Hi,

I have a requirement to parse a PDF line by line, regardless of whether the page is laid out in columns. getText(PDDocument) returns the complete text in the same order we would get by copying it with Ctrl+C and Ctrl+V. So when parsing an invoice or receipt that has a row such as "total 231", "total" is extracted as one line and "231" only appears several lines later. The contents of the column that holds 231 are emitted only after "total", so I have no reliable way to associate the total with 231. Any help would be appreciated.
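
To make the desired behaviour concrete, here is a minimal sketch in plain Java of the row grouping I am after. It assumes each text fragment is available together with its x/y position on the page and that y grows downwards. The Fragment class, groupIntoLines and the tolerance value are hypothetical names used for illustration only; they are not PDFBox API.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch: Fragment and groupIntoLines are illustrative names,
// not part of any PDF library.
class LineGrouper {

    /** A piece of text plus the page coordinates where it starts (assumed to be known). */
    static final class Fragment {
        final float x;
        final float y;
        final String text;

        Fragment(float x, float y, String text) {
            this.x = x;
            this.y = y;
            this.text = text;
        }
    }

    /**
     * Puts fragments on the same visual line when their y value is within
     * yTolerance of the first fragment of that line, then orders each line
     * from left to right and joins the texts with a single space.
     */
    static List<String> groupIntoLines(List<Fragment> fragments, float yTolerance) {
        // Sort top to bottom (assumption: y grows downwards on the page).
        List<Fragment> sorted = new ArrayList<>(fragments);
        sorted.sort(Comparator.comparingDouble((Fragment f) -> f.y));

        // Collect fragments into rows based on vertical proximity.
        List<List<Fragment>> rows = new ArrayList<>();
        for (Fragment f : sorted) {
            List<Fragment> last = rows.isEmpty() ? null : rows.get(rows.size() - 1);
            if (last != null && Math.abs(last.get(0).y - f.y) <= yTolerance) {
                last.add(f);
            } else {
                List<Fragment> row = new ArrayList<>();
                row.add(f);
                rows.add(row);
            }
        }

        // Within each row, order left to right and build the line text.
        List<String> lines = new ArrayList<>();
        for (List<Fragment> row : rows) {
            row.sort(Comparator.comparingDouble((Fragment f) -> f.x));
            StringBuilder sb = new StringBuilder();
            for (Fragment f : row) {
                if (sb.length() > 0) {
                    sb.append(' ');
                }
                sb.append(f.text);
            }
            lines.add(sb.toString());
        }
        return lines;
    }

    public static void main(String[] args) {
        // Fragments listed column by column, the order I currently receive them in.
        List<Fragment> fragments = new ArrayList<>();
        fragments.add(new Fragment(50f, 100f, "item"));
        fragments.add(new Fragment(50f, 120f, "total"));
        fragments.add(new Fragment(300f, 100.5f, "12"));
        fragments.add(new Fragment(300f, 120.4f, "231"));

        for (String line : groupIntoLines(fragments, 2.0f)) {
            System.out.println(line);
        }
    }
}
```

With the sample fragments in main, "total" and "231" end up on the same output line even though they arrive in different columns, which is the result I need for the invoice case.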
--
Thanks & Regards,
Elbin K Elias

