Hi, I am using the latest i-text pdf to parse / read the pdf and am getting the following exceptions
*option 1 snippet:* strategy = (TextExtractionStrategy) parser.processContent(i, new SimpleTextExtractionStrategy()); *option 1 exception:* java.lang.StringIndexOutOfBoundsException: String index out of range: 0 *option 2 snippet:* String myLine = PdfTextExtractor.getTextFromPage(reader, j); *option 2 exception:* java.lang.IllegalArgumentException: Comparison method violates its general contract! *option 3 snippet:* String parsedText = pdfStripper.getText(pdDoc); Using the PDFTextStripper, I am not getting any exception and am getting the output too, but the table content in pdf is coming one column after another. Basically, first column is getting parsed, then second column.. and so on I wanted to get the output line-by-line (row wise) I have attached the pdf file also for your reference Would really appreciate if you could provide me a solution to parse the pdf line-by-line Waiting for you reply Regards, Vishal Mittal
test_dec.pdf
Description: Adobe PDF document
------------------------------------------------------------------------------
_______________________________________________ iText-questions mailing list iText-questions@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php