[iText-questions] Exception during parsing pdf

vishal mittal Fri, 28 Mar 2014 00:42:08 -0700

Hi,

I am using the latest i-text pdf to parse / read the pdf and am getting the
following exceptions


*option 1 snippet:*
   strategy = (TextExtractionStrategy) parser.processContent(i, new
SimpleTextExtractionStrategy());

*option 1 exception:*
   java.lang.StringIndexOutOfBoundsException: String index out of range: 0



*option 2 snippet:*
String myLine = PdfTextExtractor.getTextFromPage(reader, j);

*option 2 exception:*
 java.lang.IllegalArgumentException: Comparison method violates its general
contract!


*option 3 snippet:*
   String parsedText = pdfStripper.getText(pdDoc);

Using the PDFTextStripper, I am not getting any exception and am getting
the output too, but the table content in pdf is coming one column after
another. Basically, first column is getting parsed, then second column..
and so on

I wanted to get the output line-by-line (row wise)

I have attached the pdf file also for your reference

Would really appreciate if you could provide me a solution to parse the pdf
line-by-line

Waiting for you reply

Regards,
Vishal Mittal

test_dec.pdf
Description: Adobe PDF document

------------------------------------------------------------------------------

_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

[iText-questions] Exception during parsing pdf

Reply via email to