I am parsing a PDF and have problems with the text returned by
PdfTextExtractor.getTextFromPage. I am using this version because 5.1.3
throws an exception. When I extract the text from the page(s), words run
together, e.g. "iTextParseError" instead of "iText Parse Error". The change
below to getText() in TextRenderInfo.java fixed the missing spaces. The
remaining problem is that lines of text are duplicated in the output, e.g.
"iText Parse Error\niText Parse Error", although I confirmed that the text
appears only once on the page in the PDF.

    public String getText() {
        return (text == null) ? " " : (text.length() == 0) ? " " : text;
    }

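To make the effect of that change concrete, here is a minimal standalone sketch of the same expression outside iText. The class name PatchedText is hypothetical and only stands in for the patched class; it is not part of the library. It shows that both a null and an empty string come back as a single space, which is presumably why the words stopped running together.

```java
// Hypothetical stand-in for the patched class; NOT the real iText class.
final class PatchedText {
    private final String text;

    PatchedText(String text) {
        this.text = text;
    }

    // Same expression as in my change: null or empty text becomes one space.
    String getText() {
        return (text == null) ? " " : (text.length() == 0) ? " " : text;
    }

    public static void main(String[] args) {
        System.out.println("[" + new PatchedText(null).getText() + "]");    // prints "[ ]"
        System.out.println("[" + new PatchedText("").getText() + "]");      // prints "[ ]"
        System.out.println("[" + new PatchedText("iText").getText() + "]"); // prints "[iText]"
    }
}
```

Note that this turns every empty chunk into a space, so it may also insert spaces where the PDF has none.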
This is the exception mentioned above:

Exception in thread "main" java.lang.StringIndexOutOfBoundsException: String index out of range: 0
    at java.lang.String.charAt(String.java:686)
    at com.itextpdf.text.pdf.parser.LocationTextExtractionStrategy.getResultantText(LocationTextExtractionStrategy.java:121)
    at com.itextpdf.text.pdf.parser.PdfTextExtractor.getTextFromPage(PdfTextExtractor.java:73)
    at com.itextpdf.text.pdf.parser.PdfTextExtractor.getTextFromPage(PdfTextExtractor.java:88)
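
The top frame (String.charAt with index 0) is what Java throws when charAt(0) is called on an empty string. I have not checked what line 121 of LocationTextExtractionStrategy actually does, so treating an empty text chunk as the cause is an assumption. The sketch below only reproduces that failure mode in plain Java; EmptyChunkDemo and both helper methods are hypothetical names, not iText code.

```java
// Hypothetical demo class; it does not use or mirror the iText sources.
final class EmptyChunkDemo {
    // Reads the first character without checking for an empty string.
    static char firstCharUnchecked(String s) {
        return s.charAt(0); // throws StringIndexOutOfBoundsException when s is ""
    }

    // Guarded variant: returns a fallback character for null or empty input.
    static char firstCharOrDefault(String s, char fallback) {
        return (s == null || s.isEmpty()) ? fallback : s.charAt(0);
    }

    public static void main(String[] args) {
        try {
            firstCharUnchecked("");
        } catch (StringIndexOutOfBoundsException e) {
            System.out.println("caught: " + e.getClass().getName());
        }
        System.out.println("[" + firstCharOrDefault("", ' ') + "]");
    }
}
```

If the assumption holds, a guard of this kind (or never returning an empty string from getText()) would avoid the exception.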