Op 16/03/2011 14:03, DivyaKambhatla schreef: > we are using iText5.0.5 , PdfTextExtractor.getTextfromPage method. Since > iText2.0.7 does not have the PdfTextExtractor class or package at all. > iText2.1.7 does have it, but, iText2.1.7 does not work for watermarked PDFs. > Only iText5.0.5 works for watermarked PDFs. I think you misunderstood the remark. You've already told us that you're using iText 5.0.5, BUT the sample PDF you've sent, the one "Published by Maney Publishing" was created using iText 2.0.7 (the document says so).
However, when I look inside, I see that the lines of text don't contain any spaces. This means that somebody programmed it this way, because left and right justification is done differently in iText. I was wondering how the PDF was produced, because your problem is caused by the fact that somebody deliberately defined spaces using a value that is less than half of the normal width of the space character in the font that was used. If you could ask the producer of the PDFs to avoid this (because it's no fun to read text where the words are so close to each other), then you won't experience the problem you describe. ------------------------------------------------------------------------------ Colocation vs. Managed Hosting A question and answer guide to determining the best fit for your organization - today and in the future. http://p.sf.net/sfu/internap-sfd2d _______________________________________________ iText-questions mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php
