On 1/09/2011 17:47, Alexis Pigeon wrote:
> If your PDF file is tagged, that can be done.
> If not, it'll get *really* difficult to achieve what you want to do,
> if not impossible, due to the very nature of PDF.

Really difficult = use iText's PDF parsing functionality and inspect what is returned in the TextRenderInfo.

Why is this difficult? Because a word isn't necessarily stored as a word in a PDF document. As explained in the book, "Hello World" could be stored as "ld", "Wor", "llo", "He" and visualized as "Hello World" because of the way these word snippets are positioned.

Why could this be impossible? Because you're not even sure that the word is stored as text: it could be an image that looks like a word, or just a bunch of lines and shapes.
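
To make the "ld", "Wor", "llo", "He" case concrete, here is a minimal sketch of how such snippets can be put back in reading order once you know where each one is drawn. It is plain Java and does not call iText: the Snippet class, its fields, and the gap threshold are all made up for this illustration. In a real program the text and coordinates would have to come from whatever the parser reports in TextRenderInfo.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration only: none of these names belong to iText.
class SnippetReassembly {

    /** A piece of text plus the position where it is drawn (made-up model). */
    static final class Snippet {
        final String text;
        final float x;      // where the snippet starts on its baseline
        final float y;      // height of the baseline on the page
        final float width;  // how much horizontal space the snippet takes

        Snippet(String text, float x, float y, float width) {
            this.text = text;
            this.x = x;
            this.y = y;
            this.width = width;
        }
    }

    // Assumed threshold: a horizontal gap wider than this counts as a space.
    static final float SPACE_GAP = 2f;

    /** Rebuilds readable text from snippets stored in arbitrary order. */
    static String reassemble(List<Snippet> snippets) {
        List<Snippet> sorted = new ArrayList<>(snippets);
        // PDF coordinates grow upwards, so a larger y is higher on the page:
        // sort top to bottom first, then left to right.
        sorted.sort(Comparator.comparingDouble((Snippet s) -> s.y).reversed()
                .thenComparingDouble(s -> s.x));

        StringBuilder out = new StringBuilder();
        Snippet prev = null;
        for (Snippet s : sorted) {
            if (prev != null) {
                if (prev.y != s.y) {
                    out.append('\n');   // different baseline: new line
                } else if (s.x - (prev.x + prev.width) > SPACE_GAP) {
                    out.append(' ');    // visible gap: treat as a word break
                }
            }
            out.append(s.text);
            prev = s;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Stored order is scrambled, as in the example from the book.
        List<Snippet> stored = Arrays.asList(
                new Snippet("ld", 48f, 700f, 10f),
                new Snippet("Wor", 30f, 700f, 18f),
                new Snippet("llo", 12f, 700f, 14f),
                new Snippet("He", 0f, 700f, 12f));
        System.out.println(reassemble(stored));
    }
}
```

Note that the space between "Hello" and "World" is never stored in this example; it is only inferred from the gap between two snippets. That guesswork is part of why finding words in an untagged PDF is hard, and the sketch doesn't help at all when the "word" is an image or a set of shapes.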
_______________________________________________
iText-questions mailing list
https://lists.sourceforge.net/lists/listinfo/itext-questions

Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/
