On 1/09/2011 17:47, Alexis Pigeon wrote:
> If your PDF file is tagged, that can be done.
> If not, it'll get *really* difficult to achieve what you want to do, 
> if not impossible, due to the very nature of PDF.
Really difficult = use iText's PDF parsing functionality and inspect 
what is returned in the TextRenderInfo.
Why is this difficult? Because a word isn't necessarly stored as a word 
in a PDF document.
As explained in the book, "Hello World" could be stored as "ld", "Wor", 
"llo", "He" and visualized as "Hello World" because of the way these 
word snippets are positioned.
Why could this be impossible? Because you're not even sure if the word 
is stored as a text: it could be an image that looks like a word, or 
just a bunch of lines and shapes.

------------------------------------------------------------------------------
Special Offer -- Download ArcSight Logger for FREE!
Finally, a world-class log management solution at an even better 
price-free! And you'll get a free "Love Thy Logs" t-shirt when you
download Logger. Secure your free ArcSight Logger TODAY!
http://p.sf.net/sfu/arcsisghtdev2dev
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to