Thanks. Is there any way to get individual character sizes or anything 
less than a large block of text?
I could calculate what I wanted if I knew how to get the rect bounds for 
a specific character within a larger block of text.

Or if I knew how to iterate over the text pieces within a PDF text object 
and then gather further information about each piece. Given that text 
formatting could differ, I can't assume characters are the same size just 
because they are the same letter.

Thanks again,

kb
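
A minimal sketch of the "perform your own analysis" step from the quoted
reply below, assuming the per-character boxes have already been pulled off
the page by some extraction step. WordBoxes, Glyph, Word, groupWords and
maxGap are hypothetical names for illustration, not part of iText. It also
assumes the glyphs arrive in reading order on a single line, and that a word
ends at whitespace or at a horizontal gap wider than maxGap, which will not
hold for every language.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Hypothetical helper (not an iText class): groups character boxes into word boxes. */
final class WordBoxes {

    /** One character with its bounding box; assumed to be extracted already. */
    static final class Glyph {
        final char ch;
        final float x, y, width, height;
        Glyph(char ch, float x, float y, float width, float height) {
            this.ch = ch; this.x = x; this.y = y; this.width = width; this.height = height;
        }
    }

    /** A word and the union of the boxes of its characters. */
    static final class Word {
        final String text;
        final float x, y, width, height;
        Word(String text, float x, float y, float width, float height) {
            this.text = text; this.x = x; this.y = y; this.width = width; this.height = height;
        }
    }

    /**
     * Splits glyphs (in reading order, one line) into words. A word ends at a
     * whitespace glyph or when the gap to the next glyph exceeds maxGap.
     */
    static List<Word> groupWords(List<Glyph> glyphs, float maxGap) {
        List<Word> words = new ArrayList<>();
        StringBuilder text = new StringBuilder();
        float left = 0, bottom = 0, right = 0, top = 0;
        for (Glyph g : glyphs) {
            boolean space = Character.isWhitespace(g.ch);
            boolean gap = text.length() > 0 && g.x - right > maxGap;
            if ((space || gap) && text.length() > 0) {
                // Close the current word before handling this glyph.
                words.add(new Word(text.toString(), left, bottom, right - left, top - bottom));
                text.setLength(0);
            }
            if (space) {
                continue; // whitespace only separates words
            }
            if (text.length() == 0) {
                // First glyph of a new word starts the box.
                left = g.x; bottom = g.y; right = g.x + g.width; top = g.y + g.height;
            } else {
                // Later glyphs grow the box; sizes may differ per glyph.
                bottom = Math.min(bottom, g.y);
                right = Math.max(right, g.x + g.width);
                top = Math.max(top, g.y + g.height);
            }
            text.append(g.ch);
        }
        if (text.length() > 0) {
            words.add(new Word(text.toString(), left, bottom, right - left, top - bottom));
        }
        return words;
    }

    /** Escapes the characters that would break the element content. */
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    /** Renders words in the <word .../> shape used in the original question. */
    static String toXml(List<Word> words) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < words.size(); i++) {
            Word w = words.get(i);
            sb.append(String.format(Locale.ROOT,
                "<word id=\"%d\" x=\"%.0f\" y=\"%.0f\" width=\"%.0f\" height=\"%.0f\">%s</word>\n",
                i, w.x, w.y, w.width, w.height, escape(w.text)));
        }
        return sb.toString();
    }
}
```

Calling WordBoxes.toXml(WordBoxes.groupWords(glyphs, 1f)) on a list of Glyph
values would yield one <word> element per detected word. The box is tracked
per glyph rather than inferred from the letter, since the same letter can be
drawn at different sizes.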


On 7/15/12 11:14 PM, Leonard Rosenthol wrote:
> iText does not have any lexical analysis tools, so it does not know what a
> "word" is.  It only sees the drawing instructions.
>
> So you will need to obtain all of the text and coordinates for the page,
> then perform your own analysis to determine "words".   Don't forget that
> the definition of a "word" differs across languages...
>
> Leonard
>
> On 7/15/12 8:46 PM, "Kalani Bright" <kapaa...@manastudios.com> wrote:
>
>> Hi guys;
>>
>> Here's what I am trying to do; I would appreciate knowing whether this is
>> possible in iText.
>> I'm not interested in constructing PDFs, only in deconstructing existing
>> PDFs to analyze the content and positions of words on the page.
>>
>> Rather than the boundary of all text on the page, I want the boundary info
>> for each word in order to generate some XML for another program I wrote.
>>
>> Something like this...
>> <word id="0" x="0" y="0" width="8" height="4">The</word>
>> <word id="1" x="12" y="0" width="7" height="4">fox</word>
>> <word id="2" x="22" y="0" width="7" height="4">was</word>
>>
>> I know I can do it for a region of text, as shown in Chapter 15 of the
>> iText in Action book, but I really do want it for each individual word so
>> I can generate invisible yet clickable hotspots over what will end up
>> being just a plain image.
>>
>> Is this possible with iText? If so, how would I accomplish something like
>> this?
>>
>> Thanks guys,
>>
>> kb
>>
>>
>>
>>
>> ------------------------------------------------------------------------------
>> Live Security Virtual Conference
>> Exclusive live event will cover all the ways today's security and
>> threat landscape has changed and how IT managers can respond. Discussions
>> will include endpoint security, mobile security and the latest in malware
>> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
>> _______________________________________________
>> iText-questions mailing list
>> iText-questions@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/itext-questions
>>
>> iText(R) is a registered trademark of 1T3XT BVBA.
>> Many questions posted to this list can (and will) be answered with a
>> reference to the iText book: http://www.itextpdf.com/book/
>> Please check the keywords list before you ask for examples:
>> http://itextpdf.com/themes/keywords.php



