Yeah, I already done a StreamEngine to get image scale and position, 
And I just took a look at Annotation example and it cleared my path.
Now, as you suggested, I'm going to extends the textStripper to get the
position of the text.
I'm afraid to have problem with fonts, are you having some king of this
problem?



Thanks,
José Rodolfo Carrijo de Freitas
Analista de Sistemas
Softplan - Departamento de pesquisa e desenvolvimento
Sistema da Qualidade Certificado ISO 9001:2008
(48) 3027 8000 Ramal 8359
http://www.softplan.com.br

-----Mensagem original-----
De: Julien Plée [mailto:[email protected]] 
Enviada em: sexta-feira, 17 de setembro de 2010 15:59
Para: [email protected]
Assunto: Re: wrap text with links

Yes, it is. This is almost what I am working on at the moment.
To prevent you from wasting much time on research, have a look at the  
PDFStreamEngine (more precisely override the processTextPosition  
function). If you manage to extend PDFTextStripper, it may be better  
since it manages text flows even if it is columned layered. I didn't  
manage to do this and PDFStreamEngine suites my needs at the moment.

In the PDF, text is cut in groups of words... and sometimes even words  
are cut in half. You'll have to process the text flow with a back  
match memory when parsing the flow.
You'll need to deal with the graphic state (to get the text  
coordinates) and will have to hack it a bit to get the approximate  
position of words or sentences you are looking for (because of the  
text flow structure).


Julien PLÉE


Le 17 sept. 10 à 20:24, José Rodolfo Carrijo de Freitas a écrit :

> Hello,
>
> Do you believe it is possible to read a text from a pdf and wrap a  
> text with
> a link?
>
> For example:
>
> if it founds “pdfbox” on the box, it will link it to the pdfbox  
> website.
>
>
>
> Thanks,
>
> José Rodolfo Carrijo de Freitas
> Analista de Sistemas
> Softplan - Departamento de pesquisa e desenvolvimento
>
> Sistema da Qualidade Certificado ISO 9001:2008
> (48) 3027 8000 Ramal 8359
> http://www.softplan.com.br
>
>
>


Reply via email to