what would you like to do with that content?

BR
Maruan

Am 14.01.2015 um 21:42 schrieb Stefan Falk <[email protected]>:

> Hello pdfbox people!
> 
> I was wondering if anybody can help me with my needs. What I am looking for 
> is a possibility to extract the underlying PDF code from a PDF file by simply 
> selecting an area with your mouse.
> 
> After reading a few things about PDFs I have learned that anything that has 
> to do with extraction anything from a PDF can be a quite hard task.
> 
> So I was wondering if pdfbox could do that somehow. I've taken a rough look 
> at the PDFReader and I noticed that there is e.g. processTextPosition from 
> the class PageDrawer that seem to allow me to get at least the position from 
> Text - am I right in assuming that?
> 
> My concrete question would be what is possible with pdfbox regarding this 
> matter? E.g. I have a PDF on my drive which text seems to be "extractable" by 
> pdfbox on the one hand but on the other hand the PDFReader is not able to 
> render any of it. It just renders the images (see attachment).
> 
> Thank you for your help in advance!
> 
> Best regards,
> Stefan

Reply via email to