Hi,


We use iText to produce a PDF document that combines data,
user-supplied PDFs, and system-generated PDFs into one large PDF. Some
of the user-supplied PDF files contain pages that are scanned
documents: they hold only an image of the scan and no searchable text.
I have been looking for OCR products that can extract text from these
image-only pages, with limited success. I have found one, from
Asprise, which I am experimenting with. Does anyone have experience
with OCR/extracting text from scanned pages inside a PDF? Supposing I
can extract this text, can I use iText to insert a hidden text layer
beneath these images that would make these pages text-searchable? I
don't want to change the appearance of the page, but simply insert the
text underneath so that the page becomes searchable.
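
To make the idea concrete, here is a rough sketch of what I understand
a hidden text layer to be at the PDF level: ordinary text operators
with the text rendering mode set to 3 (neither fill nor stroke), so
the text is present for searching but draws nothing. The class below
is a hypothetical helper, not part of iText; the font resource name
"F1" and the coordinates are placeholders, and real OCR output would
need one such run per recognized word or line, positioned over the
image. It only handles text that fits in a simple PDF literal string.

```java
import java.util.Locale;

// Hypothetical helper, not part of iText: builds the raw PDF
// content-stream operators for one run of invisible text.
class InvisibleTextOps {

    // Escape the characters that are special inside a PDF literal string.
    static String escape(String text) {
        return text.replace("\\", "\\\\")
                   .replace("(", "\\(")
                   .replace(")", "\\)");
    }

    // fontResource is the name of a font in the page's resource
    // dictionary (assumed to exist); x and y are in PDF user space.
    static String invisibleText(String fontResource, float size,
                                float x, float y, String text) {
        return String.join("\n",
            "BT",                                                       // begin text object
            String.format(Locale.ROOT, "/%s %.2f Tf", fontResource, size), // font and size
            "3 Tr",                                                     // rendering mode 3: invisible
            String.format(Locale.ROOT, "1 0 0 1 %.2f %.2f Tm", x, y),   // position the text
            "(" + escape(text) + ") Tj",                                // show the string
            "ET");                                                      // end text object
    }

    public static void main(String[] args) {
        System.out.println(invisibleText("F1", 12f, 72f, 700f, "Scanned (OCR) text"));
    }
}
```

My assumption is that iText can emit the same operators through
PdfContentByte (for example via setTextRenderingMode with
TEXT_RENDER_MODE_INVISIBLE) on the content returned by a PdfStamper,
but I have not verified that this gives searchable pages in practice.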



Thanks,

Jody