Leonard Rosenthol wrote:
At 01:51 PM 3/23/2006, Nicholas Mistry wrote:
I am using iText to merge a series of tiff documents into PDF. After
the merge
we use acrobat professional 7's OCR to allow us to search the entire
document.
This works great.
OK.
Recently, i added some page marks (text) at the top of each page
(using iText),
denoting page numbers, etc.. This now broke the ability to have
Acrobat OCR
the PDF, since it now contains rendered text.
That is correct, as the Acrobat OCR engine will ONLY process
"image only" documents.
My question, is there a way to add some annotations to the page, but
still allow
acrobat to OCR the page?
I would recommend that you OCR first and THEN apply your
"annotations".
My intial gut feeling is to render the text as images, and place them
on the
doc... but i wanted ask if there was an easier way.
That won't help either, as you can only have a SINGLE image on
the page for Acrobat to OCR it. It won't do it for multiple ones, IIRC.
Well, i just wrote a test program that inserts multiple tiff files on a
page. Surprizingly acrobat actually OCR'd it. Waching the status bar
closely, Acrobat first rasterizes the entire page, and then passes it to
the OCR engine.
This was tested on Acrobat 7 Professional, im not sure about previous
versions.
Now this leads me to another question... Why couldnt they have
rasterized the text as well? Or better yet, ignore the text portion
completely..
Anyways, its a workaround... for now.. I am still interested in
learning how to create searchable images, and may add it to the app later..
Thanks again!
-N
-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions