On 11-Mar-08, at 4:29 PM, Gautam John wrote:

Okay, let me try to explain. I work with a non-profit that publishes
childrens books. And we have these books in English. Now we want to
upload them to a wiki type place, which we are developing but ideas
are welcome, where people can take the English book and translate/edit
the text and it will overlay this text on the pictures using the
original layout. Then they can print it as a book. To upload the
layout and the pictures, I need to do so as JPEGs but need to mark the
text areas. Hence the queries.

One of Adobe's PDF generation tools can scan a page of text, apply OCR to extract the text, then generate a PDF file with the text hidden behind the image, so that you only see the image, but when you search for a piece of text, the relevant area of the image gets highlighted.

Not very web friendly, but a great archival tool. Would that work?

Reply via email to