On 9/6/07, Ross Presser <[EMAIL PROTECTED]> wrote: > > > You said you "exported from the djvu viewer with ascii", but your > > > commandline shows PPM? I'm not understanding. > > > > > > When you say your ultimate goal is to get a pdf, do you mean a pdf with > > text > > > as text, where the text can be copied to the clipboard? That is going to > > > require an OCR step, with a Spanish-aware OCR program. > > > > > > Put up your PPM for download somewhere. (Zip it this time!) > > > > My raw (without ascii) PPM is available at > > > > http://rapidshare.com/files/53847538/example.zip > > > > (The ZIP file is 2,4 MB large.) > > > > Yes, I would like to add text to be copied from the pdf to the > > clipboard. If one has a text file with the text of the document, is it > > possible to mix it with the image in the pdf file such that one can > > copy from the pdf file to the clipboard? > > Mixing it precisely, so that if I higlight three words from one > paragraph I get it back, is of equivalent complexity to re-setting the > entire text. So no, it doesn't seem feasible. > > Your best bet to proceed would be to use the b/w TIFF with an OCR > package. Such packages often save PDFs with original image and text > linked.
Acrobat Professional does that with its embedded OCR tool. However, I am looking for Linux ways to accomplish the same. Maybe the guys on comp.text.pdf have some expertise regarding how to do it on Linux. Paul _______________________________________________ Magick-users mailing list [email protected] http://studio.imagemagick.org/mailman/listinfo/magick-users
