On 9/6/07, Ross Presser <[EMAIL PROTECTED]> wrote:
> > > You said you "exported from the djvu viewer with ascii", but your
> > > command line shows PPM? I don't understand.
> > >
> > > When you say your ultimate goal is to get a pdf, do you mean a pdf with
> > > text as text, where the text can be copied to the clipboard? That is going
> > > to require an OCR step, with a Spanish-aware OCR program.
> > >
> > > Put up your PPM for download somewhere. (Zip it this time!)
> >
> > My raw (without ascii) PPM is available at
> >
> > http://rapidshare.com/files/53847538/example.zip
> >
> > (The ZIP file is 2.4 MB.)
> >
> > Yes, I would like to add text to be copied from the pdf to the
> > clipboard. If one has a text file with the text of the document, is it
> > possible to mix it with the image in the pdf file such that one can
> > copy from the pdf file to the clipboard?
>
> Mixing it precisely, so that if I highlight three words from one
> paragraph I get those same words back, is about as complex as
> re-setting the entire text. So no, it doesn't seem feasible.
>
> Your best bet to proceed would be to use the b/w TIFF with an OCR
> package. Such packages often save PDFs with the original image and the
> recognized text linked.

Acrobat Professional does that with its built-in OCR tool. However, I
am looking for a way to accomplish the same on Linux. Maybe the guys on
comp.text.pdf have some expertise regarding how to do it there.
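
For what it's worth, one possible Linux route could be sketched as below.
This is only a sketch under assumptions: it presumes a Tesseract build
recent enough to ship the `pdf` output config, with the Spanish (`spa`)
language data installed. The helper names and file names are hypothetical,
and nothing in this thread confirms that anyone here used this approach.

```python
import shutil
import subprocess
from pathlib import Path


def build_ocr_command(image_path, output_base, language="spa"):
    """Return the argv list for a Tesseract run that writes <output_base>.pdf.

    Hypothetical helper: it only assembles the command, so it can be
    inspected without Tesseract being installed.
    """
    # CLI shape: tesseract <image> <output base> -l <lang> pdf
    # The trailing "pdf" config asks for a PDF that keeps the page image
    # and adds the recognized text as an invisible, selectable layer.
    return ["tesseract", str(image_path), str(output_base), "-l", language, "pdf"]


def make_searchable_pdf(image_path, output_base, language="spa"):
    """Run the OCR step and return the path of the PDF it should produce."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract not found on PATH (assumed to be installed)")
    subprocess.run(build_ocr_command(image_path, output_base, language), check=True)
    return Path(f"{output_base}.pdf")
```

Calling `make_searchable_pdf("page.tif", "page")` would then be expected to
leave a `page.pdf` in which the scanned image is visible and the OCR text
can be copied to the clipboard. How good that text is depends entirely on
the OCR quality for the scan.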

Paul
_______________________________________________
Magick-users mailing list
[email protected]
http://studio.imagemagick.org/mailman/listinfo/magick-users
