On Fri, Dec 14, 2012 at 11:28 AM, Zdenko Podobný <[email protected]> wrote:
> tesseract-ocr use leptonica for image IO. List of supported input type also
> depends on leptonica configuration e.g. if you did not compile jpeg support
> for leptonica, jpeg will be not supported in tesseract-ocr. So creating list
> of supported types would be tricky.
> For possible supported type you can check e.g. leptonica source code[1].
>
> [1] http://tpgit.github.com/Leptonica/imageio_8h_source.html#l00034

Just do:

  tesseract -v

and a message will be printed out not only showing the tesseract
version but also the versions of the image libraries leptonica was
linked with.

But to answer the specific question about PDF input support, no,
leptonica does not support reading PDFs directly.

Side note: Zdenko, at some point, to make support easier, the -v
option might also print out what tesseract thinks is the path to the
TESSDATA directory. A lot of initial problems people have seem to be
related to misunderstanding this important setting.

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to