I get the following error on a TIFF created from a PDF by ImageMagick: tesseract file.tiff ocred -l eng Tesseract Open Source OCR Engine with Leptonica TIFFstream: Sorry, can not handle image. Error in pixReadFromTiffStream: failed to read tiffdata Error in pixReadStreamTiff: pix not read Error in pixReadTiff: pix not read
TIFF created from a PDF with ImageMagick: convert file.pdf -depth 4 file.tiff TIFF information (truncated): tiffinfo file.tiff TIFF Directory at offset 0xb792a (751914) Subfile Type: multi-page document (2 = 0x2) Image Width: 595 Image Length: 842 Resolution: 72, 72 (unitless) Bits/Sample: 4 Compression Scheme: None Photometric Interpretation: RGB color FillOrder: msb-to-lsb Orientation: row 0 top, col 0 lhs Samples/Pixel: 3 Rows/Strip: 9 Planar Configuration: single image plane Page Number: 0-35 DocumentName: scanned.tiff Software: ImageMagick 6.6.2-6 2010-12-02 Q16 http://www.imagemagick.org TIFF Directory at offset 0x16f688 (1504904) Subfile Type: multi-page document (2 = 0x2) Image Width: 595 Image Length: 842 [...] Tesseract 3.0 compiled from source (leptonica and libtiff installed) on Ubuntu 2010.04. Is there a problem with my TIFF? Could someone point me to a TIFF file that is know to work with Tesseract? Thanks a lot! Nicolas Raoul ECM consultant in Tokyo -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

