see: http://code.google.com/p/tesseract-ocr/issues/detail?id=340 http://code.google.com/p/tesseract-ocr/issues/detail?id=391 http://code.google.com/p/tesseract-ocr/issues/detail?id=443
Zdenko On Fri, Feb 25, 2011 at 9:15 AM, Nicolas Raoul <[email protected]>wrote: > I get the following error on a TIFF created from a PDF by ImageMagick: > > tesseract file.tiff ocred -l eng > Tesseract Open Source OCR Engine with Leptonica > TIFFstream: Sorry, can not handle image. > Error in pixReadFromTiffStream: failed to read tiffdata > Error in pixReadStreamTiff: pix not read > Error in pixReadTiff: pix not read > > TIFF created from a PDF with ImageMagick: > convert file.pdf -depth 4 file.tiff > > TIFF information (truncated): > tiffinfo file.tiff > TIFF Directory at offset 0xb792a (751914) > Subfile Type: multi-page document (2 = 0x2) > Image Width: 595 Image Length: 842 > Resolution: 72, 72 (unitless) > Bits/Sample: 4 > Compression Scheme: None > Photometric Interpretation: RGB color > FillOrder: msb-to-lsb > Orientation: row 0 top, col 0 lhs > Samples/Pixel: 3 > Rows/Strip: 9 > Planar Configuration: single image plane > Page Number: 0-35 > DocumentName: scanned.tiff > Software: ImageMagick 6.6.2-6 2010-12-02 Q16 http://www.imagemagick.org > TIFF Directory at offset 0x16f688 (1504904) > Subfile Type: multi-page document (2 = 0x2) > Image Width: 595 Image Length: 842 > [...] > > Tesseract 3.0 compiled from source (leptonica and libtiff installed) > on Ubuntu 2010.04. > > Is there a problem with my TIFF? > Could someone point me to a TIFF file that is know to work with > Tesseract? > Thanks a lot! > > Nicolas Raoul > ECM consultant in Tokyo > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

