I get the following error on a TIFF created from a PDF by ImageMagick:

tesseract file.tiff ocred -l eng
Tesseract Open Source OCR Engine with Leptonica
TIFFstream: Sorry, can not handle image.
Error in pixReadFromTiffStream: failed to read tiffdata
Error in pixReadStreamTiff: pix not read
Error in pixReadTiff: pix not read

TIFF created from a PDF with ImageMagick:
convert file.pdf -depth 4 file.tiff

TIFF information (truncated):
tiffinfo file.tiff
TIFF Directory at offset 0xb792a (751914)
  Subfile Type: multi-page document (2 = 0x2)
  Image Width: 595 Image Length: 842
  Resolution: 72, 72 (unitless)
  Bits/Sample: 4
  Compression Scheme: None
  Photometric Interpretation: RGB color
  FillOrder: msb-to-lsb
  Orientation: row 0 top, col 0 lhs
  Samples/Pixel: 3
  Rows/Strip: 9
  Planar Configuration: single image plane
  Page Number: 0-35
  DocumentName: scanned.tiff
  Software: ImageMagick 6.6.2-6 2010-12-02 Q16 http://www.imagemagick.org
TIFF Directory at offset 0x16f688 (1504904)
  Subfile Type: multi-page document (2 = 0x2)
  Image Width: 595 Image Length: 842
[...]
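
For anyone who wants to check the same fields without tiffinfo, here is a minimal sketch (not part of my original setup) that reads a few tags from the first directory of a TIFF using only the Python standard library. The function name `read_first_ifd` and the choice of tags are illustrative assumptions. It only handles classic TIFF (not BigTIFF) and only the BYTE, SHORT and LONG field types.

```python
import struct

# Tag IDs from the TIFF 6.0 specification (only a handful are decoded here)
TAGS = {256: "ImageWidth", 257: "ImageLength", 258: "BitsPerSample",
        259: "Compression", 262: "PhotometricInterpretation",
        277: "SamplesPerPixel"}

# struct format characters for TIFF field types 1 (BYTE), 3 (SHORT), 4 (LONG)
TYPE_FORMATS = {1: "B", 3: "H", 4: "I"}


def read_first_ifd(data):
    """Return selected tags from the first IFD of a TIFF given as bytes."""
    order = {b"II": "<", b"MM": ">"}.get(data[:2])
    if order is None:
        raise ValueError("not a TIFF file")
    magic, ifd_offset = struct.unpack(order + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a classic TIFF file")
    (count,) = struct.unpack_from(order + "H", data, ifd_offset)
    tags = {}
    for i in range(count):
        entry = ifd_offset + 2 + 12 * i  # each IFD entry is 12 bytes
        tag, ftype, n = struct.unpack_from(order + "HHI", data, entry)
        if tag not in TAGS or ftype not in TYPE_FORMATS:
            continue
        fmt = order + str(n) + TYPE_FORMATS[ftype]
        if struct.calcsize(fmt) <= 4:
            pos = entry + 8  # small values are stored inline in the entry
        else:
            # larger values live elsewhere; the entry holds their offset
            (pos,) = struct.unpack_from(order + "I", data, entry + 8)
        tags[TAGS[tag]] = list(struct.unpack_from(fmt, data, pos))
    return tags
```

Calling `read_first_ifd(open("file.tiff", "rb").read())` should report the same Bits/Sample and Samples/Pixel values as the first directory in the tiffinfo output above, assuming the file is a classic TIFF.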

Tesseract 3.0 compiled from source (Leptonica and libtiff installed)
on Ubuntu 10.04.

Is there a problem with my TIFF?
Could someone point me to a TIFF file that is known to work with
Tesseract?
Thanks a lot!

Nicolas Raoul
ECM consultant in Tokyo
