See these related issues:
http://code.google.com/p/tesseract-ocr/issues/detail?id=340
http://code.google.com/p/tesseract-ocr/issues/detail?id=391
http://code.google.com/p/tesseract-ocr/issues/detail?id=443
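
When comparing a failing file against those reports, it can help to pull the per-page properties (Bits/Sample, Samples/Pixel, and so on) out of the tiffinfo output quoted below. The following is only a rough sketch in Python: the function name parse_tiffinfo is made up for illustration, and the parsing assumes the "Key: value" layout seen in the quoted output, not any documented tiffinfo format.

```python
import re


def parse_tiffinfo(text):
    """Split tiffinfo text output into one dict per TIFF directory (page).

    Hypothetical helper: it assumes each page starts with a line beginning
    "TIFF Directory" and that fields look like "Key: value".
    Only the first whitespace-delimited token of each value is kept.
    """
    pages = []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("TIFF Directory"):
            pages.append({})  # start a new page
            continue
        if not pages:
            continue  # ignore anything before the first directory
        # One line may carry several pairs, e.g.
        # "Image Width: 595 Image Length: 842".
        for key, value in re.findall(r"([A-Z][A-Za-z/ ]*?):\s*(\S+)", line):
            pages[-1][key.strip()] = value
    return pages


def describe(pages):
    """Return one short summary string per page."""
    return [
        "page %d: Bits/Sample=%s Samples/Pixel=%s"
        % (i, p.get("Bits/Sample", "?"), p.get("Samples/Pixel", "?"))
        for i, p in enumerate(pages)
    ]
```

Feeding it the saved output of "tiffinfo file.tiff" gives one summary line per page, which makes it easier to spot a page whose bit depth or channel count differs from a file that is known to work.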

Zdenko

On Fri, Feb 25, 2011 at 9:15 AM, Nicolas Raoul <[email protected]> wrote:

> I get the following error on a TIFF created from a PDF by ImageMagick:
>
> tesseract file.tiff ocred -l eng
> Tesseract Open Source OCR Engine with Leptonica
> TIFFstream: Sorry, can not handle image.
> Error in pixReadFromTiffStream: failed to read tiffdata
> Error in pixReadStreamTiff: pix not read
> Error in pixReadTiff: pix not read
>
> TIFF created from a PDF with ImageMagick:
> convert file.pdf -depth 4 file.tiff
>
> TIFF information (truncated):
> tiffinfo file.tiff
> TIFF Directory at offset 0xb792a (751914)
>  Subfile Type: multi-page document (2 = 0x2)
>  Image Width: 595 Image Length: 842
>  Resolution: 72, 72 (unitless)
>  Bits/Sample: 4
>  Compression Scheme: None
>  Photometric Interpretation: RGB color
>  FillOrder: msb-to-lsb
>  Orientation: row 0 top, col 0 lhs
>  Samples/Pixel: 3
>  Rows/Strip: 9
>  Planar Configuration: single image plane
>  Page Number: 0-35
>  DocumentName: scanned.tiff
>  Software: ImageMagick 6.6.2-6 2010-12-02 Q16 http://www.imagemagick.org
> TIFF Directory at offset 0x16f688 (1504904)
>  Subfile Type: multi-page document (2 = 0x2)
>  Image Width: 595 Image Length: 842
> [...]
>
> Tesseract 3.0 compiled from source (leptonica and libtiff installed)
> on Ubuntu 10.04.
>
> Is there a problem with my TIFF?
> Could someone point me to a TIFF file that is known to work with
> Tesseract?
> Thanks a lot!
>
> Nicolas Raoul
> ECM consultant in Tokyo
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>
