Il giorno venerdì 24 luglio 2015 09:48:53 UTC+2, Simon Eigeldinger ha 
scritto:
>
> hi, 
>
> sorry missed the point. 
> just reproduced it: 
>
> $ tesseract testing\eurotext.tif testing\eurotext -l eng+deu pdf 
>
> Tesseract Open Source OCR Engine v3.05.00dev with Leptonica 
> Page 1 
> Error in fopenWriteStream: stream not opened 
> Error in pixWrite: stream not opened 
> Error in fopenReadStream: file not found 
> Error in extractG4DataFromFile: stream not opened to file 
> Error in l_generateG4Data: datacomp not extracted 
> Error in pixGenerateCIData: g4 data not made 
> Error in l_generateCIDataForPdf: file testing\eurotext.tif format is 4; 
> unreadab 
> le 
> Error during processing. 
>
>
>
> the pdf comes out but you can't open it. 
> adobe reader shows anerror that it is corrupted. 
> i did another test without pdf. 
>
> $ tesseract testing\eurotext.tif testing\eurotext -l eng+deu 
>
> Tesseract Open Source OCR Engine v3.05.00dev with Leptonica 
> Page 1 
> Warning in pixReadMemTiff: tiff page 1 not found 
>
> It creates a text which seem to contain everything but shows the warning 
> message. 
>
> i recompiled a new version on my fake website so people can play with 
> the training tools as well. 
> so and now i am off for 2 weeks. 
> have a nice time while i am not around. 
>
> greetings, 
> simon 
>
>
>
on cygwin with the just built 3.04.00 package

$ tesseract -l eng+deu eurotext.tif eurotext  pdf
Tesseract Open Source OCR Engine v3.04.00 with Leptonica
Page 1
Warning in pixReadMemTiff: tiff page 1 not found

the pdf is fine (just looks as bad as the original eurotext.tif picture)

Regards
Marco
(tesseract+leptonica cygwin package maintainer)
 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/ad237b30-dbb3-4032-806e-3f6e793f7eed%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to