Il giorno venerdì 24 luglio 2015 09:48:53 UTC+2, Simon Eigeldinger ha scritto: > > hi, > > sorry missed the point. > just reproduced it: > > $ tesseract testing\eurotext.tif testing\eurotext -l eng+deu pdf > > Tesseract Open Source OCR Engine v3.05.00dev with Leptonica > Page 1 > Error in fopenWriteStream: stream not opened > Error in pixWrite: stream not opened > Error in fopenReadStream: file not found > Error in extractG4DataFromFile: stream not opened to file > Error in l_generateG4Data: datacomp not extracted > Error in pixGenerateCIData: g4 data not made > Error in l_generateCIDataForPdf: file testing\eurotext.tif format is 4; > unreadab > le > Error during processing. > > > > the pdf comes out but you can't open it. > adobe reader shows anerror that it is corrupted. > i did another test without pdf. > > $ tesseract testing\eurotext.tif testing\eurotext -l eng+deu > > Tesseract Open Source OCR Engine v3.05.00dev with Leptonica > Page 1 > Warning in pixReadMemTiff: tiff page 1 not found > > It creates a text which seem to contain everything but shows the warning > message. > > i recompiled a new version on my fake website so people can play with > the training tools as well. > so and now i am off for 2 weeks. > have a nice time while i am not around. > > greetings, > simon > > > on cygwin with the just built 3.04.00 package
$ tesseract -l eng+deu eurotext.tif eurotext pdf Tesseract Open Source OCR Engine v3.04.00 with Leptonica Page 1 Warning in pixReadMemTiff: tiff page 1 not found the pdf is fine (just looks as bad as the original eurotext.tif picture) Regards Marco (tesseract+leptonica cygwin package maintainer) -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ad237b30-dbb3-4032-806e-3f6e793f7eed%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

