On Thursday, 24 April 2014 16:39:30 UTC+2, [email protected] wrote:
>
> Hi,
>
> I have installed the tesseract-ocr (v3.03) package within Ubuntu 12.04.3
> LTS and am experiencing difficulty getting it to run.
>

How did you install it?

> I first located the tessdata directory:
>
> $ sudo find / -name tessdata
> /usr/share/tesseract-ocr/tessdata
>
> Then set the environment variable TESSDATA_PREFIX to tessdata's parent
> directory:
>
> $ export TESSDATA_PREFIX=/usr/share/tesseract-ocr/
>
> And then checked to make sure this was set correctly:
>
> $ echo $TESSDATA_PREFIX
> /usr/share/tesseract-ocr/
>
> When trying to run tesseract against a test image I receive the following
> error:
> $ tesseract test.png out
> Error opening data file tesseract/tessdata/eng.traineddata
> Please make sure the TESSDATA_PREFIX environment variable is set to the
> parent directory of your "tessdata" directory.
> Failed loading language 'eng'
> Tesseract couldn't load any languages!
> Tesseract Open Source OCR Engine v3.03 with Leptonica
> AdaptedTemplates != NULL:Error:Assert failed:in file adaptmatch.cpp, line
> 174
> Segmentation fault (core dumped)
>

When I run 'tesseract eurotext.tif - -l xz' (requesting a nonexistent language file), I get different output and there is no segfault. The same happens if I set TESSDATA_PREFIX to a wrong directory (e.g. /tmp) and run 'tesseract eurotext.tif -'. So it looks like you have a strange installation.

> The first line of the error suggests that tesseract is not taking the
> value from TESSDATA_PREFIX and is instead trying to locate the language
> files in 'tesseract/tessdata'.
>
> Can anyone advise where I am going wrong?
>
> Thanks,
> Chris
>
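
As a quick sanity check for the setup described in this thread, the sketch below tests whether a language file is readable at the location the error message implies, i.e. `<TESSDATA_PREFIX>/tessdata/<lang>.traineddata`. The function name `check_tessdata` is hypothetical and not part of tesseract. It only inspects the filesystem, so it cannot tell whether the tesseract binary itself honours the variable.

```shell
# Hypothetical helper (not part of tesseract): report whether
# <prefix>/tessdata/<lang>.traineddata exists and is readable.
# Assumes the prefix is the PARENT of the "tessdata" directory,
# as the v3.03 error message quoted above states.
check_tessdata() {
    prefix="${1:-${TESSDATA_PREFIX:-}}"   # default to the environment variable
    lang="${2:-eng}"                      # default language, as in the thread

    if [ -z "$prefix" ]; then
        echo "TESSDATA_PREFIX is not set"
        return 2
    fi

    # Strip one trailing slash so both /a/b and /a/b/ give the same path.
    file="${prefix%/}/tessdata/${lang}.traineddata"

    if [ -r "$file" ]; then
        echo "found: $file"
        return 0
    else
        echo "missing: $file"
        return 1
    fi
}
```

Running `check_tessdata /usr/share/tesseract-ocr/ eng` with the path from the original post would show whether eng.traineddata is actually present there. If it is found and tesseract still reports a relative path such as tesseract/tessdata/eng.traineddata, that would point at the installation rather than the variable.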

