But , As a first time , I am getting this error when I tried to check for languages using tesseract --list-langs
//Error opening data file /usr/local/share/tessdata/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory. Failed loading language 'eng' Tesseract couldn't load any languages! Could not initialize tesseract. // -Sibi On Wednesday, November 5, 2014 7:39:09 PM UTC+5:30, sibi kanagaraj wrote: > > Dear Shree and Quan , > > Thank you for the replies. I solved the issue using Shree's direction . > > Here is what I did : > > 1.Removed any previous installations using > sudo apt-get remove tesseract-ocr > sudo apt-get remove --auto-remove tesseract-ocr > > 2.Downloaded the > [tesseract_3.03.03.orig.tar.gz] from > http://packages.ubuntu.com/utopic/tesseract-ocr > > 3.Extracted it to /home/sibi/tesseract-3.03 > > 4.Folowed the normal procedure of > > ./autogen.sh > ./configure > make > sudo make install > sudo ldconfig > > 5.Then used > > make training > sudo make training-install > > 6.To confirm if the training module is installed did a check with the > commands > "unicharset_extractor" > "text2image" . > > Thank you once again Shree and Quan . > > -Sibi > > On Wednesday, November 5, 2014 7:18:03 PM UTC+5:30, Quan Nguyen wrote: >> >> I read Ubuntu 14.10 has Tesseract training executable. >> >> On Wednesday, November 5, 2014 7:41:12 AM UTC-6, shree wrote: >>> >>> Did you install the latest version from >>> http://packages.ubuntu.com/utopic/tesseract-ocr >>> >>> If so, it should have the trainingtools. >>> >>> Try >>> which text2image >>> to see if it installed >>> >>> >>> ShreeDevi >>> ____________________________________________________________ >>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >>> >>> On Wed, Nov 5, 2014 at 7:03 PM, sibi kanagaraj <[email protected]> >>> wrote: >>> >>>> Dear all , >>>> >>>> I am using ubuntu 14.04 LTS on a 64 bit machine . >>>> >>>> I installed Tesseract using apt-get install tesseract-ocr >>>> >>>> The tesseract-v gives me the following >>>> >>>> tesseract 3.03 >>>> leptonica-1.71 >>>> libjpeg 8d : libpng 1.2.50 : libtiff 4.0.3 : zlib 1.2.8 >>>> >>>> Now I wanted to install the training module . >>>> >>>> Tried the following link >>>> https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 >>>> >>>> which says >>>> >>>> "Beginning with 3.03, if you're compiling Tesseract from source you >>>> need to make and install the training tools with separate make commands. >>>> Once the above additional libraries have been installed, run the following >>>> from the tesseract source directory:" >>>> >>>> I have not installed it from source . I have installed form Repo . How >>>> and where am i to run the >>>> >>>> make training >>>> sudo make training-install >>>> >>>> Checking for any Tesseract related files , I only notice a >>>> /usr/share/tesseract-ocr/tessdata which is present . >>>> >>>> Thank you >>>> Sibi >>>> >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To post to this group, send email to [email protected]. >>>> Visit this group at http://groups.google.com/group/tesseract-ocr. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/9bca7a52-f411-4746-8281-c9f949596305%40googlegroups.com >>>> >>>> <https://groups.google.com/d/msgid/tesseract-ocr/9bca7a52-f411-4746-8281-c9f949596305%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/abf26e03-a499-493f-b302-dbb2c606b2e6%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

