You probably got the source for a different version of Tesseract than the one the package installed. This might not matter, depending on what you are doing. To find out the installed version, run it: you will see 'Tesseract Open Source OCR Engine v3.04.00 with Leptonica' or similar.
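If you would rather check this from a script, here is a rough Python sketch. It assumes the `tesseract` binary is on your PATH and that it accepts `--version`; the function names `parse_version` and `installed_tesseract_version` are made up for illustration and are not part of Tesseract.

```python
import re
import shutil
import subprocess


def parse_version(text):
    """Return the first dotted version number found in text, or None."""
    match = re.search(r"\d+\.\d+(?:\.\d+)*", text)
    return match.group(0) if match else None


def installed_tesseract_version():
    """Ask the tesseract binary on PATH for its version, or return None."""
    if shutil.which("tesseract") is None:
        return None
    # Some releases print version info to stderr, so merge both streams.
    result = subprocess.run(
        ["tesseract", "--version"],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,
    )
    return parse_version(result.stdout)


if __name__ == "__main__":
    print("installed:", installed_tesseract_version())
```

You can then compare the printed number by eye with the version of the source tree you cloned.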
How to train: http://blog.cedric.ws/how-to-train-tesseract-301

Cedric's server is not responding, so try: http://webcache.googleusercontent.com/search?q=cache:RDSrO-_YQLgJ:blog.cedric.ws/how-to-train-tesseract-301+&cd=1&hl=en&ct=clnk&gl=ca&client=ubuntu

On Sunday, October 12, 2014 1:19:17 AM UTC-4, sibi kanagaraj wrote:
>
> Hello all ,
>
> What is the difference between sudo apt-get install tesseract and
> installing from source .
> Let me explain the context so that it could give a clear picture .
>
> I wanted to work with Tesseract and build better Training Data for Tamil
> language as well as see if any modifications could be done to engine also
> to make it more responsive for Tamil .
> 1.Hence installed Tesseract via sudo apt-get install tesseract .
> 2.I could see that the files and folders relating to tessdata were created
> in /usr/share/tesseract-ocr
> 3.And installed all the necessary packages as mentioned here
> https://code.google.com/p/tesseract-ocr/wiki/Compiling
> 4.Now the command tesseract is working fine .
> 5.I wanted to see the debugging of the system , hence posted a question
> here
> https://groups.google.com/forum/#!topic/tesseract-ocr/BxZvoUlxjOc
> 6.I was asked to perform the make file operations in Java folder .
> 7.Since within the /usr/share I could not find java folder , I created a
> folder and tried to do the make .But it did not work out .
> 8.As per zdenop reply in the above , I am trying to configure it from the
> source and hence created a clone using
> git clone https://code.google.com/p/tesseract-ocr/
> 9.Here I could find the /java folder .
>
> My questions are
> 1.What is the difference between package installation and installing from
> source ?
> 2.Where can I find the files if I am doing a package installation ?
> 3.Am I doing the entire process wrong ?
>

