I was able to build from source using the following Dockerfile: https://github.com/tleyden/docker/blob/master/tesseract-training/Dockerfile
(based on the instructions at https://code.google.com/p/tesseract-ocr/wiki/Compiling).

In that Docker container, all the tools such as combine_tessdata, wordlist2dawg, dawg2wordlist, etc. seem to be working.

On Saturday, July 19, 2014 9:25:12 PM UTC-7, Traun Leyden wrote:
>
> According to
> http://manpages.ubuntu.com/manpages/trusty/man1/combine_tessdata.1.html,
> combine_tessdata should be installed on Ubuntu 14.04, but I'm not seeing it
> anywhere.
>
> # dpkg -L tesseract-ocr
> /.
> /usr
> /usr/share
> /usr/share/man
> /usr/share/man/man1
> /usr/share/man/man1/cntraining.1.gz
> /usr/share/man/man1/tesseract.1.gz
> /usr/share/man/man1/shapeclustering.1.gz
> /usr/share/man/man1/combine_tessdata.1.gz
> /usr/share/man/man1/unicharset_extractor.1.gz
> /usr/share/man/man1/ambiguous_words.1.gz
> /usr/share/man/man1/wordlist2dawg.1.gz
> /usr/share/man/man1/mftraining.1.gz
> /usr/share/man/man1/dawg2wordlist.1.gz
> /usr/share/doc
> /usr/share/doc/tesseract-ocr
> /usr/share/doc/tesseract-ocr/copyright
> /usr/bin
> /usr/bin/tesseract
> /usr/share/doc/tesseract-ocr/changelog.Debian.gz
>
> It seems like the ubuntu package only installs the /usr/bin/tesseract
> binary and the man page for combine_tessdata.1.gz
>
> To install combine_tessdata, will I need to build from source?
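For anyone who wants to check quickly which of the training tools are actually on the PATH, whether on the stock Ubuntu install or inside a container built from source, something like the following might help. It is only a minimal sketch: the helper name missing_tools is made up for illustration, and the tool list is just the three names mentioned in this thread.

```shell
#!/bin/sh
# Hypothetical helper (not part of tesseract): print the name of every
# listed tool that cannot be found on PATH. Prints nothing if all are found.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Training tools named in this thread; extend the list as needed.
missing_tools combine_tessdata wordlist2dawg dawg2wordlist
```

If the last command prints nothing, all three tools were found; any name it prints is a tool that is not on the PATH.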

