tesseract library/engine[1] is separated from language trained data[2]. Main reason for this split is size of trained data and users need only few of them. Trained data should be placed to the same tessdata directory where tesseract looks for config files (well config files are not needed if user use API of popper command line options)
[1] https://github.com/tesseract-ocr/tesseract [2] https://github.com/tesseract-ocr/tessdata Zdenko On Tue, Aug 16, 2016 at 9:06 PM, John Muccigrosso <[email protected]> wrote: > Do I understand correctly that the files that are installed into the > tessdata dir are broken into two groups on Github, the first being the > traineddata files in the tesseract-ocr/tessdata repository and the second > everything else at tesseract-ocr/tesseract/tessdata? > > I'd discovered this because I was using the —tessdata option to point to > my local mirror of the traineddata from Github and of course ran into > problems without the pdf config file. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/89ccb328-6237-4e35-931d-d36834048ab8% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/89ccb328-6237-4e35-931d-d36834048ab8%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8wzPgb49rWBa%2BVMiD%3DBY3E6Ljv1Tgc3hvRHRF6m0qXsAg%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

