Thanks for reply. The question is that if I create a new language "lan", how shall I generate the files you mention? What files do I need to generate them from?
Thanks, Mikayel On Tuesday, May 10, 2016 at 5:39:37 PM UTC+4, marco atzeri wrote: > > On 10/05/2016 14:39, Mikael Egibyan wrote: > > Hi Marco, > > > > Can you please link a tutorial how to generate/create all the specific > > language files? > > > > Thanks! > > Mikayel > > Hi Mikayel, > > It is not clear your request. > > Are you asking about training file ? > On cygwin it works as on the other system > https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract > > Or how to add additional languages to cygwin ? > Anyone in particular ? > > The specific language packages are > just containing the same files from > > https://github.com/tesseract-ocr/tessdata > > $ tar -tf tesseract-ocr-ita-3.04-1.tar.xz > > usr/share/tessdata/ita.cube.bigrams > usr/share/tessdata/ita.cube.fold > usr/share/tessdata/ita.cube.lm > usr/share/tessdata/ita.cube.nn > usr/share/tessdata/ita.cube.params > usr/share/tessdata/ita.cube.size > usr/share/tessdata/ita.cube.word-freq > usr/share/tessdata/ita.tesseract_cube.nn > usr/share/tessdata/ita.traineddata > usr/share/tessdata/ita_old.traineddata > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/4b6e3554-b5a4-4cf7-9e97-d341cc4a929e%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

