see https://github.com/tesseract-ocr/langdata/tree/master/eng
Zdenko ne 20. 6. 2021 o 7:33 Sim Tov <[email protected]> napĂsal(a): > > Hello, > > it is written in the documentation/Creating Starter Traineddata: > > > https://tesseract-ocr.github.io/tessdoc/tess4/TrainingTesseract-4.00.html#creating-starter-traineddata > > that an "optional word list files" can be supplied for the training > purpose. > > 1. what is the proper format for this file? > 2. is there an example of such a file online? > 3. can a standard MySpell/HunSpell/etc. dictionary be used for this > purpose? If yes - what formats are supported? > > Thank you in advance! > ST > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/ffc64b9c-9020-4398-9d17-c15f832d6b38n%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/ffc64b9c-9020-4398-9d17-c15f832d6b38n%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8y1XkeSz7NwyNpYtO8W%3D5QLny_za-9-w0pMi9poGAeE3A%40mail.gmail.com.

