Should I add boxes with spaces before punctuation marks? Also I've found this discussion: https://github.com/tesseract-ocr/tesseract/issues/841
It helped me a lot, but I still have questions. What should I put in rus.training_text if I want to generate .lstmf files from my own box/tiff pairs? The text from the images?

On Saturday, 6 May 2017 at 17:13:29 UTC+5, user shree wrote:
> When using pre-existing box tiff pairs, you have to add a box with tab
> character to mark end of line and also add boxes with spaces after every
> word.
>
> You then need to generate the .lstmf files - please
> see training/tesstrain.sh for details.
>
> ShreeDevi
>
> On Sat, May 6, 2017 at 4:40 PM, bmwmine <[email protected]> wrote:
>
>> you are missing the .lstmf files

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/35ecde41-d654-408d-bd98-7de37fc6684a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
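
For anyone reading this thread later, here is a rough Python sketch of the advice quoted above (a space box after words, a tab box to mark the end of each line). It is only an illustration under several assumptions of mine, not something confirmed in this thread: the box line layout `symbol left bottom right top page`, the choice of coordinates for the space and tab boxes (gap between words, and a 1-pixel box past the right edge of the line), and the decision to emit a space box only between words and not after the last one. The names `add_space_and_tab_boxes` and `format_box` are hypothetical helpers, not part of Tesseract. In this sketch a punctuation mark that is attached to a word in the transcription gets no space box before it.

```python
from typing import List, Tuple

# One character box: (symbol, left, bottom, right, top).
CharBox = Tuple[str, int, int, int, int]
Word = List[CharBox]
Line = List[Word]


def format_box(symbol: str, left: int, bottom: int, right: int, top: int,
               page: int = 0) -> str:
    """Format one box-file line (assumed layout: symbol l b r t page)."""
    return f"{symbol} {left} {bottom} {right} {top} {page}"


def add_space_and_tab_boxes(lines: List[Line], page: int = 0) -> List[str]:
    """Emit box lines with a space box between words and a tab box per line.

    The coordinates used for the space and tab boxes are assumptions made
    for illustration; check them against what your training tools expect.
    """
    out: List[str] = []
    for line in lines:
        words = [w for w in line if w]  # skip empty words
        if not words:
            continue
        chars = [c for w in words for c in w]
        line_bottom = min(c[2] for c in chars)
        line_top = max(c[4] for c in chars)
        line_right = max(c[3] for c in chars)
        for i, word in enumerate(words):
            for sym, left, bottom, right, top in word:
                out.append(format_box(sym, left, bottom, right, top, page))
            if i + 1 < len(words):
                # Space box spans the gap between this word and the next one.
                gap_left = word[-1][3]
                gap_right = words[i + 1][0][1]
                out.append(format_box(" ", gap_left, line_bottom,
                                      gap_right, line_top, page))
        # Tab box marks the end of the text line.
        out.append(format_box("\t", line_right, line_bottom,
                              line_right + 1, line_top, page))
    return out


if __name__ == "__main__":
    # Two words, "ab" and "c," (the comma is attached to the second word).
    demo: Line = [
        [("a", 10, 5, 20, 25), ("b", 22, 5, 30, 25)],
        [("c", 40, 5, 50, 20), (",", 51, 2, 54, 8)],
    ]
    for box_line in add_space_and_tab_boxes([demo]):
        print(repr(box_line))
```

The resulting lines would still have to be written to a .box file next to the matching .tif before generating the .lstmf files as described in the quoted reply.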

