Have you tried just using the eng.traineddata directly with tess 3.04/ 3.05 / 4.0?
You don't need to train unless it is a very special case. You can try changing the dictionary dawg files with tess 3.0x. ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Wed, Apr 5, 2017 at 11:25 AM, <srns...@gmail.com> wrote: > I am trying to correct box files, so i can train tesseract. > > But I have got strange problem, > > > 1) Tesseract is recognizing some alphabet as two letters, then how to edit > the box file then.. (screenshot 1). > 2) Tesseract is not recognizing some alphabets so how to edit the box file > then.. (screenshot 2). > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To post to this group, send email to tesseract-ocr@googlegroups.com. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/8acd28ca-fa7f-4be6-a293-ec3008ffd288% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/8acd28ca-fa7f-4be6-a293-ec3008ffd288%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduX5RSr0myJhivnXc50KzU0H5KN2Mghv6k6COkcp8%2BBELQ%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.