Try training with a larger number of fonts and see whether that improves the result.
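As a rough sketch of what that could look like, the snippet below builds one `text2image` command per font and counts how often each checkbox symbol appears in a training text. The font names other than DejaVu Sans, the file names, the output layout, and both helper functions are assumptions for illustration only. Each font would need to actually contain the ☐ ☑ ☒ glyphs (U+2610 to U+2612), so please check that before training.

```python
import shlex

# Checkbox glyphs seen in the extracted text: U+2610, U+2611, U+2612.
CHECKBOXES = "\u2610\u2611\u2612"


def symbol_counts(text):
    """Count how often each checkbox glyph occurs in the training text."""
    return {symbol: text.count(symbol) for symbol in CHECKBOXES}


def build_text2image_commands(fonts, training_text, out_dir, fonts_dir):
    """Return one text2image argument list per font (hypothetical helper)."""
    commands = []
    for font in fonts:
        # Output naming is an assumption: lang.fontname.exp0
        slug = font.replace(" ", "_")
        outputbase = f"{out_dir}/eng.{slug}.exp0"
        commands.append([
            "text2image",
            f"--text={training_text}",
            f"--outputbase={outputbase}",
            f"--font={font}",
            f"--fonts_dir={fonts_dir}",
        ])
    return commands


if __name__ == "__main__":
    # Placeholder font list; only "DejaVu Sans" comes from the original post.
    fonts = ["DejaVu Sans", "FreeSerif", "FreeMono"]
    for cmd in build_text2image_commands(
        fonts, "eng.training_text", "train_out", "/usr/share/fonts"
    ):
        # Commands are only printed here, not executed.
        print(shlex.join(cmd))
```

If a symbol count comes back much lower for one checkbox than for the others, adding more lines containing that symbol to the training text may be worth trying as well.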
On Tue 3 Apr, 2018, 2:29 PM Apoorv Khanna, <apoorvkhann...@gmail.com> wrote:

> Hi all,
>
> I am able to extract a few check boxes after fine-tuning the English model,
> but Tesseract is not able to extract all of the check boxes.
>
> Thanks in advance
>
> Version used: *tesseract 4 beta*
> Font used for training: *Dejavu Sans*
> Number of symbols inserted in the training text: 14 each
>
> *Extracted text:*
> ☐not reported wnot reported zpnot reported
> cno Byes tno ☒yes ☐no ☑pyes
> not reported not reported ☐not reported
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/78dcd45b-eb3a-441c-8800-f056285998f4%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/78dcd45b-eb3a-441c-8800-f056285998f4%40googlegroups.com?utm_medium=email&utm_source=footer>.
> For more options, visit https://groups.google.com/d/optout.