This is similar to my problem. After fine-tuning, the special characters (the newly trained data) are recognized well, but normal characters and words are now recognized incorrectly.
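
One way to put a number on this kind of regression is to compare the character error rate on a set of special-character samples and on a set of ordinary text, before and after fine-tuning. Below is a minimal sketch in Python, assuming you already have the ground-truth text and the OCR output for each test image as strings. The function names and the sample strings are hypothetical and are not part of Tesseract.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: insertions, deletions and substitutions cost 1."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a character of ref
                            curr[j - 1] + 1,      # insert a character of hyp
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def char_error_rate(pairs) -> float:
    """pairs: iterable of (ground_truth, ocr_output) strings."""
    pairs = list(pairs)  # allow generators, since we iterate twice
    errors = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    total = sum(len(ref) for ref, _ in pairs)
    return errors / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical samples: (ground truth, OCR output) per test line.
    special = [("10:30", "10:30")]
    normal = [("hello world", "hel1o wor1d")]
    print("special CER:", char_error_rate(special))
    print("normal CER:", char_error_rate(normal))
```

Running this with the output of both the original and the fine-tuned model shows whether the gain on the special characters was paid for with a loss on normal text.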

On Sat, 27 Oct 2018 at 11:29, Sreehari B S <[email protected]> wrote:

> Hi,
>
> Something similar happened when I fine-tuned for ":". When doing OCR, it
> recognized some ":" as "1", so I fine-tuned for that character.
>
> Now when I OCR ":", it works well. But when I OCR some real data, it is
> now worse than before.
>
> * I trained on the best eng.traineddata.
> * I created the boxes using the tesseract makebox command and then edited
>   them using jTessBoxEditor. But the box dimensions were not perfect.
>
> Note: I trained from a real image. (Do I really need to edit the
> coordinates by hand to adjust the dimensions?)
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/f89a3852-3d89-477f-ad58-6cf2cea12aab%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

