I think that is another question. I want to filter recognized characters by confidence in this question but don't know how to get the confidence with *Tesseract*.
在 2017年10月20日星期五 UTC+8下午4:13:35,shree写道: > > Your image is 96 dpi. Increase the dpi to 300 and try. > > Preprocess the image to remove the boxes around letters, if possible. > > See https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality > > ShreeDevi > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > > On Fri, Oct 20, 2017 at 1:24 PM, 朱裕清 <[email protected] <javascript:>> > wrote: > >> This is my target image: >> >>  >> >> Actually my question is similar to [this post]( >> https://stackoverflow.com/questions/4944830/how-to-make-tesseract-to-recognize-only-numbers-when-they-are-mixed-with-letter). >> >> But I don't know why the following answer will lead to another direction. I >> mean, I just hope to get those digits with high degree of confidence. Such >> as I can do this with another language >> >>  >> >> Then I can just keep those degree of confidence with a threshold `0.9`. >> But now I hope to use *Tesseract* to do this. >> >> First, I train a *number.traineddata* just for recognizing number. You >> can get it [here](https://1drv.ms/u/s!Aumb0ijJibxOi1KVXFjwDzOVRQrm). >> >> >> tesseract.exe target.jpg stdout -l number --oem 0 -psm 6 >> >> >>  >> >> Note I will get all digits which include high confidence and low >> confidence. Can we recognize the number and get the degree of its >> confidence? I cannot find any information to implement it. If *Tesseract* >> cannot do it. Any other method based on **C++** can implement my target? >> Could anyone can give me some information for it? >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected] <javascript:>. >> To post to this group, send email to [email protected] >> <javascript:>. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/273d9f86-39ce-42fe-8934-781f2103e4fa%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/273d9f86-39ce-42fe-8934-781f2103e4fa%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/790f9169-e724-49b2-b24a-320a10fea6f4%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

