Sorry for the late response, and thank you very much. Your solution helped me a lot.
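For reference, the filtering step asked about in the quoted thread might look like the sketch below. It assumes each symbol's text and confidence have already been collected through the API, for example with `ResultIterator::GetUTF8Text` and `ResultIterator::Confidence` at `tesseract::RIL_SYMBOL`. `Confidence` reports a value between 0 and 100, so the `0.9` threshold from the question becomes `90` here. `RecognizedSymbol`, `filter_by_confidence`, and `join_text` are hypothetical names made up for illustration; they are not part of Tesseract.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical record for one recognized symbol; this is not a Tesseract type.
struct RecognizedSymbol {
    std::string text;  // UTF-8 text of the symbol, e.g. "7"
    float confidence;  // assumed to be on the 0-100 scale used by ResultIterator::Confidence
};

// Keep only the symbols whose confidence is at least min_confidence (0-100).
std::vector<RecognizedSymbol> filter_by_confidence(
        const std::vector<RecognizedSymbol>& symbols, float min_confidence) {
    assert(min_confidence >= 0.0f && min_confidence <= 100.0f);
    std::vector<RecognizedSymbol> kept;
    for (const auto& s : symbols) {
        if (s.confidence >= min_confidence) {
            kept.push_back(s);
        }
    }
    return kept;
}

// Concatenate the text of the given symbols in their original order.
std::string join_text(const std::vector<RecognizedSymbol>& symbols) {
    std::string out;
    for (const auto& s : symbols) {
        out += s.text;
    }
    return out;
}
```

Calling `join_text(filter_by_confidence(symbols, 90.0f))` would then give only the digits recognized with at least 90% confidence. Whether 90 is a sensible cut-off depends on the image and the trained data, so treat it as a starting point.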
On Saturday, October 21, 2017 at 11:57:48 AM UTC+8, Art Rhyno wrote:
>
> Maybe look at the API [1]. The output of the attached program shows
> there's a lot of detail that can be gleaned at this level, including the
> confidence of the selected character and that of the other candidates.
> Compiling against tesseract on Ubuntu, at least, is fairly straightforward.
> I don't know about Windows or OS X.
>
> art
>
> ---
>
> 1. https://github.com/tesseract-ocr/tesseract/wiki/APIExample
>
> *From:* [email protected] [mailto:[email protected]] *On Behalf Of* ???
> *Sent:* Friday, October 20, 2017 4:18 AM
> *To:* tesseract-ocr <[email protected]>
> *Subject:* Re: [tesseract-ocr] How to get digital and the confidence?
>
> I think that is another question. In this question I want to filter the
> recognized characters by confidence, but I don't know how to get the
> confidence with *Tesseract*.
>
> On Friday, October 20, 2017 at 4:13:35 PM UTC+8, shree wrote:
>
> Your image is 96 dpi. Increase the dpi to 300 and try.
>
> Preprocess the image to remove the boxes around letters, if possible.
>
> See https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality
>
> ShreeDevi
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Fri, Oct 20, 2017 at 1:24 PM, 朱裕清 <[email protected]> wrote:
>
> This is my target image:
>
> 
>
> Actually my question is similar to [this post](https://stackoverflow.com/questions/4944830/how-to-make-tesseract-to-recognize-only-numbers-when-they-are-mixed-with-letter).
>
> But I don't know why the answers there lead in another direction. I
> just want to get the digits that are recognized with a high degree of
> confidence. For example, I can do this with another language:
>
> 
>
> Then I can keep only the results whose confidence is above a threshold
> of `0.9`. But now I hope to use *Tesseract* to do this.
>
> First, I trained a *number.traineddata* just for recognizing numbers.
> You can get it [here](https://1drv.ms/u/s!Aumb0ijJibxOi1KVXFjwDzOVRQrm).
>
>     tesseract.exe target.jpg stdout -l number --oem 0 -psm 6
>
> 
>
> Note that I get all the digits, both the high-confidence and the
> low-confidence ones. Can we recognize the numbers and also get the degree
> of confidence for each? I cannot find any information on how to implement
> this. If *Tesseract* cannot do it, is there any other **C++**-based method
> that can achieve my target? Could anyone give me some information about it?
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/273d9f86-39ce-42fe-8934-781f2103e4fa%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

