Your image is 96 dpi. Increase it to 300 dpi and try again.
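
To put a number on that: going from 96 to 300 dpi means resampling by 300/96 = 3.125 in each direction. Changing only the dpi tag in the file would leave the pixel data as it was. A minimal sketch of the arithmetic follows; `Size`, `dpiScale` and `resampledSize` are names made up for this example, and the resampling itself is left to whatever image library you use.

```cpp
#include <cmath>

// Pixel dimensions of an image (hypothetical helper type for this sketch).
struct Size {
    int width;
    int height;
};

// Factor by which each dimension must grow to go from currentDpi to targetDpi.
double dpiScale(double currentDpi, double targetDpi) {
    return targetDpi / currentDpi;
}

// Pixel size after resampling so the same physical area is covered at targetDpi.
Size resampledSize(Size original, double currentDpi, double targetDpi) {
    const double s = dpiScale(currentDpi, targetDpi);
    return Size{static_cast<int>(std::lround(original.width * s)),
                static_cast<int>(std::lround(original.height * s))};
}
```

For instance, a 960 x 480 image at 96 dpi would become 3000 x 1500 pixels at 300 dpi.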

Preprocess the image to remove the boxes around letters, if possible.
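
One possible way to do that, as a rough sketch only: on a binarized image, erase every horizontal or vertical run of ink pixels that is longer than any stroke of a digit could be. The `Bitmap` type, the function name `removeLongRuns` and the `minRun` parameter are assumptions made for this example, not part of Tesseract. It also assumes the box lines are straight and axis-aligned.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Binarized image: 1 = ink, 0 = background. All rows must have equal length.
using Bitmap = std::vector<std::vector<std::uint8_t>>;

// Returns a copy of img with every horizontal or vertical run of ink
// that is at least minRun pixels long erased. Runs are measured on the
// original image, so erasing a row does not shorten a crossing column.
Bitmap removeLongRuns(const Bitmap& img, std::size_t minRun) {
    Bitmap out = img;
    const std::size_t rows = img.size();
    const std::size_t cols = rows ? img[0].size() : 0;

    // Horizontal runs, one row at a time.
    for (std::size_t r = 0; r < rows; ++r) {
        std::size_t c = 0;
        while (c < cols) {
            if (!img[r][c]) { ++c; continue; }
            const std::size_t start = c;
            while (c < cols && img[r][c]) ++c;
            if (c - start >= minRun)
                for (std::size_t k = start; k < c; ++k) out[r][k] = 0;
        }
    }

    // Vertical runs, one column at a time.
    for (std::size_t c = 0; c < cols; ++c) {
        std::size_t r = 0;
        while (r < rows) {
            if (!img[r][c]) { ++r; continue; }
            const std::size_t start = r;
            while (r < rows && img[r][c]) ++r;
            if (r - start >= minRun)
                for (std::size_t k = start; k < r; ++k) out[k][c] = 0;
        }
    }
    return out;
}
```

Pick `minRun` larger than the tallest digit, otherwise the vertical stroke of a "1" may be erased along with the box.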

See https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality
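
On the confidence question: if your build ships the `tsv` config, adding `tsv` at the end of the command should print one row per word with a `conf` column (0 to 100, so your `0.9` threshold becomes 90). Below is a minimal sketch that filters such output in C++. The `Word` struct and `confidentWords` function are names invented for this example, and it assumes the usual 12-column layout with `conf` in column 11 and `text` in column 12.

```cpp
#include <sstream>
#include <stdexcept>
#include <string>
#include <vector>

// One recognized word and its confidence (0..100), as read from TSV output.
struct Word {
    std::string text;
    double conf;
};

// Keeps only word-level rows (level == 5) whose confidence is >= minConf.
// Assumes the 12-column TSV layout: ..., conf (11th column), text (12th column).
std::vector<Word> confidentWords(const std::string& tsv, double minConf) {
    std::vector<Word> kept;
    std::istringstream lines(tsv);
    std::string line;
    while (std::getline(lines, line)) {
        if (!line.empty() && line.back() == '\r') line.pop_back(); // Windows line ends

        // Split the row on tabs.
        std::vector<std::string> cols;
        std::istringstream fields(line);
        std::string field;
        while (std::getline(fields, field, '\t')) cols.push_back(field);

        // Skip the header, non-word rows, and rows without text.
        if (cols.size() < 12 || cols[0] != "5") continue;

        try {
            const double conf = std::stod(cols[10]);
            if (conf >= minConf) kept.push_back({cols[11], conf});
        } catch (const std::exception&) {
            continue; // unparsable confidence value, ignore the row
        }
    }
    return kept;
}
```

This works at word level only. If you need a value per character, the C++ API's `ResultIterator::Confidence` with `RIL_SYMBOL` is the place to look.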

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Fri, Oct 20, 2017 at 1:24 PM, 朱裕清 <[email protected]> wrote:

> This is my target image:
>
> ![target image](https://i.stack.imgur.com/UYMZJ.png)
>
> Actually my question is similar to [this post](https://stackoverflow.com/questions/4944830/how-to-make-tesseract-to-recognize-only-numbers-when-they-are-mixed-with-letter),
> but the answers there go in a different direction. I just want to get
> the digits that are recognized with a high degree of confidence. For
> example, I can do this with another language:
>
> ![](https://i.stack.imgur.com/rF1gP.png)
>
> Then I can keep only the digits whose confidence is above a threshold of
> `0.9`. Now I would like to do the same with *Tesseract*.
>
> First, I trained a *number.traineddata* just for recognizing numbers. You
> can get it [here](https://1drv.ms/u/s!Aumb0ijJibxOi1KVXFjwDzOVRQrm).
>
>
> tesseract.exe target.jpg stdout -l number --oem 0 -psm 6
>
>
> ![](https://i.stack.imgur.com/OzgBS.png)
>
> Note that this returns all digits, both high-confidence and
> low-confidence ones. Can Tesseract recognize the numbers and also report
> the confidence of each one? I cannot find any information on how to do
> this. If *Tesseract* cannot do it, is there another **C++**-based method
> that can? Could anyone give me some pointers?
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/273d9f86-39ce-42fe-8934-781f2103e4fa%
> 40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/273d9f86-39ce-42fe-8934-781f2103e4fa%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>
