On Saturday, March 5, 2016 at 5:11:55 AM UTC-5, Gunasekaran Velu wrote:
>
>
> >tesseract.exe Underline.png Underline -l eng -psm 1
>
> Result: This is underline word @
>
> Does it possible to do OCR recognition for underlined text/word on the 
> image? or some image processing need to apply on the image?
>
> Attached sample image.
>

Tesseract knows how to recognize underlined text, as you can see from that 
fact that it got "underline" correct in your example. For some reason it's 
getting confused by the underlined word "test", perhaps because it's at the 
end of the line?

It could potentially represent a bug, but I'd try to recreate it with a 
less artificial example. Of course, pre-processing would improve the 
situation and removing underlines should be that hard to do.

Tom 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/50710944-745b-41c8-98ef-c4f365d904a6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to