On Wednesday, April 20, 2016 at 2:42:58 AM UTC-4, Jonas Pfannschmidt wrote:
>
> Hi,
>
> I'm trying to automate UI tests using OCR. The goal is to have a test 
> script with lines like: "click_text('Reports')" and it automatically clicks 
> on the button "Reports".
>
> It works quite well ... sometimes. I've attached two sample screen 
> captures. The text on 'works.png' gets recognized reasonably well, 
> 'fails.png' returns only garbage. Both images have been created 
> programmatically in the same way (capture screen, resize by factor 4, 
> convert to greyscale). Does anybody know why one works and the other 
> doesn't?
>

Well, for one thing, the image that works has a lot more dark text on it, 
whereas the one that doesn't not only has less text, but some of the text 
that it has is greyed out.

At the end of the day, Tesseract is going to be working on a bitonal image. 
Since you've got a non-traditional application, I'd think you'd want to 
control as much of the image preprocessing as possible to make sure it's 
getting done in a way that's appropriate for your application. So rather 
than stopping at greyscale, you should threshold and convert all the way 
down to bitonal.
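
As a rough sketch of what that thresholding step could look like, here is a 
pure-Python version that works on a greyscale image given as rows of 0-255 
values. Otsu's method is used to pick the cut-off, which is just one common 
choice and an assumption here, not something the original post prescribes. 
The names otsu_threshold and to_bitonal are made up for illustration.

```python
def otsu_threshold(pixels):
    """Return the grey level that maximizes between-class variance (Otsu)."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0      # number of pixels at or below the candidate threshold
    sum_dark = 0    # sum of grey levels of those pixels
    for t in range(256):
        w_dark += hist[t]
        if w_dark == 0:
            continue
        w_light = total - w_dark
        if w_light == 0:
            break
        sum_dark += t * hist[t]
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance, up to a constant factor of 1 / total**2.
        var = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def to_bitonal(rows):
    """Map every pixel to 0 (black) or 255 (white) using one global threshold."""
    flat = [p for row in rows for p in row]
    t = otsu_threshold(flat)
    return [[0 if p <= t else 255 for p in row] for row in rows]


if __name__ == "__main__":
    # Hypothetical 2x3 greyscale patch: dark text (30, 40), greyed-out
    # text (120), and a light background (200-220).
    grey = [[30, 40, 200],
            [120, 210, 220]]
    print(otsu_threshold([p for row in grey for p in row]))
    print(to_bitonal(grey))
```

A single global threshold may still lose greyed-out text if it sits close to 
the background level, so it is worth inspecting the bitonal output and 
trying a fixed, hand-picked threshold if the automatic one misbehaves.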

Tom 
