Hi Romeo,

I can only answer a few of your questions, but I'll do that.

> 2) How do I preprocess the image? I was a bit surprised to find out that
> converting the image to grayscale before passing it to Tesseract results in
> different (and generally better) accuracy. Why? Also inverting the image works
> better for some text. What are the set of important transformations to play
> with?

Tesseract internally converts down to greyscale. I'd guess you saw
better results when you did that before passing to Tesseract as the
light/dark levels were better.

More generally, try to preprocess as much as possible before passing
to Tesseract. It sounds like you'd like to use Tesseract to
determine where text is in a busy image, which is going to be
tricky. You may well be able to do something like this using either
the API or hOCR output + script, by only accepting letters which
have high confidence (and hence ignoring speculative and incorrect
things based on noise elsewhere in the image).

It sounds like you're trying to move in that direction, which is
interesting, and I look forward to hearing more of how you get on.

Apologies I couldn't answer more of your questions. Hopefully
someone else on the list will be able to.

Nick

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to