OCR on a small set of char but with other letters I do not care

Richard Wang Sun, 16 Feb 2014 23:18:13 -0800

hello, 

I am trying to perform OCR on merchandise label with Chinese chars.
Here I have uploaded an example image. The character string I really
care about is "MX1251C".


Most of such string is composed by digit number and A-Z characters,
so I can configure the range of characters. Also, they are not English
words, so I can disable use of dictionary. 

As you can see, most of such image will contain a lot of Chinese characters,
but I do not care about them. 

My question is that if the existence of these Chinese characters make 
my problem more difficult, compared to the case if they are English letters.

If I want to speed up the recognition process and accuracy, how can I take
advantage of the special properties of my problem here?

thanks.
Richard

<https://lh4.googleusercontent.com/-_hlnACBMN5o/UwGViQ65HZI/AAAAAAAAAEs/D6Y9i60SrSE/s1600/IMG_20140215_152033.jpg>
I only need to recognize and extract the style number. 

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

OCR on a small set of char but with other letters I do not care

Reply via email to