Re: Limited Output

paulfeakins Thu, 14 May 2009 02:48:35 -0700

Hi Pierpaolo,

I've certainly been able to recognise only digits, see the FAQ
heading, "How do I recognize only digits?":
http://code.google.com/p/tesseract-ocr/wiki/FAQ


That works fine for digits, but I have had problems trying to limit
tesseract to a certain set of characters, it seems that the whitelist
variable isn't read properly.

For example, say I create that text file mentioned in the FAQ and
change the content to:
tessedit_char_whitelist D123456789

It will still recognise a zero, even though it should use a D instead.
So I would guess it's hardcoded somewhere in the source.

Hope that helps,
Paul.

On May 14, 8:36 am, Pierpaolo Monaco <[email protected]>
wrote:
> I need to process some alphanumeric string.
> The main problem is that I have just letter and digit in my imput string but
> the result of the ocr process is sometimes a point o a bracket, etc...
> Symbols that never will be in my imput domain.
> How can I limit the output?
> If I want, for example (for semplicity), just digits for result what I have
> to do?
> Hope somebody can help me.
>
> Thanks.
>
> --
> -----------------------------
> Pierpaolo Monaco
> ----------------------------
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Re: Limited Output

Reply via email to