[Bug-ocrad] Re: Feature request: numeric charset

Antonio Diaz Diaz Wed, 08 Jun 2005 06:54:52 -0700

Hello manfred.

Yes, ocrad will have some day options like "--charset=numeric", or, fortexts without numbers, "--charset=alphabetic". Also an user-definedcharset will probably be implemented.

Of course, it will be implemented sooner if someone offers to sponsorit. ;-)



Regards,
Antonio.


Manfred Schwarb wrote:

trying to recognize numbers in tables, I stumbled across
the usual OCR hassle:

Zero is recognized as "O" or "o", One is recognized aslowercase "L" or uppercase "i".

I think ocrad is doing it's best, and the results are great.
Nevertheless there are such mis-recognitions, inevitable, I think.

This could be avoided it there is a "--charset=numbers" or similar,
which restricts the charset to [0123456789], and perhaps [+-].

Alternatively, one could even think of an option
  --charset="0123456789", i.e. a list of characters out of the
ascii character set.

What do you think?



_______________________________________________
Bug-ocrad mailing list
[email protected]
http://lists.gnu.org/mailman/listinfo/bug-ocrad

[Bug-ocrad] Re: Feature request: numeric charset

Reply via email to