Hello, As far as I see Tesseract doesn't support recognizing documents using several character sets (latin and cyrillic for example). Commercial OCR systems usually do. Are there technical limitations preventing training Tesseract for the accurate recognition of multi-language texts? If there aren't who (apart from me) is going to take on this task?
-- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.

