I'm interested in implementing a word white (or black) list described by a regular expression. In my application I generally only need to detect single words with a predefined structure. Character whitelists definitely help, but I can't help but wonder if a word whitelist would be even better. I've searched around the forums in detail, and also looked at the code to some degree, but haven't been able to find an existing solution. Am I blind? Does anyone with more experience know if A) this would be a difficult thing to implement, and B) where to start looking. For instance, where are reject or accept decisions made on a word-level?
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

