Word White and Blacklisting with RegEx

Martin Roth Mon, 21 May 2012 09:08:46 -0700

I'm interested in implementing a word whitelist (or blacklist) described by
a regular expression. In my application I generally only need to detect
single words with a predefined structure. Character whitelists definitely
help, but I wonder whether a word-level whitelist would work even better.
I've searched the forums in detail and looked through some of the code,
but haven't been able to find an existing solution. Am I missing
something? Does anyone with more experience know A) whether this would be
difficult to implement, and B) where to start looking? For instance,
where are accept or reject decisions made at the word level?
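
To make the idea concrete, here is a minimal sketch of a regex-based word
whitelist (or blacklist) applied as a post-processing step to words that
have already been recognized. It does not touch Tesseract's internals and
says nothing about where the engine makes its own decisions. The pattern
(two uppercase letters, a hyphen, four digits) and the names WORD_PATTERN
and filter_words are hypothetical, chosen only for illustration.

```python
import re

# Hypothetical word structure: two uppercase letters, a hyphen, four digits,
# e.g. "AB-1234". Replace with the structure your application expects.
WORD_PATTERN = re.compile(r"[A-Z]{2}-\d{4}")


def filter_words(words, pattern=WORD_PATTERN, blacklist=False):
    """Filter recognized words against a regular expression.

    Whitelist mode (default): keep words that match the pattern in full.
    Blacklist mode: keep words that do NOT match the pattern in full.
    """
    kept = []
    for word in words:
        matches = pattern.fullmatch(word) is not None
        if matches != blacklist:
            kept.append(word)
    return kept


# Example candidates, standing in for words returned by an OCR pass.
candidates = ["AB-1234", "A8-1234", "XY-0001", "hello"]
print(filter_words(candidates))
print(filter_words(candidates, blacklist=True))
```

Filtering after recognition is the simplest variant. Having the engine
itself prefer matching words during recognition is the harder part, and
that is what the question above is really about.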


-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
