On Wed, Jan 22, 2014 at 03:17:29PM +0000, Nick White wrote:
> Anyway, I agree, someone should document the user-patterns stuff.
> I'll try to do so if I get time, but if anyone wants to look sooner,
> or offer their own experiences with it, do go ahead!

I knew I'd seen *some* documentation for this feature. Naturally I
found it straight after sending the previous email. As mentioned in
the main tesseract manpage, dict/trie.h documents the format; see
http://code.google.com/p/tesseract-ocr/source/browse/trunk/dict/trie.h?r=999#188

If I'm reading it correctly, you should be able to specify your
rules with something like this:

www.\n\*.com
(\d\d\d) \d\d\d-\d\d\d\d
$\d\*.\d\d
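
In case it helps anyone sanity-check a patterns file before handing it to tesseract, here is a rough sketch that translates the syntax into a Python regex. It is not how tesseract itself does the matching (that goes through the trie and the language's unicharset). The class table below is my reading of the trie.h comment, approximated with ASCII ranges, and I am assuming \* means "zero or more" of the preceding item. The function name user_pattern_to_regex is made up for this example.

```python
import re

# Character classes as I read them in the dict/trie.h comment.
# ASSUMPTION: ASCII approximations only; tesseract consults the unicharset.
CLASSES = {
    "c": "[A-Za-z]",             # alphabetic
    "d": "[0-9]",                # digit
    "n": "[A-Za-z0-9]",          # alphabetic or digit
    "p": r"[!-/:-@\[-`{-~]",     # ASCII punctuation
    "a": "[a-z]",                # lower case
    "A": "[A-Z]",                # upper case
}


def user_pattern_to_regex(pattern):
    """Translate one user-pattern line into a compiled Python regex.

    Hypothetical helper for checking patterns by hand, not part of tesseract.
    """
    parts = []
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if ch != "\\":
            # Ordinary character: match it literally.
            parts.append(re.escape(ch))
            i += 1
            continue
        if i + 1 >= len(pattern):
            raise ValueError("dangling backslash at end of pattern")
        nxt = pattern[i + 1]
        if nxt == "*":
            if not parts:
                raise ValueError("\\* has nothing to repeat")
            # ASSUMPTION: repetition is treated as zero or more.
            parts[-1] += "*"
        elif nxt == "\\":
            # An escaped backslash stands for a literal backslash.
            parts.append(re.escape("\\"))
        elif nxt in CLASSES:
            parts.append(CLASSES[nxt])
        else:
            raise ValueError("unknown escape: \\" + nxt)
        i += 2
    return re.compile("".join(parts))


if __name__ == "__main__":
    for line in [r"www.\n\*.com", r"(\d\d\d) \d\d\d-\d\d\d\d", r"$\d\*.\d\d"]:
        print(line, "->", user_pattern_to_regex(line).pattern)
```

Calling user_pattern_to_regex(r"$\d\*.\d\d").fullmatch("$12.50") gives a match, while "$12.5" does not. One caveat: if I remember the trie.h comment correctly, patterns are meant to contain non-whitespace characters only, so the phone-number pattern with a space in it may not behave as written and is worth testing.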

Hope that helps!

Nick

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

