I am not sure how the user-words file works myself. I got the best results 
when I built a DAWG word list instead (see 
https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 under 
"Dictionary Data"). In short, if you have a list of words to use, run 

    wordlist2dawg frequent_words_list lang.freq-dawg lang.unicharset

and include the resulting lang.freq-dawg file when you run combine_tessdata.
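
To make the two steps above concrete, here is a minimal sketch in Python that 
cleans a word list and assembles the command lines, without running them. The 
helper names (write_word_list, dawg_commands) are hypothetical, and it assumes 
wordlist2dawg and combine_tessdata are on your PATH and that lang.unicharset 
and the other lang.* training files already sit in the working directory, as 
described on the wiki page.

```python
from pathlib import Path


def write_word_list(words, path):
    """Write one word per line, dropping blanks and duplicates (first-seen order)."""
    seen = set()
    cleaned = []
    for word in words:
        word = word.strip()
        if word and word not in seen:
            seen.add(word)
            cleaned.append(word)
    Path(path).write_text("\n".join(cleaned) + "\n", encoding="utf-8")
    return cleaned


def dawg_commands(lang, word_list_path):
    """Return the two command lines from the reply as argument lists.

    Assumption: the argument order follows the wiki example
    (word list, output dawg, unicharset), and combine_tessdata
    takes the "lang." prefix of the files to combine.
    """
    return [
        ["wordlist2dawg", str(word_list_path),
         f"{lang}.freq-dawg", f"{lang}.unicharset"],
        ["combine_tessdata", f"{lang}."],
    ]


# Print the commands so they can be checked before running them by hand.
for cmd in dawg_commands("eng", "frequent_words_list"):
    print(" ".join(cmd))
```

Each list can be passed to subprocess.run(cmd, check=True) once the printed 
commands look right; keeping them as lists avoids shell quoting problems with 
unusual file names.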

On Thursday, May 23, 2013 5:34:02 PM UTC+2, newbie wrote:
>
> Hello all,
>
> I have a question about the user word list in Tesseract (eng.user-words). I 
> have no idea how to tell whether it is working. Could anyone advise me 
> how to test that the user words are being applied? I would like to use 
> the user words, but I cannot see any effect, which confuses me. 
>
> One more question: Tesseract 3.02 allows multiple languages to be set 
> at init, e.g. eng+deu. But when I init Tesseract with eng.user-words, 
> it seems to require a user-words file for deu as well. 
> Am I correct? If so, how can we init with just one specified user-words list?
>
> Thanks.
>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
