I wonder why lang.user-words gives more accurate results than
lang.freq-dawg and lang.word-dawg, even though both were built from
almost the same word_list file.
-------------------------------------------------------------------------
Case 1:
word_list.txt -> rename -> lang.user-words
Case 2:
word_list.txt (the same file used in case 1) -> rename ->
frequent_words_list.txt
-> wordlist2dawg frequent_words_list lang.freq-dawg lang.unicharset
and
wordlist2dawg words_list lang.word-dawg lang.unicharset
(this lang.word-dawg file only adds 2 words)
------------------------------------
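The two cases can be laid out as a small dry-run script. This is only a
sketch: the contents of word_list.txt are a hypothetical stand-in, the
temporary directory is an assumption, and the wordlist2dawg commands are
printed exactly as written above rather than executed. The one real check
it performs is that both cases start from byte-identical input.

```shell
#!/bin/sh
# Dry-run sketch of the two cases; wordlist2dawg is only printed, never run.
set -eu

# Work in a scratch directory (assumption, not part of the original setup).
workdir=$(mktemp -d)
cd "$workdir"

# Hypothetical stand-in for the real word_list.txt.
printf 'alpha\nbeta\ngamma\n' > word_list.txt

# Case 1: the word list is used directly as lang.user-words.
cp word_list.txt lang.user-words

# Case 2: the same list is renamed before being compiled into a DAWG.
cp word_list.txt frequent_words_list.txt

# Check that both cases really start from byte-identical input.
if cmp -s lang.user-words frequent_words_list.txt; then
    same_input=yes
else
    same_input=no
fi
echo "identical input: $same_input"

# Commands from case 2, kept as written in the post.
freq_cmd="wordlist2dawg frequent_words_list lang.freq-dawg lang.unicharset"
word_cmd="wordlist2dawg words_list lang.word-dawg lang.unicharset"
echo "$freq_cmd"
echo "$word_cmd"
```

If the comparison reports identical input, any difference in accuracy has to
come from how the files are used, not from the word list itself.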
I expected case 2 to give results better than, or at least the same as,
case 1. However, case 2 gave worse results than case 1, and I would
like to know why that happened.
Thank you for reading!