I have gathered together a list of frequent words and a larger word list and I'm ready to start creating my dictionary to see if it helps my OCR results. However, when I run the wordlist2dawg command I get this:
$>wordlist2dawg frequent_words_list emop.freq-dawg emop.unicharset
Loading unicharset from 'emop.unicharset'
Reading word list from 'frequent_words_list'
Reducing Trie to SquishedDawg
Dawg is empty, skip producing the output file

The end result is that nothing has happened: no dawg files are produced. The same thing happens when I run this command on my word_list file as well. With no error messages beyond the output above and no real documentation, I can't figure out what the problem is. Has anyone else seen this and figured out how to fix it?

I have been able to create dictionaries in the past with word lists that I got from other Tesseract languages, so I think my install/implementation is good. I'm running Tesseract locally on a Mac with OS 10.8.3.

Thanks
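One thing that may be worth ruling out is whether the words contain characters that are missing from emop.unicharset. This is only a guess at a possible cause, not a confirmed explanation of the "Dawg is empty" message. The sketch below is a rough check under two assumptions: that the first line of the unicharset file is the entry count, and that every later line starts with the unichar followed by a space. The function names are made up for illustration and are not part of Tesseract. It compares character by character, so it will wrongly flag words that rely on multi-character unichars such as ligatures.

```python
def load_unichars(unicharset_lines):
    """Collect the unichar strings from the lines of a unicharset file.

    Assumption: line 1 is the entry count, and each following line
    begins with the unichar, then a space, then its properties.
    """
    unichars = set()
    for line in unicharset_lines[1:]:
        line = line.rstrip("\n")
        if not line:
            continue
        # The unichar is everything before the first space.
        unichars.add(line.split(" ", 1)[0])
    return unichars


def words_with_unknown_chars(words, unichars):
    """Map each word to the sorted characters not found in unichars.

    Words whose characters are all present are left out of the result.
    """
    bad = {}
    for word in words:
        word = word.strip()
        if not word:
            continue
        missing = sorted({ch for ch in word if ch not in unichars})
        if missing:
            bad[word] = missing
    return bad
```

To try it on the files from the command above, read each one with open(path, encoding="utf-8") and pass the result of readlines() to these functions. If nearly every word is reported, the word list and the unicharset probably do not match, or the word list is not in the encoding the script expects.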

