OK. I got it. My text editor (Text Wrangler) allowed me to change my newline format from "Mac (CR)" to "Unix (LF)". I had read in another post that it didn't matter, but I guess it does. Thanks
On Wednesday, May 1, 2013 4:55:50 PM UTC-5, matthew christy wrote: > > I have gathered together a list of frequent words and a larger word list > and I'm ready to start creating my dictionary to see if it helps my OCR > results. However, when I run the wordlist2dawg command I get this: > > $>wordlist2dawg frequent_words_list emop.freq-dawg emop.unicharset > Loading unicharset from 'emop.unicharset' > Reading word list from 'frequent_words_list' > Reducing Trie to SquishedDawg > Dawg is empty, skip producing the output file > > The end result is that nothing has happened. I get no dawg files produced. > The same thing happens when I run this command on the word_list file I have > as well. Without any output or error messages and with no real > documentation, I can't figure out what the problem is. > > Has anyone else seen this and figured out how to fix it? > > I have been able to create dictionaries in the past with word lists that I > got from other Tesseract languages, so I think my install/implementation is > good. I'm running Tesseract locally on a Mac with OS 10.8.3. > > Thanks > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

