On Wed, Aug 22, 2012 at 7:53 PM, Nick White <[email protected]> wrote: > On Wed, Aug 22, 2012 at 09:43:10AM -0700, Jani Monoses wrote: >> If I only do this I get: >> >> Re-initializing document dictionary... >> Error: word 'aerobuz/P' not in DAWG after adding it >> Error: failed to load /usr/share/tesseract-ocr/tessdata/ron.user-words >> >> So I need to do the words2dawg and recombination commands sequence as you >> suggested in your initial reply? > > I believe the .user-words file should just be a plain UTF-8 text > file, with one word per line. Is that what you're using?
Yes, the dictionary file provided by the hunspell-ro package in Ubuntu. http://paste.ubuntu.com/1161140/ It is UTF-8 from what I can tell. > The wordlist2dawg command is only used for the main dictionaries; > user-words is different. > > Nick -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

