On Wed, Aug 22, 2012 at 7:53 PM, Nick White <[email protected]> wrote:
> On Wed, Aug 22, 2012 at 09:43:10AM -0700, Jani Monoses wrote:
>> If I only do this I get:
>>
>> Re-initializing document dictionary...
>> Error: word 'aerobuz/P' not in DAWG after adding it
>> Error: failed to load /usr/share/tesseract-ocr/tessdata/ron.user-words
>>
>> So I need to do the words2dawg and recombination commands sequence as you
>> suggested in your initial reply?
>
> I believe the .user-words file should just be a plain UTF-8 text
> file, with one word per line. Is that what you're using?

Yes, the dictionary file provided by the hunspell-ro package in Ubuntu.

http://paste.ubuntu.com/1161140/

It is UTF-8 from what I can tell.

> The wordlist2dawg command is only used for the main dictionaries;
> user-words is different.
>
> Nick

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to