Hi Nick,

I removed the bogus words from that file (it is a list of words plus
some suffix metadata, for the hunspell dictionary engine I guess), but
I still get errors. So the '/' character is not the cause.
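
For reference, that kind of cleanup can be sketched as below. This is only
a minimal sketch, not the exact steps used on this file. It assumes one
entry per line, that any hunspell flags follow the first '/' (as in
'aerobuz/P'), and that a digits-only line is the hunspell word count.
The function name strip_affix_flags is hypothetical.

```python
def strip_affix_flags(lines):
    """Return plain words, dropping hunspell-style '/FLAGS' suffixes.

    Assumptions (not confirmed by the thread): one entry per line,
    flags follow the first '/', and a digits-only line is the word
    count header that hunspell .dic files start with.
    """
    words = []
    for line in lines:
        entry = line.strip()
        # Skip blank lines and the assumed word-count header.
        if not entry or entry.isdigit():
            continue
        # Keep only the part before the first '/'.
        word = entry.split("/", 1)[0]
        if word:
            words.append(word)
    return words


# Example with the two words that appear in the error messages.
sample = ["2\n", "aerobuz/P\n", "altimetrie\n"]
print("\n".join(strip_affix_flags(sample)))
```

Since the error persists for 'altimetrie', which has no flags at all, a
cleanup like this would not be expected to make the error go away.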

$tesseract -l ron 005.tiff output uwo
Re-initializing document dictionary...
Error: word 'altimetrie' not in DAWG after adding it
Error: failed to load /usr/share/tesseract-ocr/tessdata/ron.user-words

$cat uwo
user_words_suffix    user-words

On Thu, Sep 6, 2012 at 3:26 PM, Nick White <[email protected]> wrote:
> On Wed, Aug 22, 2012 at 08:04:53PM +0300, Jani Monoses wrote:
>> On Wed, Aug 22, 2012 at 7:53 PM, Nick White <[email protected]> wrote:
>> > On Wed, Aug 22, 2012 at 09:43:10AM -0700, Jani Monoses wrote:
>> >> If I only do this I get:
>> >>
>> >> Re-initializing document dictionary...
>> >> Error: word 'aerobuz/P' not in DAWG after adding it
>> >> Error: failed to load /usr/share/tesseract-ocr/tessdata/ron.user-words
>
> Hi Jani,
>
> Did you ever work out what was causing this problem and fix it?
>
> If not I'll take another look and see if I have more luck
> tracking it down.
>
> Nick
