On Wed, Nov 23, 2011 at 11:14 AM, Örs <[email protected]> wrote:

> Hi there,
>
> I would like to run tesseract on Ubuntu 11.10 under gImageReader. I
> have downloaded hun.traineddata.gz

hun.traineddata.gz is tesseract 3.0x language data file.


> file and extracted it into the /usr/
> share/tesseract-ocr/tessdata  The file shows as a PCX image. When I
> run gImageReader I get the error: Unable to load unicharset file /usr/
> share/tesseract-ocr/tessdata/hun.unicharset


This indicate that your tessseract executable is 2.0x version. It is not
able to use 3.0x language file. There is no official hungarian support for
2.0x version.
Solution: please upgrade your tesseract to current version (maybe you will
need to compile it from source).


>  But how do I get this
> file?
>
> Any help would be highly appreciated.
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to