On Wed, Nov 23, 2011 at 11:14 AM, Örs <[email protected]> wrote:
> Hi there,
>
> I would like to run tesseract on Ubuntu 11.10 under gImageReader. I
> have downloaded hun.traineddata.gz

hun.traineddata.gz is a tesseract 3.0x language data file.

> file and extracted it into the /usr/share/tesseract-ocr/tessdata
> The file shows as a PCX image. When I run gImageReader I get the error:
> Unable to load unicharset file /usr/share/tesseract-ocr/tessdata/hun.unicharset

This indicates that your tesseract executable is a 2.0x version. It cannot use a 3.0x language file, and there is no official Hungarian support for the 2.0x versions.

Solution: please upgrade your tesseract to the current version (you may need to compile it from source).

> But how do I get this
> file?
>
> Any help would be highly appreciated.
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
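
To check which generation of language data is sitting in a tessdata directory, something like the following might help. It is only a minimal sketch: the function name `tessdata_format` is made up for illustration, and the rule it applies is an assumption taken from this thread, namely that 3.0x data ships as a single `<lang>.traineddata` file while a 2.0x executable looks for `<lang>.unicharset`.

```python
import os


def tessdata_format(tessdata_dir, lang):
    """Guess which tesseract generation the data for `lang` was built for.

    Assumption (from this thread, not from the tesseract sources):
    3.0x data is one <lang>.traineddata file, and a 2.0x executable
    expects a <lang>.unicharset file.
    Returns "3.0x", "2.0x", or None when neither file is present.
    """
    names = set(os.listdir(tessdata_dir))
    if lang + ".traineddata" in names:
        return "3.0x"
    if lang + ".unicharset" in names:
        return "2.0x"
    return None
```

Calling `tessdata_format("/usr/share/tesseract-ocr/tessdata", "hun")` on the setup described above should report 3.0x data, provided the extracted file is named `hun.traineddata`. That data does not match an executable that asks for `hun.unicharset`.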

