Thanks for all the clarification. Since I did not want to compile it from the source I rather added the repository 'ppa:jsevi83/personal' and so I successfully upgraded to the current version of tesseract and also installed the required language file.
On Nov 24, 8:21 am, zdenko podobny <[email protected]> wrote: > On Wed, Nov 23, 2011 at 11:14 AM, Örs <[email protected]> wrote: > > Hi there, > > > I would like to run tesseract on Ubuntu 11.10 under gImageReader. I > > have downloaded hun.traineddata.gz > > hun.traineddata.gz is tesseract 3.0x language data file. > > > file and extracted it into the /usr/ > > share/tesseract-ocr/tessdata The file shows as a PCX image. When I > > run gImageReader I get the error: Unable to load unicharset file /usr/ > > share/tesseract-ocr/tessdata/hun.unicharset > > This indicate that your tessseract executable is 2.0x version. It is not > able to use 3.0x language file. There is no official hungarian support for > 2.0x version. > Solution: please upgrade your tesseract to current version (maybe you will > need to compile it from source). > > > > > > > > > But how do I get this > > file? > > > Any help would be highly appreciated. > > > -- > > You received this message because you are subscribed to the Google > > Groups "tesseract-ocr" group. > > To post to this group, send email to [email protected] > > To unsubscribe from this group, send email to > > [email protected] > > For more options, visit this group at > >http://groups.google.com/group/tesseract-ocr?hl=en -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

