Sorry. Just figured it out. 1. should have downloaded "tesseract-2.00.<lang>.tar.gz " language pack instead of boxtiff-2.01.<lang>.tar.gz 2. since I was installing it under linux, i needed to put the the language pack first in tesserac package prior to compiling
Source: http://code.google.com/p/tesseract-ocr/wiki/ReadMe On 19 Mai, 13:29, denis56 <[email protected]> wrote: > His, > > just downloaded tesseract-2.01 and cannot get rid of this error > "Unable to load unicharset file /home/tesseract/share/tessdata/ > eng.unicharset" > > I have downloaded english language pack > fromhttp://code.google.com/p/tesseract-ocr/downloads/detail?name=boxtiff-... > and tried copying both "eng" folder and its contents to /home/ > tesseract/share/tessdata/ > but the error does not go away. language pack does not contain file > "eng.unicharset" and the one that was in tesseract distribution was > empty. > > Could anyone please suggest how to resolve the issue, > thanks. > > ps: read README, but have not found clear instructions. --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en -~----------~----~----~----~------~----~------~--~---

