Ok, for anybody else that may get this error, I found a solution. I don't remember reading in any of the docs that you need to keep the language files in a directory named tessdata. But I noticed that all of the different data files I was downloading from all over the place had the files in a directory named tessdata, so I tried putting that file in the directory where I was doing my tests, and sure enough it now works. The misleading thing is that the error message in the logs includes a path to the eng.unicharset file that does NOT include the tessdata directory.
On Apr 21, 10:22 pm, Jake Munson <[email protected]> wrote: > I too am getting this error, but I DO have the eng.unicharset file in > the same directory as the executable, and it is a valid file (at least > I think it is). I can open the file and I see a bunch of lines like > this: > t 3 > h 3 > r 3 > > So what else am I doing wrong? I am using the precompiled windows exe > that is on the project Downloads page. I so got the 2.00 english data > files from that page. Is there maybe a bug in that executable > currently available from the Downloads section? > > On Apr 3, 10:43 am, paulfeakins <[email protected]> wrote: > > > Yep, you have to download the "eng" language and put all the files > > that begin with "eng." in to your tessdata directory. > > > Tesseract doesn't come with any languages installed by default so you > > have to add English first. It uses that language to help create your > > box files. > > > Hope that helps? > > > On 2 Apr, 22:14, roxaz <[email protected]> wrote: > > > > tesseract map.tiff fontfile batch.nochop makebox command does not work > > > for me somehow. i get this error in log: > > >Unable to load unicharset fileC:/Users/roxaz/Desktop/numrec/tesseract/ > > > bin.dbg6/tessdata/eng.unicharset > > > > doesnt help if i create that file too. how to fix this? --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en -~----------~----~----~----~------~----~------~--~---

