OK, for anybody else who gets this error, I found a solution.  I
don't remember reading in any of the docs that you need to keep the
language files in a directory named tessdata.  But I noticed that all
of the data files I was downloading from all over the place had the
files in a directory named tessdata, so I tried putting a tessdata
directory (with the language files inside it) in the directory where I
was doing my tests, and sure enough it now works.  The misleading thing
is that the error message in the logs includes a path to the
eng.unicharset file that does NOT include the tessdata directory.
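
In case it helps anyone check their layout before running tesseract,
here is a rough sketch in Python.  It is not part of Tesseract; the
function name is made up for illustration, and it assumes the language
files are looked up in a tessdata directory directly under the
directory you pass in.  eng.unicharset is the only file named in this
thread, so that is the only one checked by default.

```python
from pathlib import Path


def find_missing_language_files(base_dir, lang="eng", suffixes=("unicharset",)):
    """Return expected language files that are missing under base_dir/tessdata.

    Assumption: the files live in a subdirectory named "tessdata" and are
    named "<lang>.<suffix>", e.g. tessdata/eng.unicharset.
    """
    tessdata = Path(base_dir) / "tessdata"
    expected = [tessdata / f"{lang}.{suffix}" for suffix in suffixes]
    # Keep only the paths that do not exist as regular files.
    return [path for path in expected if not path.is_file()]
```

Call it with the directory you run your tests from; an empty list means
the files it looked for are where this sketch expects them.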

On Apr 21, 10:22 pm, Jake Munson <[email protected]> wrote:
> I too am getting this error, but I DO have the eng.unicharset file in
> the same directory as the executable, and it is a valid file (at least
> I think it is).  I can open the file and I see a bunch of lines like
> this:
> t 3
> h 3
> r 3
>
> So what else am I doing wrong?  I am using the precompiled Windows exe
> that is on the project Downloads page.  I also got the 2.00 English data
> files from that page.  Is there maybe a bug in the executable
> currently available from the Downloads section?
>
> On Apr 3, 10:43 am, paulfeakins <[email protected]> wrote:
>
> > Yep, you have to download the "eng" language and put all the files
> > that begin with "eng." into your tessdata directory.
>
> > Tesseract doesn't come with any languages installed by default so you
> > have to add English first. It uses that language to help create your
> > box files.
>
> > Hope that helps?
>
> > On 2 Apr, 22:14, roxaz <[email protected]> wrote:
>
> > > The tesseract map.tiff fontfile batch.nochop makebox command does not
> > > work for me somehow. I get this error in the log:
> > > Unable to load unicharset file C:/Users/roxaz/Desktop/numrec/tesseract/bin.dbg6/tessdata/eng.unicharset
>
> > > It doesn't help if I create that file too. How do I fix this?