After using regedit and pointing tessdata_prefix to the right place
and running again I got an error that referred to unicharset. The
entire contents of my tessdata subdirectory is:

 Directory of C:\tesseract\Tesseract-OCR\tessdata

04/08/2011  12:50p      <DIR>          .
04/08/2011  12:50p      <DIR>          ..
04/08/2011  12:50p      <DIR>          configs
04/08/2011  12:21p           2,395,687 deu.traineddata
10/03/2010  08:01a           1,926,792 eng.traineddata
04/08/2011  12:24p           2,292,872 fra.traineddata
04/08/2011  12:27p           2,434,628 ita.traineddata
04/08/2011  12:29p           2,281,434 spa.traineddata
04/08/2011  12:50p      <DIR>          tessconfigs
               5 File(s)     11,331,413 bytes
               4 Dir(s)  47,724,969,984 bytes free

(no unichar type files)

Now the error is back to:

C:\tesseract\Tesseract-OCR>tesseract ocr_107.tif beglat
Error openning data file C:\Program Files\Tesseract-OCR\tessdata/
eng.traineddata

Well behaved w32 apps like emacs and gnuw32 utilities don't tell
Windows about themselves, why does tesseract have to?




On Apr 12, 6:59 pm, caudex <[email protected]> wrote:
> After installing tesseract-ocr 3.0 successfully and running it
> against  3 or 4 pdfs, I now get the following error
>
> C:\tesseract\Tesseract-OCR>tesseract ocr_107.tif beglat
> Error openning data file C:\Program Files\Tesseract-OCR\tessdata/
> eng.traineddata
>
> A dir on ...\tessdata shows:
>
> 10/03/2010  08:01a           1,926,792 eng.traineddata
>
> Notice the misspelling of openning and the / instead of \ in the
> qualified path to eng.traineddata.
>
> Does any of you have a clue what could be going wrong here after it
> worked correctly a few times?
> I see that tesseract is looking for the tessdata subdirectory in the
> wrong place (Program Files) instead of the current directory (where
> the .tif's were created) but how did it work the first three times?
> Under program files there is no tesseract-ocr subdirectory.
>
> Thanks,
>
> Ed

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to