After using regedit and pointing tessdata_prefix to the right place
and running again I got an error that referred to unicharset. The
entire contents of my tessdata subdirectory is:
Directory of C:\tesseract\Tesseract-OCR\tessdata
04/08/2011 12:50p <DIR> .
04/08/2011 12:50p <DIR> ..
04/08/2011 12:50p <DIR> configs
04/08/2011 12:21p 2,395,687 deu.traineddata
10/03/2010 08:01a 1,926,792 eng.traineddata
04/08/2011 12:24p 2,292,872 fra.traineddata
04/08/2011 12:27p 2,434,628 ita.traineddata
04/08/2011 12:29p 2,281,434 spa.traineddata
04/08/2011 12:50p <DIR> tessconfigs
5 File(s) 11,331,413 bytes
4 Dir(s) 47,724,969,984 bytes free
(no unichar type files)
Now the error is back to:
C:\tesseract\Tesseract-OCR>tesseract ocr_107.tif beglat
Error openning data file C:\Program Files\Tesseract-OCR\tessdata/
eng.traineddata
Well behaved w32 apps like emacs and gnuw32 utilities don't tell
Windows about themselves, why does tesseract have to?
On Apr 12, 6:59 pm, caudex <[email protected]> wrote:
> After installing tesseract-ocr 3.0 successfully and running it
> against 3 or 4 pdfs, I now get the following error
>
> C:\tesseract\Tesseract-OCR>tesseract ocr_107.tif beglat
> Error openning data file C:\Program Files\Tesseract-OCR\tessdata/
> eng.traineddata
>
> A dir on ...\tessdata shows:
>
> 10/03/2010 08:01a 1,926,792 eng.traineddata
>
> Notice the misspelling of openning and the / instead of \ in the
> qualified path to eng.traineddata.
>
> Does any of you have a clue what could be going wrong here after it
> worked correctly a few times?
> I see that tesseract is looking for the tessdata subdirectory in the
> wrong place (Program Files) instead of the current directory (where
> the .tif's were created) but how did it work the first three times?
> Under program files there is no tesseract-ocr subdirectory.
>
> Thanks,
>
> Ed
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en.