Actually, you want to use the lowercase l as a language specification option:

tesseract image.tif output -l lang

Warm regards,
Dmitri Silaev
www.CustomOCR.com





On Wed, Aug 24, 2011 at 6:26 AM, tteveris <[email protected]> wrote:
> Installed Tesseract 3.0 and then copied the installed folder to the
> following location.
>
>
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\doc
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\gzip.exe
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\leptonlib.dll
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tesseract.exe
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\training
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\doc\AUTHORS
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\doc\COPYING
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\doc\eurotext.tif
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\doc
> \phototest.tif
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\doc\README
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\doc\ReleaseNotes
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \eng.traineddata
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \tessconfigs
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \ambigs.train
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \api_config
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \box.train
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \box.train.stderr
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \digits
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \inter
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \kannada
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \logfile
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \makebox
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata\configs
> \unlv
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \tessconfigs\batch
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \tessconfigs\batch.nochop
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \tessconfigs\matdemo
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \tessconfigs\msdemo
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \tessconfigs\nobatch
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tessdata
> \tessconfigs\segdemo
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\training
> \cntraining.exe
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\training
> \combine_tessdata.exe
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\training
> \mftraining.exe
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\training
> \unicharset_extractor.exe
> C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\training
> \wordlist2dawg.exe
>
>
> My test app (DLL) calls CreateProcess() with the following command:
>
> "C:\Program Files (x86)\XXX\Software\OCR\Tesseract-OCR\tesseract.exe C:
> \Users\Tony\Pictures\ocr\phototest.tif c:\Temp\ocrout -L eng"
>
> The output file is created (ocrout.txt) but when I read Tesseract's
> STDOUT pipe I get the following:
>
> read_variables_file: Can't open L
> read_variables_file: Can't open eng
> Tesseract Open Source OCR Engine with Leptonica
> Number of found pages: 1.
>
>
> So I take it that I'm specifying the language parameter incorrect.
>
>
> Another question, in doing the above and not looking at the Tesseract
> code where does it look for the eng.traineddata file? Is it based on
> where Tesseract was sceduled from or do I have to set the current
> directory?
>
> Thanks in advance
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to