To avoid confustion, it is suggested that copy the tessdata folder in
"c:\foldercontainingimage\" and paste in "C:\Program Files\Tesseract-OCR\"
and also in "c:\" and test again.

On Fri, Aug 12, 2011 at 6:14 PM, Parmeet bhatia <[email protected]>wrote:

> I am on windows and have re-installed the application. The path variable
> have been set properly. I am not able to figure out how there could be two
> different lang files. I guess at the end all what tesseract.exe requires
> is eng.traineddata  file for English language..So i wonder how there could
> be different lang file..
>
> On Fri, Aug 12, 2011 at 4:39 PM, Dmitri Silaev <[email protected]>wrote:
>
>> The point is in the lang files, obviously. Tesseract uses different
>> (English) lang files in the above two cases.
>> Things to check:
>> - "tessdata" folder in "c:\foldercontainingimage\"
>> - "tessdata" folder in "c:\"
>> - "tessdata" folder in "C:\Program Files\Tesseract-OCR\"
>> - "TESSDATA_PREFIX" environment variable
>>
>> HTH
>>
>> Warm regards,
>> Dmitri Silaev
>> www.CustomOCR.com
>>
>>
>>
>>
>>
>> On Fri, Aug 12, 2011 at 1:03 PM, Parmeet bhatia
>> <[email protected]> wrote:
>> > Please find attach the image file. The version is 3.0
>> > Some extra info. : I am doing automatic page layout before giving it to
>> > tesseract but sometimes non-text blocks also got detected. The attached
>> > image is one example. With proper text blocks, the results are same
>> > but surprisingly not with the images like i attached.
>> >
>> > On Fri, Aug 12, 2011 at 2:09 PM, zdenko podobny <[email protected]>
>> wrote:
>> >>
>> >> can you please provide image file and info what version of tesseract
>> you
>> >> used?
>> >> Zdenko
>> >>
>> >> On Fri, Aug 12, 2011 at 9:03 AM, Parmeet bhatia <
>> [email protected]>
>> >> wrote:
>> >>>
>> >>> Hi All,
>> >>> I observe something pretty strange but could not figure out whats the
>> >>> problem. When i run tesseract from command line from the same folder
>> in
>> >>> which the image is i get diferent results compared to what i get when
>> from
>> >>> command line from different folder and give path to the image. to be
>> more
>> >>> clear the two command lines are:
>> >>> c:\foldercontainingimage> tesseract 1test.jpg out
>> >>> c:\>tesseract ./foldercontainingimage/1test.jpg out
>> >>> the results are different in two out.txt files. Any ideas whats
>> happening
>> >>> around?? Please find attached image and different recognized text
>> results.
>> >>> Thanks,
>> >>> Parmeet
>> >>>
>> >>> --
>> >>> You received this message because you are subscribed to the Google
>> >>> Groups "tesseract-ocr" group.
>> >>> To post to this group, send email to [email protected]
>> >>> To unsubscribe from this group, send email to
>> >>> [email protected]
>> >>> For more options, visit this group at
>> >>> http://groups.google.com/group/tesseract-ocr?hl=en
>> >>
>> >> --
>> >> You received this message because you are subscribed to the Google
>> >> Groups "tesseract-ocr" group.
>> >> To post to this group, send email to [email protected]
>> >> To unsubscribe from this group, send email to
>> >> [email protected]
>> >> For more options, visit this group at
>> >> http://groups.google.com/group/tesseract-ocr?hl=en
>> >
>> >
>> >
>> > --
>> > Parmeet
>> > https://sites.google.com/site/bhatiaparmeet/
>> >
>> > --
>> > You received this message because you are subscribed to the Google
>> > Groups "tesseract-ocr" group.
>> > To post to this group, send email to [email protected]
>> > To unsubscribe from this group, send email to
>> > [email protected]
>> > For more options, visit this group at
>> > http://groups.google.com/group/tesseract-ocr?hl=en
>> >
>>
>> --
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]
>> To unsubscribe from this group, send email to
>> [email protected]
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>>
>
>
>
> --
> Parmeet
> https://sites.google.com/site/bhatiaparmeet/
>
>  --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to