Dear all,

I am very sorry for the mess I created. Unfortunately, there was another tesseract.exe in foldercontainingimage (I don't know when I put it there, and since that folder contains a huge number of files I did not notice it before). So when I run the command from foldercontainingimage, it uses the tesseract.exe in that folder (and therefore different lang files) rather than the one set in the environment variable. Now it all makes sense. Many thanks for your feedback.
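For anyone who hits the same thing: by default the Windows command prompt looks in the current directory before it walks the PATH entries, so a stray copy next to the image wins. Below is a rough Python sketch of that lookup order, only to illustrate the idea. The function name `resolve_executable` and the `Tesseract-OCR` stand-in folder are made up for this example, and the sketch checks only the `.exe` extension rather than the full PATHEXT list.

```python
import tempfile
from pathlib import Path


def resolve_executable(name, cwd, path_dirs, extensions=(".exe",)):
    """Return the first file that matches `name`, or None.

    Mimics the default cmd.exe lookup order: the current directory is
    checked first, then each PATH directory in the order given.
    """
    for directory in [cwd, *path_dirs]:
        for ext in extensions:
            candidate = Path(directory) / (name + ext)
            if candidate.is_file():
                return candidate
    return None


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        install_dir = root / "Tesseract-OCR"        # stands in for the installed copy
        image_dir = root / "foldercontainingimage"  # holds the forgotten second copy
        for directory in (install_dir, image_dir):
            directory.mkdir()
            (directory / "tesseract.exe").touch()   # empty placeholder file

        path_dirs = [install_dir]  # hypothetical PATH with a single entry

        # Run from the root folder: only the PATH copy can be found.
        print(resolve_executable("tesseract", root, path_dirs).parent.name)
        # Run from the image folder: the local copy shadows the PATH copy.
        print(resolve_executable("tesseract", image_dir, path_dirs).parent.name)
```

The first call resolves to the copy on the (hypothetical) PATH, the second to the copy sitting next to the image, which matches what I saw. On a real Windows machine, running `where tesseract` from each folder should list the candidate executables and make a shadowing copy like this visible.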
Sorry again!

On Sat, Aug 13, 2011 at 12:36 AM, zdenko podobny <[email protected]> wrote:

> Please run the following command (including quotes!):
> move "%TESSDATA_PREFIX%\tessdata\eng.traineddata" "%TESSDATA_PREFIX%\tessdata\_eng.traineddata"
>
> then run:
> c:\foldercontainingimage> tesseract 1test.jpg out
> c:\>tesseract ./foldercontainingimage/1test.jpg out
>
> and send us results.
>
> Zdenko
>
> On Fri, Aug 12, 2011 at 2:44 PM, Parmeet bhatia <[email protected]> wrote:
>
>> I am on Windows and have re-installed the application. The path variable
>> has been set properly. I am not able to figure out how there could be two
>> different lang files. I guess in the end all that tesseract.exe requires
>> is the eng.traineddata file for the English language, so I wonder how there
>> could be a different lang file.
>>
>> On Fri, Aug 12, 2011 at 4:39 PM, Dmitri Silaev <[email protected]> wrote:
>>
>>> The point is in the lang files, obviously. Tesseract uses different
>>> (English) lang files in the above two cases.
>>> Things to check:
>>> - "tessdata" folder in "c:\foldercontainingimage\"
>>> - "tessdata" folder in "c:\"
>>> - "tessdata" folder in "C:\Program Files\Tesseract-OCR\"
>>> - "TESSDATA_PREFIX" environment variable
>>>
>>> HTH
>>>
>>> Warm regards,
>>> Dmitri Silaev
>>> www.CustomOCR.com
>>>
>>> On Fri, Aug 12, 2011 at 1:03 PM, Parmeet bhatia <[email protected]> wrote:
>>> > Please find attached the image file. The version is 3.0.
>>> > Some extra info: I am doing automatic page layout before giving it to
>>> > tesseract, but sometimes non-text blocks also get detected. The attached
>>> > image is one example. With proper text blocks the results are the same,
>>> > but surprisingly not with images like the one I attached.
>>> >
>>> > On Fri, Aug 12, 2011 at 2:09 PM, zdenko podobny <[email protected]> wrote:
>>> >>
>>> >> can you please provide image file and info what version of tesseract you
>>> >> used?
>>> >> Zdenko
>>> >>
>>> >> On Fri, Aug 12, 2011 at 9:03 AM, Parmeet bhatia <[email protected]> wrote:
>>> >>>
>>> >>> Hi All,
>>> >>> I observe something pretty strange but could not figure out what the
>>> >>> problem is. When I run tesseract from the command line in the same folder
>>> >>> as the image, I get different results compared to what I get when I run it
>>> >>> from a different folder and give the path to the image. To be more
>>> >>> clear, the two command lines are:
>>> >>> c:\foldercontainingimage> tesseract 1test.jpg out
>>> >>> c:\>tesseract ./foldercontainingimage/1test.jpg out
>>> >>> The results are different in the two out.txt files. Any ideas what is
>>> >>> happening? Please find attached the image and the different recognized
>>> >>> text results.
>>> >>> Thanks,
>>> >>> Parmeet
>>> >>>
>>> >>> --
>>> >>> You received this message because you are subscribed to the Google
>>> >>> Groups "tesseract-ocr" group.
>>> >>> To post to this group, send email to [email protected]
>>> >>> To unsubscribe from this group, send email to
>>> >>> [email protected]
>>> >>> For more options, visit this group at
>>> >>> http://groups.google.com/group/tesseract-ocr?hl=en
>>> >
>>> > --
>>> > Parmeet
>>> > https://sites.google.com/site/bhatiaparmeet/
>>
>> --
>> Parmeet
>> https://sites.google.com/site/bhatiaparmeet/

--
Parmeet
https://sites.google.com/site/bhatiaparmeet/

