To avoid confustion, it is suggested that copy the tessdata folder in "c:\foldercontainingimage\" and paste in "C:\Program Files\Tesseract-OCR\" and also in "c:\" and test again.
On Fri, Aug 12, 2011 at 6:14 PM, Parmeet bhatia <[email protected]>wrote: > I am on windows and have re-installed the application. The path variable > have been set properly. I am not able to figure out how there could be two > different lang files. I guess at the end all what tesseract.exe requires > is eng.traineddata file for English language..So i wonder how there could > be different lang file.. > > On Fri, Aug 12, 2011 at 4:39 PM, Dmitri Silaev <[email protected]>wrote: > >> The point is in the lang files, obviously. Tesseract uses different >> (English) lang files in the above two cases. >> Things to check: >> - "tessdata" folder in "c:\foldercontainingimage\" >> - "tessdata" folder in "c:\" >> - "tessdata" folder in "C:\Program Files\Tesseract-OCR\" >> - "TESSDATA_PREFIX" environment variable >> >> HTH >> >> Warm regards, >> Dmitri Silaev >> www.CustomOCR.com >> >> >> >> >> >> On Fri, Aug 12, 2011 at 1:03 PM, Parmeet bhatia >> <[email protected]> wrote: >> > Please find attach the image file. The version is 3.0 >> > Some extra info. : I am doing automatic page layout before giving it to >> > tesseract but sometimes non-text blocks also got detected. The attached >> > image is one example. With proper text blocks, the results are same >> > but surprisingly not with the images like i attached. >> > >> > On Fri, Aug 12, 2011 at 2:09 PM, zdenko podobny <[email protected]> >> wrote: >> >> >> >> can you please provide image file and info what version of tesseract >> you >> >> used? >> >> Zdenko >> >> >> >> On Fri, Aug 12, 2011 at 9:03 AM, Parmeet bhatia < >> [email protected]> >> >> wrote: >> >>> >> >>> Hi All, >> >>> I observe something pretty strange but could not figure out whats the >> >>> problem. When i run tesseract from command line from the same folder >> in >> >>> which the image is i get diferent results compared to what i get when >> from >> >>> command line from different folder and give path to the image. to be >> more >> >>> clear the two command lines are: >> >>> c:\foldercontainingimage> tesseract 1test.jpg out >> >>> c:\>tesseract ./foldercontainingimage/1test.jpg out >> >>> the results are different in two out.txt files. Any ideas whats >> happening >> >>> around?? Please find attached image and different recognized text >> results. >> >>> Thanks, >> >>> Parmeet >> >>> >> >>> -- >> >>> You received this message because you are subscribed to the Google >> >>> Groups "tesseract-ocr" group. >> >>> To post to this group, send email to [email protected] >> >>> To unsubscribe from this group, send email to >> >>> [email protected] >> >>> For more options, visit this group at >> >>> http://groups.google.com/group/tesseract-ocr?hl=en >> >> >> >> -- >> >> You received this message because you are subscribed to the Google >> >> Groups "tesseract-ocr" group. >> >> To post to this group, send email to [email protected] >> >> To unsubscribe from this group, send email to >> >> [email protected] >> >> For more options, visit this group at >> >> http://groups.google.com/group/tesseract-ocr?hl=en >> > >> > >> > >> > -- >> > Parmeet >> > https://sites.google.com/site/bhatiaparmeet/ >> > >> > -- >> > You received this message because you are subscribed to the Google >> > Groups "tesseract-ocr" group. >> > To post to this group, send email to [email protected] >> > To unsubscribe from this group, send email to >> > [email protected] >> > For more options, visit this group at >> > http://groups.google.com/group/tesseract-ocr?hl=en >> > >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> > > > > -- > Parmeet > https://sites.google.com/site/bhatiaparmeet/ > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

