---------- Forwarded message ----------
From: 78yrsold <[email protected]>
Date: Fri, Aug 12, 2011 at 6:36 PM
Subject: Re: Getting different (garbage) text by running tesseract from different folders
To: tesseract-ocr <[email protected]>
Better to follow Dmitri's suggestion to find out the reason.

On Aug 12, 5:59 pm, "Sriranga(78yrsold)" <[email protected]> wrote:
> To avoid confusion, I suggest copying the tessdata folder in
> "c:\foldercontainingimage\" and pasting it in "C:\Program Files\Tesseract-OCR\"
> and also in "c:\", then testing again.
>
> On Fri, Aug 12, 2011 at 6:14 PM, Parmeet bhatia <[email protected]> wrote:
>
> > I am on Windows and have re-installed the application. The PATH variable
> > has been set properly. I am not able to figure out how there could be two
> > different lang files. I guess in the end all that tesseract.exe requires
> > is the eng.traineddata file for the English language, so I wonder how
> > there could be a different lang file.
>
> > On Fri, Aug 12, 2011 at 4:39 PM, Dmitri Silaev <[email protected]> wrote:
>
> >> The point is in the lang files, obviously. Tesseract uses different
> >> (English) lang files in the above two cases.
> >> Things to check:
> >> - "tessdata" folder in "c:\foldercontainingimage\"
> >> - "tessdata" folder in "c:\"
> >> - "tessdata" folder in "C:\Program Files\Tesseract-OCR\"
> >> - "TESSDATA_PREFIX" environment variable
>
> >> HTH
>
> >> Warm regards,
> >> Dmitri Silaev
> >> www.CustomOCR.com
>
> >> On Fri, Aug 12, 2011 at 1:03 PM, Parmeet bhatia
> >> <[email protected]> wrote:
> >> > Please find attached the image file. The version is 3.0.
> >> > Some extra info: I am doing automatic page layout before giving it to
> >> > tesseract, but sometimes non-text blocks also get detected. The attached
> >> > image is one example. With proper text blocks the results are the same,
> >> > but surprisingly not with images like the one I attached.
>
> >> > On Fri, Aug 12, 2011 at 2:09 PM, zdenko podobny <[email protected]> wrote:
>
> >> >> Can you please provide the image file and say which version of
> >> >> tesseract you used?
>
> >> >> Zdenko
>
> >> >> On Fri, Aug 12, 2011 at 9:03 AM, Parmeet bhatia <[email protected]>
> >> >> wrote:
>
> >> >>> Hi All,
> >> >>> I observe something pretty strange but cannot figure out what the
> >> >>> problem is. When I run tesseract from the command line in the same
> >> >>> folder as the image, I get different results than when I run it from
> >> >>> a different folder and give the path to the image. To be more clear,
> >> >>> the two command lines are:
> >> >>> c:\foldercontainingimage> tesseract 1test.jpg out
> >> >>> c:\>tesseract ./foldercontainingimage/1test.jpg out
> >> >>> The results differ between the two out.txt files. Any ideas what is
> >> >>> happening? Please find attached the image and the different
> >> >>> recognized text results.
> >> >>> Thanks,
> >> >>> Parmeet
>
> >> >>> --
> >> >>> You received this message because you are subscribed to the Google
> >> >>> Groups "tesseract-ocr" group.
> >> >>> To post to this group, send email to [email protected]
> >> >>> To unsubscribe from this group, send email to
> >> >>> [email protected]
> >> >>> For more options, visit this group at
> >> >>> http://groups.google.com/group/tesseract-ocr?hl=en
>
> >> > --
> >> > Parmeet
> >> > https://sites.google.com/site/bhatiaparmeet/
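
For anyone hitting the same symptom, the checklist in this thread (the three "tessdata" folders plus TESSDATA_PREFIX) can be walked with a small script. This is only a sketch, not something posted by the participants: the helper names `candidate_tessdata_dirs` and `fingerprint_lang_files` are made up for illustration, and because the meaning of TESSDATA_PREFIX has differed between Tesseract releases, the script simply looks both in the prefix itself and in a "tessdata" folder under it. It does not claim to reproduce Tesseract's actual lookup order; it only reports whether the eng.traineddata copies it finds are byte-identical.

```python
import hashlib
import os
from pathlib import Path


def candidate_tessdata_dirs(parent_dirs, env=None):
    """Return folders that might hold traineddata files (hypothetical helper).

    parent_dirs: folders expected to contain a "tessdata" subfolder.
    env: mapping used to read TESSDATA_PREFIX (defaults to os.environ).
    """
    env = os.environ if env is None else env
    dirs = [Path(d) / "tessdata" for d in parent_dirs]
    prefix = env.get("TESSDATA_PREFIX")
    if prefix:
        # Assumption: check both interpretations of the variable.
        dirs.append(Path(prefix) / "tessdata")
        dirs.append(Path(prefix))
    return dirs


def fingerprint_lang_files(dirs, lang="eng"):
    """Map each existing <dir>/<lang>.traineddata path to its SHA-256 digest."""
    found = {}
    for d in dirs:
        lang_file = d / f"{lang}.traineddata"
        if lang_file.is_file():
            found[str(lang_file)] = hashlib.sha256(lang_file.read_bytes()).hexdigest()
    return found


if __name__ == "__main__":
    # The three folders named in the thread.
    folders = [r"c:\foldercontainingimage", "c:\\", r"C:\Program Files\Tesseract-OCR"]
    found = fingerprint_lang_files(candidate_tessdata_dirs(folders))
    for path, digest in found.items():
        print(digest[:16], path)
    if len(set(found.values())) > 1:
        print("The eng.traineddata copies differ; the two runs may load different ones.")
```

If more than one digest shows up, that would be consistent with Dmitri's explanation that the two command lines pick up different English lang files.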

