Parmeeet, trust you are not Getting *different texts* by running tesseract from different folders now?. please confirm. Wishing you best of Luck.
On Sat, Aug 13, 2011 at 10:33 AM, Parmeet bhatia <[email protected]>wrote: > Dear all, > > I am very sorry for the mess i created.. Unfortunately there was another > tesseract.exe in foldercontainingimage( don't know when i put in there and > since there are infinite number of stuffs in that folder i did not realized > this before ). Hence when i am running from foldercontainingimage it uses > the tesseract.exe in that folder (hence using different lang files) rather > than the one being set in environmental variable..Now it all make > sense..many thanks for your feedbacks.. > > Sorry again! > > On Sat, Aug 13, 2011 at 12:36 AM, zdenko podobny <[email protected]> wrote: > >> Please run following command (including quotes!): >> move "%TESSDATA_PREFIX%\tessdata\eng.traineddata" >> "%TESSDATA_PREFIX%\tessdata\_eng.traineddata" >> >> than run: >> c:\foldercontainingimage> tesseract 1test.jpg out >> c:\>tesseract ./foldercontainingimage/1test.jpg out >> >> and send us results. >> >> Zdenko >> >> On Fri, Aug 12, 2011 at 2:44 PM, Parmeet bhatia <[email protected] >> > wrote: >> >>> I am on windows and have re-installed the application. The path variable >>> have been set properly. I am not able to figure out how there could be two >>> different lang files. I guess at the end all what tesseract.exe requires >>> is eng.traineddata file for English language..So i wonder how there could >>> be different lang file.. >>> >>> On Fri, Aug 12, 2011 at 4:39 PM, Dmitri Silaev <[email protected]>wrote: >>> >>>> The point is in the lang files, obviously. Tesseract uses different >>>> (English) lang files in the above two cases. >>>> Things to check: >>>> - "tessdata" folder in "c:\foldercontainingimage\" >>>> - "tessdata" folder in "c:\" >>>> - "tessdata" folder in "C:\Program Files\Tesseract-OCR\" >>>> - "TESSDATA_PREFIX" environment variable >>>> >>>> HTH >>>> >>>> Warm regards, >>>> Dmitri Silaev >>>> www.CustomOCR.com >>>> >>>> >>>> >>>> >>>> >>>> On Fri, Aug 12, 2011 at 1:03 PM, Parmeet bhatia >>>> <[email protected]> wrote: >>>> > Please find attach the image file. The version is 3.0 >>>> > Some extra info. : I am doing automatic page layout before giving it >>>> to >>>> > tesseract but sometimes non-text blocks also got detected. The >>>> attached >>>> > image is one example. With proper text blocks, the results are same >>>> > but surprisingly not with the images like i attached. >>>> > >>>> > On Fri, Aug 12, 2011 at 2:09 PM, zdenko podobny <[email protected]> >>>> wrote: >>>> >> >>>> >> can you please provide image file and info what version of tesseract >>>> you >>>> >> used? >>>> >> Zdenko >>>> >> >>>> >> On Fri, Aug 12, 2011 at 9:03 AM, Parmeet bhatia < >>>> [email protected]> >>>> >> wrote: >>>> >>> >>>> >>> Hi All, >>>> >>> I observe something pretty strange but could not figure out whats >>>> the >>>> >>> problem. When i run tesseract from command line from the same folder >>>> in >>>> >>> which the image is i get diferent results compared to what i get >>>> when from >>>> >>> command line from different folder and give path to the image. to be >>>> more >>>> >>> clear the two command lines are: >>>> >>> c:\foldercontainingimage> tesseract 1test.jpg out >>>> >>> c:\>tesseract ./foldercontainingimage/1test.jpg out >>>> >>> the results are different in two out.txt files. Any ideas whats >>>> happening >>>> >>> around?? Please find attached image and different recognized text >>>> results. >>>> >>> Thanks, >>>> >>> Parmeet >>>> >>> >>>> >>> -- >>>> >>> You received this message because you are subscribed to the Google >>>> >>> Groups "tesseract-ocr" group. >>>> >>> To post to this group, send email to [email protected] >>>> >>> To unsubscribe from this group, send email to >>>> >>> [email protected] >>>> >>> For more options, visit this group at >>>> >>> http://groups.google.com/group/tesseract-ocr?hl=en >>>> >> >>>> >> -- >>>> >> You received this message because you are subscribed to the Google >>>> >> Groups "tesseract-ocr" group. >>>> >> To post to this group, send email to [email protected] >>>> >> To unsubscribe from this group, send email to >>>> >> [email protected] >>>> >> For more options, visit this group at >>>> >> http://groups.google.com/group/tesseract-ocr?hl=en >>>> > >>>> > >>>> > >>>> > -- >>>> > Parmeet >>>> > https://sites.google.com/site/bhatiaparmeet/ >>>> > >>>> > -- >>>> > You received this message because you are subscribed to the Google >>>> > Groups "tesseract-ocr" group. >>>> > To post to this group, send email to [email protected] >>>> > To unsubscribe from this group, send email to >>>> > [email protected] >>>> > For more options, visit this group at >>>> > http://groups.google.com/group/tesseract-ocr?hl=en >>>> > >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To post to this group, send email to [email protected] >>>> To unsubscribe from this group, send email to >>>> [email protected] >>>> For more options, visit this group at >>>> http://groups.google.com/group/tesseract-ocr?hl=en >>>> >>> >>> >>> >>> -- >>> Parmeet >>> https://sites.google.com/site/bhatiaparmeet/ >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To post to this group, send email to [email protected] >>> To unsubscribe from this group, send email to >>> [email protected] >>> For more options, visit this group at >>> http://groups.google.com/group/tesseract-ocr?hl=en >>> >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> > > > > -- > Parmeet > https://sites.google.com/site/bhatiaparmeet/ > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

