Parmeet, Glad to note that now it is working perfectly for you now. It is presumed that eng.lang in the one folder may be default of tesseract-ocr while in another folder you might have generated eng.lang. Anyway all problems are solved for you. With Best of Luck, sriranga(78yrs)
On Sat, Aug 13, 2011 at 1:58 PM, Parmeet bhatia <[email protected]>wrote: > Sriranga, many thanks :) . Yes, i have already run the tesseract from > different folders and i am getting exactly the same result now..I still need > to figure out which (and from where) lang files was being used by the > tesseract.exe which was there in the foldercontainingimage..I might try to > figure out in free time but for now everything is working perfectly well :) > > On Sat, Aug 13, 2011 at 12:58 PM, Sriranga(78yrsold) < > [email protected]> wrote: > >> Parmeeet, >> trust you are not Getting *different texts* by running tesseract from >> different folders now?. >> please confirm. >> Wishing you best of Luck. >> >> >> On Sat, Aug 13, 2011 at 10:33 AM, Parmeet bhatia < >> [email protected]> wrote: >> >>> Dear all, >>> >>> I am very sorry for the mess i created.. Unfortunately there was another >>> tesseract.exe in foldercontainingimage( don't know when i put in there and >>> since there are infinite number of stuffs in that folder i did not realized >>> this before ). Hence when i am running from foldercontainingimage it uses >>> the tesseract.exe in that folder (hence using different lang files) rather >>> than the one being set in environmental variable..Now it all make >>> sense..many thanks for your feedbacks.. >>> >>> Sorry again! >>> >>> On Sat, Aug 13, 2011 at 12:36 AM, zdenko podobny <[email protected]>wrote: >>> >>>> Please run following command (including quotes!): >>>> move "%TESSDATA_PREFIX%\tessdata\eng.traineddata" >>>> "%TESSDATA_PREFIX%\tessdata\_eng.traineddata" >>>> >>>> than run: >>>> c:\foldercontainingimage> tesseract 1test.jpg out >>>> c:\>tesseract ./foldercontainingimage/1test.jpg out >>>> >>>> and send us results. >>>> >>>> Zdenko >>>> >>>> On Fri, Aug 12, 2011 at 2:44 PM, Parmeet bhatia < >>>> [email protected]> wrote: >>>> >>>>> I am on windows and have re-installed the application. The path >>>>> variable have been set properly. I am not able to figure out how there >>>>> could >>>>> be two different lang files. I guess at the end all what tesseract.exe >>>>> requires is eng.traineddata file for English language..So i wonder how >>>>> there could be different lang file.. >>>>> >>>>> On Fri, Aug 12, 2011 at 4:39 PM, Dmitri Silaev >>>>> <[email protected]>wrote: >>>>> >>>>>> The point is in the lang files, obviously. Tesseract uses different >>>>>> (English) lang files in the above two cases. >>>>>> Things to check: >>>>>> - "tessdata" folder in "c:\foldercontainingimage\" >>>>>> - "tessdata" folder in "c:\" >>>>>> - "tessdata" folder in "C:\Program Files\Tesseract-OCR\" >>>>>> - "TESSDATA_PREFIX" environment variable >>>>>> >>>>>> HTH >>>>>> >>>>>> Warm regards, >>>>>> Dmitri Silaev >>>>>> www.CustomOCR.com >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> On Fri, Aug 12, 2011 at 1:03 PM, Parmeet bhatia >>>>>> <[email protected]> wrote: >>>>>> > Please find attach the image file. The version is 3.0 >>>>>> > Some extra info. : I am doing automatic page layout before giving it >>>>>> to >>>>>> > tesseract but sometimes non-text blocks also got detected. The >>>>>> attached >>>>>> > image is one example. With proper text blocks, the results are same >>>>>> > but surprisingly not with the images like i attached. >>>>>> > >>>>>> > On Fri, Aug 12, 2011 at 2:09 PM, zdenko podobny <[email protected]> >>>>>> wrote: >>>>>> >> >>>>>> >> can you please provide image file and info what version of >>>>>> tesseract you >>>>>> >> used? >>>>>> >> Zdenko >>>>>> >> >>>>>> >> On Fri, Aug 12, 2011 at 9:03 AM, Parmeet bhatia < >>>>>> [email protected]> >>>>>> >> wrote: >>>>>> >>> >>>>>> >>> Hi All, >>>>>> >>> I observe something pretty strange but could not figure out whats >>>>>> the >>>>>> >>> problem. When i run tesseract from command line from the same >>>>>> folder in >>>>>> >>> which the image is i get diferent results compared to what i get >>>>>> when from >>>>>> >>> command line from different folder and give path to the image. to >>>>>> be more >>>>>> >>> clear the two command lines are: >>>>>> >>> c:\foldercontainingimage> tesseract 1test.jpg out >>>>>> >>> c:\>tesseract ./foldercontainingimage/1test.jpg out >>>>>> >>> the results are different in two out.txt files. Any ideas whats >>>>>> happening >>>>>> >>> around?? Please find attached image and different recognized text >>>>>> results. >>>>>> >>> Thanks, >>>>>> >>> Parmeet >>>>>> >>> >>>>>> >>> -- >>>>>> >>> You received this message because you are subscribed to the Google >>>>>> >>> Groups "tesseract-ocr" group. >>>>>> >>> To post to this group, send email to >>>>>> [email protected] >>>>>> >>> To unsubscribe from this group, send email to >>>>>> >>> [email protected] >>>>>> >>> For more options, visit this group at >>>>>> >>> http://groups.google.com/group/tesseract-ocr?hl=en >>>>>> >> >>>>>> >> -- >>>>>> >> You received this message because you are subscribed to the Google >>>>>> >> Groups "tesseract-ocr" group. >>>>>> >> To post to this group, send email to >>>>>> [email protected] >>>>>> >> To unsubscribe from this group, send email to >>>>>> >> [email protected] >>>>>> >> For more options, visit this group at >>>>>> >> http://groups.google.com/group/tesseract-ocr?hl=en >>>>>> > >>>>>> > >>>>>> > >>>>>> > -- >>>>>> > Parmeet >>>>>> > https://sites.google.com/site/bhatiaparmeet/ >>>>>> > >>>>>> > -- >>>>>> > You received this message because you are subscribed to the Google >>>>>> > Groups "tesseract-ocr" group. >>>>>> > To post to this group, send email to [email protected] >>>>>> > To unsubscribe from this group, send email to >>>>>> > [email protected] >>>>>> > For more options, visit this group at >>>>>> > http://groups.google.com/group/tesseract-ocr?hl=en >>>>>> > >>>>>> >>>>>> -- >>>>>> You received this message because you are subscribed to the Google >>>>>> Groups "tesseract-ocr" group. >>>>>> To post to this group, send email to [email protected] >>>>>> To unsubscribe from this group, send email to >>>>>> [email protected] >>>>>> For more options, visit this group at >>>>>> http://groups.google.com/group/tesseract-ocr?hl=en >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> Parmeet >>>>> https://sites.google.com/site/bhatiaparmeet/ >>>>> >>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "tesseract-ocr" group. >>>>> To post to this group, send email to [email protected] >>>>> To unsubscribe from this group, send email to >>>>> [email protected] >>>>> For more options, visit this group at >>>>> http://groups.google.com/group/tesseract-ocr?hl=en >>>>> >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To post to this group, send email to [email protected] >>>> To unsubscribe from this group, send email to >>>> [email protected] >>>> For more options, visit this group at >>>> http://groups.google.com/group/tesseract-ocr?hl=en >>>> >>> >>> >>> >>> -- >>> Parmeet >>> https://sites.google.com/site/bhatiaparmeet/ >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To post to this group, send email to [email protected] >>> To unsubscribe from this group, send email to >>> [email protected] >>> For more options, visit this group at >>> http://groups.google.com/group/tesseract-ocr?hl=en >>> >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> > > > > -- > Parmeet > https://sites.google.com/site/bhatiaparmeet/ > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

