Sriranga, many thanks :). Yes, I have already run tesseract from different folders and I am getting exactly the same result now. I still need to figure out which lang files (and from where) were being used by the tesseract.exe that was in foldercontainingimage. I might try to figure that out in my free time, but for now everything is working perfectly well :)
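For anyone who wants to track down which tesseract.exe and which lang files are picked up from a given folder, here is a minimal sketch in Python. It assumes the default cmd.exe behaviour of looking in the current directory before the PATH entries, and it only lists the tessdata places mentioned in this thread (TESSDATA_PREFIX and a tessdata folder next to the executable); it is not a statement of how Tesseract resolves its data internally. The function names `resolve_like_cmd` and `tessdata_candidates` are made up for illustration.

```python
import os


def resolve_like_cmd(exe_name, cwd, path_value, pathsep=os.pathsep):
    """Return every file named exe_name, in the assumed cmd.exe search order:
    the current directory first, then each PATH entry in order."""
    search_dirs = [cwd] + [d for d in path_value.split(pathsep) if d]
    matches = []
    for directory in search_dirs:
        candidate = os.path.join(directory, exe_name)
        if os.path.isfile(candidate) and candidate not in matches:
            matches.append(candidate)
    return matches


def tessdata_candidates(exe_path, env=None):
    """List tessdata folders worth inspecting for one executable.
    Only the places named in this thread are covered (an assumption,
    not Tesseract's actual lookup logic)."""
    env = os.environ if env is None else env
    candidates = []
    prefix = env.get("TESSDATA_PREFIX")
    if prefix:
        candidates.append(os.path.join(prefix, "tessdata"))
    candidates.append(os.path.join(os.path.dirname(exe_path), "tessdata"))
    return candidates


if __name__ == "__main__":
    found = resolve_like_cmd(
        "tesseract.exe", os.getcwd(), os.environ.get("PATH", "")
    )
    if not found:
        print("no tesseract.exe found in the current directory or on PATH")
    for rank, exe in enumerate(found):
        label = "would run" if rank == 0 else "shadowed"
        print(f"{label}: {exe}")
        for folder in tessdata_candidates(exe):
            lang_file = os.path.join(folder, "eng.traineddata")
            state = "present" if os.path.isfile(lang_file) else "missing"
            print(f"    {lang_file} ({state})")
```

Running it once from c:\foldercontainingimage and once from c:\ should show whether a stray tesseract.exe in the image folder shadows the installed one.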

On Sat, Aug 13, 2011 at 12:58 PM, Sriranga(78yrsold) <[email protected]> wrote:

> Parmeet,
> I trust you are no longer getting *different texts* when running tesseract
> from different folders? Please confirm.
> Wishing you the best of luck.
>
> On Sat, Aug 13, 2011 at 10:33 AM, Parmeet bhatia <[email protected]> wrote:
>
>> Dear all,
>>
>> I am very sorry for the mess I created. Unfortunately there was another
>> tesseract.exe in foldercontainingimage (I don't know when I put it there,
>> and since there is an endless amount of stuff in that folder I did not
>> notice it before). Hence, when I run from foldercontainingimage, it uses
>> the tesseract.exe in that folder (and hence different lang files) rather
>> than the one set in the environment variable. Now it all makes sense.
>> Many thanks for your feedback.
>>
>> Sorry again!
>>
>> On Sat, Aug 13, 2011 at 12:36 AM, zdenko podobny <[email protected]> wrote:
>>
>>> Please run the following command (including quotes!):
>>> move "%TESSDATA_PREFIX%\tessdata\eng.traineddata" "%TESSDATA_PREFIX%\tessdata\_eng.traineddata"
>>>
>>> then run:
>>> c:\foldercontainingimage> tesseract 1test.jpg out
>>> c:\>tesseract ./foldercontainingimage/1test.jpg out
>>>
>>> and send us the results.
>>>
>>> Zdenko
>>>
>>> On Fri, Aug 12, 2011 at 2:44 PM, Parmeet bhatia <[email protected]> wrote:
>>>
>>>> I am on Windows and have re-installed the application. The path
>>>> variable has been set properly. I am not able to figure out how there
>>>> could be two different lang files. I guess in the end all that
>>>> tesseract.exe requires is the eng.traineddata file for English, so I
>>>> wonder how there could be a different lang file.
>>>>
>>>> On Fri, Aug 12, 2011 at 4:39 PM, Dmitri Silaev <[email protected]> wrote:
>>>>
>>>>> The point is in the lang files, obviously. Tesseract uses different
>>>>> (English) lang files in the above two cases.
>>>>> Things to check:
>>>>> - "tessdata" folder in "c:\foldercontainingimage\"
>>>>> - "tessdata" folder in "c:\"
>>>>> - "tessdata" folder in "C:\Program Files\Tesseract-OCR\"
>>>>> - "TESSDATA_PREFIX" environment variable
>>>>>
>>>>> HTH
>>>>>
>>>>> Warm regards,
>>>>> Dmitri Silaev
>>>>> www.CustomOCR.com
>>>>>
>>>>> On Fri, Aug 12, 2011 at 1:03 PM, Parmeet bhatia <[email protected]> wrote:
>>>>> > Please find attached the image file. The version is 3.0.
>>>>> > Some extra info: I am doing automatic page layout before giving it to
>>>>> > tesseract, but sometimes non-text blocks also get detected. The
>>>>> > attached image is one example. With proper text blocks the results
>>>>> > are the same, but surprisingly not with images like the one I attached.
>>>>> >
>>>>> > On Fri, Aug 12, 2011 at 2:09 PM, zdenko podobny <[email protected]> wrote:
>>>>> >>
>>>>> >> Can you please provide the image file and say what version of
>>>>> >> tesseract you used?
>>>>> >> Zdenko
>>>>> >>
>>>>> >> On Fri, Aug 12, 2011 at 9:03 AM, Parmeet bhatia <[email protected]> wrote:
>>>>> >>>
>>>>> >>> Hi All,
>>>>> >>> I observed something pretty strange but could not figure out what
>>>>> >>> the problem is. When I run tesseract from the command line in the
>>>>> >>> same folder as the image, I get different results compared to what
>>>>> >>> I get when I run it from a different folder and give the path to
>>>>> >>> the image. To be more clear, the two command lines are:
>>>>> >>> c:\foldercontainingimage> tesseract 1test.jpg out
>>>>> >>> c:\>tesseract ./foldercontainingimage/1test.jpg out
>>>>> >>> The results in the two out.txt files are different. Any ideas what
>>>>> >>> is happening? Please find attached the image and the different
>>>>> >>> recognized text results.
>>>>> >>> Thanks,
>>>>> >>> Parmeet
>>>>> >>>
>>>>> >>> --
>>>>> >>> You received this message because you are subscribed to the Google
>>>>> >>> Groups "tesseract-ocr" group.
>>>>> >>> To post to this group, send email to [email protected]
>>>>> >>> To unsubscribe from this group, send email to
>>>>> >>> [email protected]
>>>>> >>> For more options, visit this group at
>>>>> >>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>>> >
>>>>> > --
>>>>> > Parmeet
>>>>> > https://sites.google.com/site/bhatiaparmeet/
>>>>
>>>> --
>>>> Parmeet
>>>> https://sites.google.com/site/bhatiaparmeet/
>>
>> --
>> Parmeet
>> https://sites.google.com/site/bhatiaparmeet/

--
Parmeet
https://sites.google.com/site/bhatiaparmeet/

