You can download latest version of tesseract-ocr and appropriate traineddata from
https://launchpad.net/~alex-p/+archive/ubuntu/tesseract-ocr I ran tesseract via command line with default values. You may need to remove the existing old version, before installing new. On 27-Feb-2018 1:14 AM, "Dusayanta Prasad" <[email protected]> wrote: > I am using tesseract in ubuntu command line, the version is > tesseract 3.04.01 > leptonica-1.73 > libgif 5.1.2 : libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : > libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.4 : libopenjp2 2.1.0 > > Regarding the part of gibberish text, i had to convert the image to .tif > format . Then i used tesseract with the .tif image as: > tesseract img.tif outtif > The text generated in my case has notable difference from yours. Your > one's has a good accuracy. Please tell me how did u achieved so. > Have a look at the text file attachment > > On Sunday, February 25, 2018 at 9:48:32 PM UTC+5:30, shree wrote: >> >> which version of tesseract are you using? >> >> See attached results with Tesseract 4 and eng from tessdata_fast >> >> >> >> ShreeDevi >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >> On Sun, Feb 25, 2018 at 8:16 PM, Zdenko Podobny <[email protected]> wrote: >> >>> https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality >>> >>> Zdenko >>> >>> 2018-02-25 11:38 GMT+01:00 Dusayanta Prasad <[email protected]>: >>> >>>> I am try to convert the below image using Tesseract in linux using the >>>> following command: >>>> >>>> tesseract img.jpg out -l eng >>>> >>>> >>>> <https://lh3.googleusercontent.com/-J5RCuBU_Wro/WpKRMkcPgdI/AAAAAAAADEM/NXaGsh1A-EgqRAC4KVOCK5TBeP_tSy8TwCLcBGAs/s1600/img.jpg> >>>> >>>> and i am getting the result like this >>>> >>>> >>>> <https://lh3.googleusercontent.com/-HMqQ9GLfPHk/WpKR9iiC2GI/AAAAAAAADEU/zvzRu4rai-smuhG1q2tCqwlyzVk5nyjFwCLcBGAs/s1600/text_snap.png> >>>> >>>> >>>> >>>> Please help me out. >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To post to this group, send email to [email protected]. >>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>> To view this discussion on the web visit https://groups.google.com/d/ms >>>> gid/tesseract-ocr/3fb7240e-612c-4e64-abc8-99a07c3a0447%40goo >>>> glegroups.com >>>> <https://groups.google.com/d/msgid/tesseract-ocr/3fb7240e-612c-4e64-abc8-99a07c3a0447%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To post to this group, send email to [email protected]. >>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit https://groups.google.com/d/ms >>> gid/tesseract-ocr/CAJbzG8wEWDctJwTsMtZ625cpjPKtfp6ee_UF% >>> 2BGHfzVEJbHmxOg%40mail.gmail.com >>> <https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8wEWDctJwTsMtZ625cpjPKtfp6ee_UF%2BGHfzVEJbHmxOg%40mail.gmail.com?utm_medium=email&utm_source=footer> >>> . >>> >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/50e86a21-fba5-4e30-8cd7-fe998cd7a186% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/50e86a21-fba5-4e30-8cd7-fe998cd7a186%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXxBEKHcCJVWKPsY32GcnPxwemnHaVEZLc_zsc9%2B4uP3g%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

