Image editing you could do using ImageMagick (command line/java api) On Thursday, June 28, 2018 at 4:42:55 PM UTC-4, [email protected] wrote: > > Thank you Shree!! :) > > Ok after rotating it, > tesseract haven't succeed retrieving the text. > > *BUT* I kept experimenting with convert app (part of ImageMagick 6.8.9), > and resized the photo twice, > eventually the words got retrieved!! hooray! :) > > > > *So I was wondering.what is a good practice when taking shots with a > smartphone?* > it was mentioned: > Tesseract works best on images which have a DPI of at least 300 dpi, so > it may be beneficial to resize images. > > moreover it was mentioned in the git repo: > Tesseract does various image processing operations internally (using the > Leptonica library) before doing the actual OCR. > > *BTW is there a config parameter for enabling resizing image in case of a > problematic input?* > > > > > > On Thursday, June 28, 2018 at 10:38:20 PM UTC+3, shree wrote: >> >> Rotate your shot to correct orientation and try. >> >> On 6/28/18, [email protected] <[email protected]> wrote: >> > I'm quite new to tesseract and would like to use it in a project for >> OCR >> > purposes, >> > I found a tutorial on the web with photos, so I have executed tesseract >> > (tesseract 4.0.0-beta.2) on it, >> > and noticed it has *successfully retrieved every single word*, wow >> > IMPRESSIVE!! >> > >> > so I took my smartphone and took a crystal clear photo (no blurry), and >> > hoped it would work for me too. >> > but *NOTHING it failed miserably* (every word miss :/ bummer) >> > >> > I read this too: >> > https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality >> > >> > I tried to figure out what's i'm doing wrong by comparing the metedata >> EXIF >> > >> > of each photo, >> > but apparently the photo's metadata from the web tutorial has been >> stripped >> > >> > :/ >> > >> > Can someone explain to me. what am i missing here?? >> > I'm attaching the two photos. >> > >> > >> > Thank you in advance :) >> > >> > >> > -- >> > You received this message because you are subscribed to the Google >> Groups >> > "tesseract-ocr" group. >> > To unsubscribe from this group and stop receiving emails from it, send >> an >> > email to [email protected]. >> > To post to this group, send email to [email protected]. >> > Visit this group at https://groups.google.com/group/tesseract-ocr. >> > To view this discussion on the web visit >> > >> https://groups.google.com/d/msgid/tesseract-ocr/9676c56c-4ed4-4329-9aad-82937c495b91%40googlegroups.com. >> >> >> > For more options, visit https://groups.google.com/d/optout. >> > >> >> >> -- >> >> ____________________________________________________________ >> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com >> >
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/710b97c2-9a4a-4047-8ece-ed563c23ec2d%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

