There are actually pre-baked ImageMagick scripts for cleaning up text images before OCR. Search for "imagemagick textcleaner".
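For the simpler "scale it up first" suggestion quoted below, here is a rough sketch of what the preprocessing could look like. It assumes ImageMagick's `convert` and `tesseract` are both on the PATH; the helper names, the 300% factor, and the grayscale step are my own guesses for illustration, not tuned values and not what the textcleaner script does. On Windows, note that a system tool is also called `convert`, so you may need the full path to the ImageMagick binary.

```python
import subprocess


def upscale_cmd(src, dst, percent=300):
    """Build an ImageMagick command that enlarges src and converts it to grayscale.

    The 300% default is an assumption; try a few values for your image.
    """
    return ["convert", src, "-resize", f"{percent}%", "-colorspace", "Gray", dst]


def ocr_cmd(image, out_base, lang="eng"):
    """Build the same tesseract invocation used in the original post."""
    return ["tesseract", image, out_base, "-l", lang]


def run_pipeline(src, out_base, percent=300, lang="eng"):
    """Upscale the image, then OCR the enlarged copy.

    Returns the name of the text file tesseract writes (out_base + ".txt").
    """
    big = f"{out_base}-big.tif"  # hypothetical name for the intermediate file
    subprocess.run(upscale_cmd(src, big, percent), check=True)
    subprocess.run(ocr_cmd(big, out_base, lang), check=True)
    return out_base + ".txt"
```

Calling `run_pipeline("LockBits.tif", "LockBits")` would run both steps on the file from the original post. The commands are built as lists so you can print and inspect them before running anything.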
On Fri, Oct 24, 2014 at 5:27 PM, Simon Eigeldinger <[email protected]> wrote:
> hi,
>
> is there a guideline what to do with poor quality pics?
> i am blind so i have no clue what sighted people do with those. *smile*
> and it seems tesseract can't do much about pic quality.
> maybe imagemagick might be a good choice for fixing things?
>
> greetings,
> simon
>
> On 24.10.2014 at 23:02, Robert Melton wrote:
>
>> Is that tiny file the actual file size you are running OCR on? If so,
>> scale up the image and I am guessing results will improve greatly.
>>
>> On Fri, Oct 24, 2014 at 2:25 PM, BDristan <[email protected]> wrote:
>>>
>>> I'm quite new to tesseract. I just tried to OCR an image as follows:
>>>
>>> tesseract LockBits.tif LockBits -l eng
>>>
>>> The output text was pretty messed up. I ran tesseract 3.02 on Win7.
>>>
>>> I then run an on-line OCR and got a perfect result.
>>>
>>> Could someone please give me some hints on how to improve OCR with
>>> tesseract.
>>>
>>> Attached is an image file that I used.
>>>
>>> Thanks.
>>>
>>> --
>>> You received this message because you are subscribed to the Google Groups
>>> "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send an
>>> email to [email protected].
>>> To post to this group, send email to [email protected].
>>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/0274edc9-8744-489b-bcf5-0eabc9dbd5c0%40googlegroups.com.
>>> For more options, visit https://groups.google.com/d/optout.
>
> --
> Simon Eigeldinger
> Follow me on Twitter: http://www.twitter.com/domasofan/
> E-Mail: [email protected]
> MSN: [email protected]
> ICQ: 121823966
> Jabber: [email protected]

--
Robert Melton | http://robertmelton.com

