> I found that if I resize the image to enlarge it by three or even up
> to five times the size, I use cubic interpolation resize in Gimp, this
> improves the recognition of text by this program greatly.
Yes, I have found a similar result. I was preparing some PDF files for OCR, and I increased the image size while converting the PDF to an image, using GraphicsMagick (http://www.graphicsmagick.org/):

    gm convert -density 200 main_file.pdf image.png

You can try both GraphicsMagick and ImageMagick; either may have the functionality you are looking for.

Thanks
Salahuddin

On Nov 27, 2009, at 5:54 PM, philip wrote:
> Hi,
>
> I am doing text recognition of small fonts. Typically at the size you
> see on web-pages.
>
> I found that if I resize the image to enlarge it by three or even up
> to five times the size, I use cubic interpolation resize in Gimp, this
> improves the recognition of text by this program greatly.
>
> Is there any other image pre-processing I could do to improve
> recognition rates?
>
> Thanks, Philip
>
> --
>
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
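To relate the -density value in the command above to the "three to five times" enlargement from the original question, here is a rough Python sketch. It assumes the rasterizer renders PDFs at 72 dpi when no density is given, so that a density of 200 works out to roughly 2.8 times that size. That 72 dpi baseline is an assumption to check against your own install. The helper names density_for_scale and gm_convert_command are made up for illustration, and the script only builds and prints the command line; it does not run gm.

```python
import shlex

# Assumption: PDFs are rasterized at 72 dpi when -density is not given.
DEFAULT_PDF_DPI = 72


def density_for_scale(scale, base_dpi=DEFAULT_PDF_DPI):
    """Return the -density value that enlarges the rendering by `scale`."""
    if scale <= 0:
        raise ValueError("scale must be positive")
    return round(base_dpi * scale)


def gm_convert_command(pdf_path, png_path, scale=3):
    """Build (but do not run) a gm convert command as an argument list."""
    density = density_for_scale(scale)
    return ["gm", "convert", "-density", str(density), pdf_path, png_path]


if __name__ == "__main__":
    # Print the commands for a 3x and a 5x enlargement.
    for scale in (3, 5):
        command = gm_convert_command("main_file.pdf", "image.png", scale)
        print(shlex.join(command))
```

The returned list can be passed to subprocess.run if GraphicsMagick is installed. Whether 3x or 5x gives better recognition will depend on the font size in the source, so it is worth trying a few values.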

