> I found that if I resize the image to enlarge it by three or even up
> to five times the size, I use cubic interpolation resize in Gimp, this
> improves the recognition of text by this program greatly.

Yap, I have found similar result.

I was preparing some PDF files for OCR. While converting the PDF to  
image I have increased its size.

using http://www.graphicsmagick.org/

gm convert -density 200 main_file.pdf image.png


You can try both graphicsmagick and imagemagick (there might be  
similar functionality that you are looking for).

Thanks
Salahuddin

On Nov 27, 2009, at 5:54 PM, philip wrote:

> Hi,
>
> I am doing text recognition of small fonts. Typically at the size you
> see on web-pages.
>
> I found that if I resize the image to enlarge it by three or even up
> to five times the size, I use cubic interpolation resize in Gimp, this
> improves the recognition of text by this program greatly.
>
> Is there any other image pre-processing I could do to improve
> recognition rates?
>
> Thanks, Philip
>
> --
>
> You received this message because you are subscribed to the Google  
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected] 
> .
> For more options, visit this group at 
> http://groups.google.com/group/tesseract-ocr?hl=en 
> .
>
>

--

You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.


Reply via email to