On Monday, June 22, 2015 at 7:56:51 AM UTC-4, Gunasekaran Velu wrote:
>
>
> I have attached the image as well as Tesseract OCR result for attached 
> image screen shot. the below OCR some words are missing from OCR how can i 
> improve the image quality to detect the missing words.
>
> The attached image DPI are
>
> Horizontal resolution - 204 DPI
> Vertical resolution    -    98 DPI
>
> Please help me to improve the OCR accuracy.
>

The easiest improvement to make would be to use "Fine" mode at a minimum to 
bring the vertical resolution up to 200 DPI.  If a higher resolution is 
available (e.g. "super fine") that would be even better.

The corner marks on the form are clearly designed to help with form 
processing, so I'd use them in your image processing pipeline to deskew, 
remove background printing, etc.

The form can be broken into zones to be recognized individually, using 
knowledge of the type of information expected to help tune things.

Tom 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/4027970e-2d6b-466b-8cac-2359c3dcd7a0%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to