On Friday, July 31, 2015 at 3:26:16 AM UTC-4, Merv wrote:
>
> The task I am facing is extraction of rotated text (any angle) from a 
> scanned image. Kindly find the link : http://1drv.ms/1OS8elW which has 
> the sample PDF document that I need to OCR. The sample contains a blue 
> sticker on it and I need your guidance on extracting the text (printed 
> text) from it. 
>
> As you can see the orientation is not stable and i would be really 
> grateful if you could suggest a manner in which i could extract the text.
>

 Is the sticker always blue?  If so, it should be pretty easy to use image 
processing to identify its location, extract if from the background 
document it, square it up, and drop the blue background.  You can then OCR 
the resulting image.

Tom

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/03dd821d-1b3e-4378-8f19-3930eb2237ea%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to