On Jun 22, 6:48 am, patrickq <[email protected]> wrote:
> I tested it via ScanBizCards and indeed OSD has no issues whatsoever
> getting it right - there is 10 times the amount of text it needs and
> the image is very sharp, it's guaranteed to get it right. I am not
> familiar with the command-line tools however so I can't help, I'll
> just say that it should be very easy to write your own little utility
> making a call to DetectOS.
>
> Another easy solution: why don't you run Tesseract twice, first on the
> original image then on the image rotated 180 degrees? I assume you only
> need these two possibilities because it's a FAX, hence the page is
> taller than it is wide. Then pick the one that yields the most
> sensible text and the fewest gibberish characters.

That is my current method.  It has produced some edge cases, though:
when a page has no text (a graph, for example), both orientations
yield about the same amount of false-positive noise.  In those cases
I just keep the page the way it came in.  I was hoping for a more
efficient method.  I am glad the software works, though.  I guess I
might need to invest time in building a tool to detect orientation
using Tesseract.
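
For reference, here is a rough sketch of the two-pass approach as I
understand it, including the "keep it as it came in" fallback for pages
where neither orientation wins.  The names text_score, pick_orientation
and min_margin are made up for illustration, and the scoring rule
(letters in word-like runs minus punctuation-like characters) is only an
assumed heuristic, not anything Tesseract provides.  The OCR and
rotation steps are passed in as callables so the sketch does not depend
on any particular wrapper.

```python
import re

# Runs of 3+ letters are treated as "sensible" text (assumed heuristic).
WORD_RE = re.compile(r"[A-Za-z]{3,}")


def text_score(text):
    """Score OCR output: letters in word-like runs minus junk characters."""
    word_chars = sum(len(w) for w in WORD_RE.findall(text))
    # Anything that is neither alphanumeric nor whitespace counts as junk.
    junk = sum(1 for ch in text if not ch.isalnum() and not ch.isspace())
    return word_chars - junk


def pick_orientation(image, ocr, rotate, min_margin=5):
    """Return 180 if the rotated page reads clearly better, else 0.

    ocr(image) must return a string; rotate(image) must return the image
    turned by 180 degrees.  min_margin is a hypothetical threshold: if
    the rotated score does not beat the original by more than this, the
    page is left the way it came in (e.g. a graph with no real text).
    """
    upright = text_score(ocr(image))
    flipped = text_score(ocr(rotate(image)))
    return 180 if flipped - upright > min_margin else 0
```

With a Python wrapper such as pytesseract and Pillow, ocr could be
pytesseract.image_to_string and rotate could be
lambda im: im.rotate(180), but I have not tested that combination on
these faxes.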

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
