Hello,

This problem occurs when there is very little line spacing or, in some cases,
when the font size is too small, as with footnotes/endnotes in "pleading"
documents.

One trick I can suggest, based on my experience: scale your image, isolate
the individual blobs, put them back together, and then perform recognition.
Of course, you need to know the layout and/or line information in order to
produce correct results in the recognized text.
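
To make the idea concrete, here is a minimal sketch in plain Python. It is
only an illustration under several assumptions, not how any OCR engine does
it internally: the page is a binarized bitmap stored as a list of rows of
0/1 values, the lines are close but their blobs do not actually touch, and
the starting row of each text line (line_tops) is already known from the
layout. The helper names scale, find_blobs and respace are hypothetical.
The recognition step itself is left out; the re-spaced bitmap would be
handed to whatever OCR engine you use.

```python
from bisect import bisect_right
from collections import deque


def scale(img, k):
    """Nearest-neighbour upscale of a 0/1 bitmap by an integer factor k."""
    return [[px for px in row for _ in range(k)] for row in img for _ in range(k)]


def find_blobs(img):
    """Return 4-connected blobs, each as a list of (y, x) ink pixels."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    blobs = []
    for y in range(h):
        for x in range(w):
            if not img[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue, blob = deque([(y, x)]), []
            while queue:
                cy, cx = queue.popleft()
                blob.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < h and 0 <= nx < w and img[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            blobs.append(blob)
    return blobs


def respace(img, line_tops, gap):
    """Redraw the bitmap with `gap` extra blank rows between text lines.

    line_tops: sorted starting rows of each text line (assumed to be known
    from the layout). Each blob is assigned to a line by its vertical
    centre and shifted down by gap * line_index.
    """
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h + gap * (len(line_tops) - 1))]
    for blob in find_blobs(img):
        centre = sum(y for y, _ in blob) / len(blob)
        line = max(0, bisect_right(line_tops, centre) - 1)
        for y, x in blob:
            out[y + gap * line][x] = 1
    return out


# Two tightly packed "lines": rows 0-1 and rows 2-3, with no blank row between.
page = [
    [1, 0, 1, 1],
    [1, 0, 1, 1],
    [0, 1, 0, 0],
    [0, 1, 0, 0],
]
spaced = respace(page, line_tops=[0, 2], gap=2)
enlarged = scale(spaced, 2)  # upscale before handing off to the OCR engine
```

Assigning blobs by vertical centre is a deliberate simplification; with
real scans, descenders and accents may need a more careful rule.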

This works with any OCR engine, but if the current version has any intrinsic
properties that help with this, I would be interested to know!

Regards,
Vicky

--
Vicky Budhiraja
http://www.sitarasoft.com/

-----Original Message-----
From: [email protected] [mailto:[email protected]]
On Behalf Of mw18888
Sent: Monday, April 04, 2011 06:09
To: tesseract-ocr
Subject: How to improve the Tesseract OCR accuracy when two text lines
adjacent to each other


From my testing, I see that Tesseract OCR can't recognize the
characters if two text lines are adjacent (or vertically very close
to each other).

I wonder if there is a way to configure Tesseract to
improve the OCR accuracy when two text lines are too close.

Thank you in advance.

mw18888




-- 
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en.

