Hi folks, i've been playing a while with tesseract and opencv to get the best out of my scans. But lately i came across this problem: I need to scan a bunch of documents which are printed by an old needle-printer(I suppose), which has a thin "no-ink"-line horizontally through the text (s. attached pichture). With these documents i get no or very poor results. Could some one point me in the right direction how to get tesseract to read them? Is there some image-preprocessing I could do? Or do I have to train tesseract this "broken font"? (...that would be bad, because this line is not alway at the same position within the font).
All help welcome :) Thanks, Mo -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/a6982e8d-391b-46d2-9265-57d1cedfd297%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

