> I'm not interested in words or paragraph detection, just character (as you
> can tell by the attached image). I tried doing ocr per character
> (PSM_SINGLE_CHAR) but tesseract had problems with capitalization and
> numbers. I get the best results running in PSM_SINGLE_LINE.
>
> My question is, how can I disable word and paragraph detection (but keep
> character recognition)?

Did you try turning off the dictionary lookups by configuring
tessedit_unrej_any_wd = true ? The sauce:

   BOOL_MEMBER(tessedit_unrej_any_wd, false,
                "Dont bother with word plausibility", this->params()),

Reviewing your TIFF, it's definitely driving the dictionary functions
crazy. But I'm just a newb here too, so I hope someone checks my
answer!

-- 
  Phlip
  http://c2.com/cgi/wiki?ZeekLand

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to