No configs I know of but I have similar functionality implemented in a text post-processing step in my OCR pipeline.
On Wed, Sep 18, 2019 at 11:19 AM 'Sandra M.' via tesseract-ocr < [email protected]> wrote: > I'm using Tesseract with Python. I have an image with 1-6 words in it and > need to read the text. Sometimes the character "C", which look the same in > upper and lower case, is detected as lower case c instead of upper case C. > I see the problem, but in context to the following letters it should be > possible to detect the right notation. Is there any configuration or > something to improve this? > > I had a look at the configuration options of config='-psm x' with > different values for x, but nothing fits to my problem > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/ef0e07cc-5f7c-4ff3-bb07-ffdda4c68321%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/ef0e07cc-5f7c-4ff3-bb07-ffdda4c68321%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CABtjQ9JbJn5UeCJsrYcT-fM0S7sJWoM5E-vXV%3DvWX1JCnbMweg%40mail.gmail.com.

