Hi, this looks hard. You have two problems here: straightening the text and cleaning it up.
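On the straightening side: lettering on a tire sidewall follows a circle, so if you can estimate the center and radius of that circle, straightening comes down to resampling a ring sector into a rectangle. Below is a minimal pure-Python sketch of that mapping. The function name `unwrap_arc`, its parameters, and the nearest-neighbor sampling are my own assumptions for illustration, and it takes the center and radius as given, which in practice is the hard part.

```python
import math


def unwrap_arc(img, cx, cy, r_outer, height, theta_start, theta_end):
    """Resample a ring sector of `img` into a straight strip (nearest neighbor).

    img             : list of rows of grayscale values, indexed img[y][x]
    cx, cy          : assumed center of the circle the text follows, in pixels
    r_outer         : radius of the outer edge of the text band, in pixels
    height          : thickness of the band in pixels (becomes the strip height)
    theta_start/end : angles in radians, 0 = straight up from the center,
                      increasing clockwise on screen (image y axis points down)
    """
    rows, cols = len(img), len(img[0])
    # Strip width = arc length measured along the middle of the band.
    r_mid = r_outer - height / 2.0
    width = max(1, round(abs(theta_end - theta_start) * r_mid))
    strip = [[0] * width for _ in range(height)]
    for v in range(height):
        radius = r_outer - v  # strip row 0 is the outer edge of the band
        for u in range(width):
            # Angle at the center of output column u.
            theta = theta_start + (theta_end - theta_start) * (u + 0.5) / width
            x = round(cx + radius * math.sin(theta))
            y = round(cy - radius * math.cos(theta))
            if 0 <= x < cols and 0 <= y < rows:
                strip[v][u] = img[y][x]  # pixels outside the image stay 0
    return strip
```

Depending on where the code sits on the sidewall, the strip may come out upside down or mirrored, so it may need a flip before it goes to the OCR step. For real images you would probably want interpolation rather than nearest-neighbor sampling.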
Once you have straightened the text to something like this:

[image: 8829199908894_crop.jpg]

the Google Vision API <https://cloud.google.com/vision/> recognizes it correctly. So it can be done. I do not know how they pre-process it; maybe they do not, and just throw it at a neural network trained on all kinds of text.

If you want to try cleaning it up, you need to look into edge detection (difference of Gaussians) and/or wavelet decomposition. I tried with GIMP, with weak results. Once you have something that at least roughly looks like black on white, you can try to fine-tune the Tesseract model, but you need a lot of samples with hand-transcribed text, unless you are lucky enough to have pre-classified images.

I would also try to fine-tune the existing model with the image as it is, with no pre-processing at all other than straightening. It may even work better.

To straighten the text you could try EAST text detection and rotate the bounding boxes. Or detect the curved lines and dewarp according to the radius. Or do component analysis, detect the letter boxes and dewarp accordingly. Not easy to do reliably on a random picture.

Bye

Lorenzo

On Mon, 20 May 2019 at 20:20, David Bess <[email protected]> wrote:

> Hi All,
>
> I am facing challenges reading the DOT code from a tire. As a person, I can
> clearly make it out with no difficulty. See attached image. In this example
> the DOT is PJ40 KU1R 2011. I have already tried inverting and binarisation,
> but I just do not get any output. I did a lot of research online, and it
> seems like a challenging problem, so I thought I would check with the smart
> folks here on the forum. Thanks in advance for any help or advice.
>
> Best regards,
>
> David Bess
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/1902f58b-6131-4c9e-9711-096df60bfcec%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

