Good morning, I'm trying to use Tesseract to read dates in image files. The problem I have is that the image is rather small. This is the cropped image with the date I have to process:
[image: test-raw.jpg]

After some processing with Scikit-Image (rescaling, adding a white border, erosion, and binarising) I get this image:

[image: processed.png]

To me it reads pretty well. Still, Tesseract reads "» MAY 2021": the "5" is missing. How can I process the image to get the desired output, i.e. "5 MAY 2021"?

I'm using Tesseract 4.1.1 with pytesseract.
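For reference, here is a minimal sketch of a pipeline like the one described above. It is not necessarily the exact code used: the scale factor, border width, 3x3 footprint, Otsu threshold, and the `--psm 7` (single text line) setting are all assumptions, and `preprocess` and `read_date` are hypothetical helper names.

```python
import numpy as np
from skimage import filters, morphology, transform, util


def preprocess(gray, scale=4.0, border=20):
    """Rescale, pad with white, erode, and binarise a grayscale image.

    All parameter values are illustrative assumptions, not tuned settings.
    """
    img = util.img_as_float(gray)
    # Upscale the small crop with cubic interpolation.
    img = transform.rescale(img, scale, order=3, anti_aliasing=False)
    # Cubic interpolation can overshoot, so clamp back to [0, 1].
    img = np.clip(img, 0.0, 1.0)
    # Add a white margin around the text.
    img = np.pad(img, border, mode="constant", constant_values=1.0)
    # Grayscale erosion is a local minimum, so it thickens dark strokes
    # on a light background.
    img = morphology.erosion(img, np.ones((3, 3), dtype=bool))
    # Binarise with Otsu's threshold: text becomes 0, background 255.
    binary = img > filters.threshold_otsu(img)
    return (binary * 255).astype(np.uint8)


def read_date(path):
    """Run Tesseract on the preprocessed image (needs the tesseract binary)."""
    # Imported lazily so preprocess() can be used without Tesseract installed.
    import pytesseract
    from skimage import io

    gray = io.imread(path, as_gray=True)
    # "--psm 7" asks Tesseract to treat the image as a single text line.
    return pytesseract.image_to_string(preprocess(gray), config="--psm 7").strip()
```

Calling `read_date("test-raw.jpg")` would run the whole chain on the cropped image. Whether the page segmentation mode or a different scale factor recovers the leading "5" is something I have not verified.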

