Good morning, I'm trying to use Tesseract to read dates in image files. The 
problem I have is that the image is rather small. This is the cropped image 
with the date I have to process:


[image: test-raw.jpg]

After some processing with Scikit-Image (rescaling, adding a white border, 
erosion and binarising) I get this image:

[image: processed.png]

To me it reads pretty well. Still, tesseract  reads "» MAY 2021". The "5" 
is missing.

How can I process the image to get the desired output, i.e. "5 MAY 2021".

I'm using tesseract 4.1.1 with pytesseract.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/75f4dfc8-7ec0-4334-8b11-72fc268f1b83n%40googlegroups.com.

Reply via email to