Greetings everyone, I am standing in front of quite challenging task. Optical Character Recognition of the data from the IDs taken by smartphone camera. I have tried tesseract as-is, but the accuracy rate is somewhere around 40%.
I have started tweaking around, disabling dictionaries and preprocessing images to grayscale, using different page segmentation methods, but each setting produces various and different accuracy on different photos. I am asking you guys as experts in the field if there are some tips you could give me? See example here: https://drive.google.com/open?id=14PDZlbJ-HNFcHsPlE28cBT5VIxV9ceqW Dont mind the red parts. I have been doing at least some basic "protection". You have seen nothing obviously. Thanks! Tom A. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/4c9e4681-23dd-435b-a6f2-73ab78a122e7%40googlegroups.com.

