On Thursday, February 20, 2020 at 7:01:55 PM UTC+5:30, Justin Yeh wrote:
> Take this attached image for example, seems like Tesseract 3.3.0 (downloaded
> from nuget) cannot recognize few characters correctly: such as 8 and B, or 5
> and Z, or 0 and O etc...
>
> Is there any way that I could get string from image more accurately? or ...
> How do I avoid this kind of misinterpretation from tesseract? I have tried
> to make them in bold size but the result is the same....
>
> Any advice is welcome here. Thanks in advance!!!!

Hi again,

Here are the results from running Tesseract v4.0.0.20181030 on Windows. Please find the output attached.

Regards,
Lakshay

Attachment: test.pdf (Adobe PDF document)
82428B7B 1238B7B6
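
One possible way to reduce confusions such as 8/B, 5/Z, or 0/O is to post-process the recognized text, assuming the expected character class of each field is known in advance. Below is a minimal sketch in Python. The function name `normalize_confusables` and the mapping tables are hypothetical illustrations built from the pairs mentioned in the question; they are not part of Tesseract or any wrapper library.

```python
# Hypothetical post-processing helper; not part of Tesseract or any wrapper.
# Assumption: the caller knows whether a field should contain only digits
# or only uppercase letters.

# Look-alike pairs taken from the question: 8/B, 5/Z, 0/O.
LETTER_TO_DIGIT = {"B": "8", "Z": "5", "O": "0"}
DIGIT_TO_LETTER = {digit: letter for letter, digit in LETTER_TO_DIGIT.items()}


def normalize_confusables(text: str, expect: str) -> str:
    """Map look-alike characters to the expected class ('digits' or 'letters')."""
    if expect == "digits":
        table = LETTER_TO_DIGIT
    elif expect == "letters":
        table = DIGIT_TO_LETTER
    else:
        raise ValueError("expect must be 'digits' or 'letters'")
    # Characters without a look-alike entry are passed through unchanged.
    return "".join(table.get(ch, ch) for ch in text)
```

This only helps for fields with a known format. For a mixed alphanumeric string like the output above, it cannot decide which character was intended. Tesseract also has a `tessedit_char_whitelist` configuration variable that restricts the set of recognized characters, which may be worth trying when the allowed characters are known.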

