Hello guys, I'm experimenting with pytesseract and tesserocr to read some files received in my company mailbox.
One problem I'm running into is with symbols. This particular file has some "borders" made of "*" characters, but Tesseract recognizes them only as a sequence of "r", "k" and "e". For example, the attached one is transcribed as: "KRREKKKKKKK Shipping Instructions KREKKEKKKKKE". Is there a configuration option I can set to tell Tesseract that my text may contain symbols? Or at least to make it ignore them instead of trying to fit them to a character?
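To make the "ignore them" option concrete, here is a minimal sketch of a post-processing workaround. It does not change any Tesseract setting: it only assumes the OCR text is already available as a string (for example from `pytesseract.image_to_string`) and drops tokens that look like misread border runs. The function name `strip_border_runs`, the letter set and the minimum length of 6 are assumptions chosen to match the sample output above, not something Tesseract provides.

```python
import re

# Assumption: a misread "*" border shows up as one long token made only of
# the letters K, R, E (or literal asterisks). The minimum length of 6 is an
# arbitrary threshold meant to spare short real words.
BORDER_TOKEN = re.compile(r"[KRE*]{6,}")


def strip_border_runs(text: str) -> str:
    """Remove whitespace-separated tokens that look like misread borders."""
    cleaned_lines = []
    for line in text.splitlines():
        # Keep every token that is not entirely a border-like run.
        kept = [tok for tok in line.split() if not BORDER_TOKEN.fullmatch(tok)]
        cleaned_lines.append(" ".join(kept))
    return "\n".join(cleaned_lines)


if __name__ == "__main__":
    sample = "KRREKKKKKKK Shipping Instructions KREKKEKKKKKE"
    print(strip_border_runs(sample))
```

The obvious risk is that a genuine word of six or more letters built only from K, R and E would also be dropped, so the pattern would need tuning against real documents.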

