[tesseract-ocr] Proper identify Symbols

Felipe Vegini Wed, 10 Jun 2020 20:42:46 -0700

Hello Guys, I'm making an experiment with pytesseract and tesseocr to read 
some files receives in my company mailbox.


One problem i`m finding is with symbols. This particular file has some 
"borders" made with "*"
But the tesseract recognizes it only as a sequence of "r", "k" and"e" , 
like the one attached he translate as: "KRREKKKKKKK Shipping Instructions 
KREKKEKKKKKE".


Is there some configuration that I may insert informing that my text may 
have symbols in it?
Or at least ignore them instead of try to fit them into a character.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/fae66339-edbe-4bc9-aba2-7d81bcc94733o%40googlegroups.com.

[tesseract-ocr] Proper identify Symbols

Reply via email to