Hi. Have you tried disabling the dictionaries? See https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality#dictionaries-word-lists-and-patterns
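
In case it helps, here is a rough sketch of what that could look like when calling the tesseract command line from Python. The config variables load_system_dawg and load_freq_dawg are the ones described on the wiki page above. Everything else is an assumption on my side: the helper name build_tesseract_command, the file name id_line.png, and the choice of page segmentation mode 7 (single text line). I also assume a 4.x command line, where the option is spelled --psm (older 3.x builds use -psm). How much the dictionaries influence the result may depend on the engine and version, so treat this as something to try, not a guaranteed fix.

```python
def build_tesseract_command(image_path, psm=7, use_dictionary=False):
    """Build an argument list for the tesseract CLI that writes text to stdout.

    image_path is a hypothetical input file; psm=7 treats the image as a
    single text line (8 would treat it as a single word).
    """
    cmd = ["tesseract", image_path, "stdout", "--psm", str(psm)]
    if not use_dictionary:
        # Disable both word lists so letters are not "corrected" from context.
        cmd += ["-c", "load_system_dawg=false", "-c", "load_freq_dawg=false"]
    return cmd


# Hypothetical file name, used only for illustration.
command = build_tesseract_command("id_line.png")
```

The resulting list can be passed to subprocess.run(command, capture_output=True, text=True) on a machine where tesseract is installed. Building the list separately makes it easy to compare the output with and without the dictionaries by flipping use_dictionary.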
Best regards

On Tuesday, July 25, 2017 at 8:06:57 AM UTC-3, Jérémy Hannouna wrote:
>
> Hi,
>
> I'm trying to extract a number from a document. This number contains
> both letters and digits.
>
> If I try to recognize the line as a single word, the letter Z is
> automatically converted to a 2, and if I try to recognize the line as
> multiple words, the 6 before the Z is converted to a G.
>
> I don't know how to configure Tesseract to just recognize the characters
> without trying to interpret them in a "context".
>
> I'm not sure my explanation is clear.
>
> Can anyone here help me, please?
>
> Thank you
>

