I don't think I ever found a solution to this, but it was so long ago I don't remember for sure. The project ended not long after posting the question and so I never had the chance to follow up.
On Tue, Aug 18, 2015 at 8:39 AM, Ian Carroll <[email protected]> wrote: > Michael, > > Any chance you solved this (old) problem? I'm encountering the same issue > and haven't found a fix yet. > > Thanks, > Ian > > On Tuesday, January 14, 2014 at 3:30:40 PM UTC-5, Michael Beauregard wrote: >> >> Hey everyone, >> >> I'm struggling to get an image with a date to be recognized correctly and >> would like some advice if possible. >> >> The image has the text "1946/05/29" using the following command: >> >> tesseract date.png date.out -psm 6 >> >> >> is recognized as: >> >> $ cat date.out.txt >> 1 946I05I29 >> >> >> I can deal with the unwanted space character easy enough, but I don't >> know what to do about the capital 'i' instead of forward slashes '/'. >> Interestingly enough, I enumerated through the ResultIterator and >> ChoiceIterator to see what symbols tesseract is matching and found that the >> forward slash isn't even considered: >> >> >> Result: I >> choice: I=99.000870 >> choice: l=96.095596 >> choice: !=89.777245 >> choice: i=84.559441 >> >> I would have expected one of the choices to be '/', but it wasn't. >> >> Any help would be greatly appreciated. >> >> Thanks, >> >> Michael >> > -- > You received this message because you are subscribed to a topic in the > Google Groups "tesseract-ocr" group. > To unsubscribe from this topic, visit > https://groups.google.com/d/topic/tesseract-ocr/w5EYFpAetuo/unsubscribe. > To unsubscribe from this group and all its topics, send an email to > [email protected]. > To post to this group, send email to [email protected]. > Visit this group at http://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/4745b515-07eb-4218-a2ae-945c98d0b037%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/4745b515-07eb-4218-a2ae-945c98d0b037%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAAHFqa_iesan1yR8bQ6c22tEJda1x4ASLJE5qM6goEHvpfPgww%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

