Michael,

Any chance you solved this (old) problem? I'm encountering the same issue 
and haven't found a fix yet.

Thanks,
Ian

On Tuesday, January 14, 2014 at 3:30:40 PM UTC-5, Michael Beauregard wrote:
>
> Hey everyone,
>
> I'm struggling to get an image with a date to be recognized correctly and 
> would like some advice if possible. 
>
> The image has the text "1946/05/29" using the following command:
>
> tesseract date.png date.out -psm 6
>
>
> is recognized as:
>
> $ cat date.out.txt
> 1 946I05I29
>
>
> I can deal with the unwanted space character easy enough, but I don't know 
> what to do about the capital 'i' instead of forward slashes '/'. 
> Interestingly enough, I enumerated through the ResultIterator and 
> ChoiceIterator to see what symbols tesseract is matching and found that the 
> forward slash isn't even considered:
>  
>
> Result: I                                     
> choice: I=99.000870                           
> choice: l=96.095596                           
> choice: !=89.777245                           
> choice: i=84.559441                           
>
> I would have expected one of the choices to be '/', but it wasn't.
>
> Any help would be greatly appreciated.
>
> Thanks,
>
> Michael
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/4745b515-07eb-4218-a2ae-945c98d0b037%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to