Hey everyone,

I'm struggling to get an image with a date to be recognized correctly and 
would like some advice if possible. 

The image has the text "1946/05/29" using the following command:

tesseract date.png date.out -psm 6


is recognized as:

$ cat date.out.txt
1 946I05I29


I can deal with the unwanted space character easy enough, but I don't know 
what to do about the capital 'i' instead of forward slashes '/'. 
Interestingly enough, I enumerated through the ResultIterator and 
ChoiceIterator to see what symbols tesseract is matching and found that the 
forward slash isn't even considered:
 

Result: I                                     
choice: I=99.000870                           
choice: l=96.095596                           
choice: !=89.777245                           
choice: i=84.559441                           

I would have expected one of the choices to be '/', but it wasn't.

Any help would be greatly appreciated.

Thanks,

Michael

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

<<attachment: date.png>>

Reply via email to