hello Everybody,
You can train this kind of "/" using tesseract. and use the train file
for solve this problem.
or
you can do some post processing algorithm.
On Tuesday, 18 August 2015 22:30:28 UTC+5:30, Michael Beauregard wrote:
>
> I don't think I ever found a solution to this, but it was so long ago I
> don't remember for sure. The project ended not long after posting the
> question and so I never had the chance to follow up.
>
> On Tue, Aug 18, 2015 at 8:39 AM, Ian Carroll <[email protected]
> <javascript:>> wrote:
>
>> Michael,
>>
>> Any chance you solved this (old) problem? I'm encountering the same issue
>> and haven't found a fix yet.
>>
>> Thanks,
>> Ian
>>
>> On Tuesday, January 14, 2014 at 3:30:40 PM UTC-5, Michael Beauregard
>> wrote:
>>>
>>> Hey everyone,
>>>
>>> I'm struggling to get an image with a date to be recognized correctly
>>> and would like some advice if possible.
>>>
>>> The image has the text "1946/05/29" using the following command:
>>>
>>> tesseract date.png date.out -psm 6
>>>
>>>
>>> is recognized as:
>>>
>>> $ cat date.out.txt
>>> 1 946I05I29
>>>
>>>
>>> I can deal with the unwanted space character easy enough, but I don't
>>> know what to do about the capital 'i' instead of forward slashes '/'.
>>> Interestingly enough, I enumerated through the ResultIterator and
>>> ChoiceIterator to see what symbols tesseract is matching and found that the
>>> forward slash isn't even considered:
>>>
>>>
>>> Result: I
>>> choice: I=99.000870
>>> choice: l=96.095596
>>> choice: !=89.777245
>>> choice: i=84.559441
>>>
>>> I would have expected one of the choices to be '/', but it wasn't.
>>>
>>> Any help would be greatly appreciated.
>>>
>>> Thanks,
>>>
>>> Michael
>>>
>> --
>> You received this message because you are subscribed to a topic in the
>> Google Groups "tesseract-ocr" group.
>> To unsubscribe from this topic, visit
>> https://groups.google.com/d/topic/tesseract-ocr/w5EYFpAetuo/unsubscribe.
>> To unsubscribe from this group and all its topics, send an email to
>> [email protected] <javascript:>.
>> To post to this group, send email to [email protected]
>> <javascript:>.
>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/4745b515-07eb-4218-a2ae-945c98d0b037%40googlegroups.com
>>
>> <https://groups.google.com/d/msgid/tesseract-ocr/4745b515-07eb-4218-a2ae-945c98d0b037%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/1fb62354-abcb-47a3-80dd-0041dd76163d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.