A work-around could be easily implemented with a sed script.

On Thu, May 24, 2018, 7:41 AM shree <[email protected]> wrote:

> Please try with script/Latin traineddata to see if you get better results.
>
> I have added your comment to issue at
> https://github.com/tesseract-ocr/langdata/pull/54
>
>
>
> On Thursday, May 24, 2018 at 5:05:55 PM UTC+5:30, Thomas Güttler wrote:
>>
>> I use tesseract 4.0 via docker (tesseractshadow/tesseract4re)
>>
>> Very often tesseract detects "StraBe" instead of "Straße".
>>
>> Yes, I use -l=deu
>>
>> The word "Straße" is very common in german. It means "street".
>>
>> Since "StraBe" makes no sense I would like to improve this.
>>
>> What do you suggest?
>>
>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/494dba60-4142-4bfc-8b14-2cae4f8e71ed%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/494dba60-4142-4bfc-8b14-2cae4f8e71ed%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CA%2BOX7tofqPsY5RTNBCBWYBPa0dbYra5UwkCExgCtG%3D%2BjciOpAA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to