A work-around could be easily implemented with a sed script. On Thu, May 24, 2018, 7:41 AM shree <[email protected]> wrote:
> Please try with script/Latin traineddata to see if you get better results. > > I have added your comment to issue at > https://github.com/tesseract-ocr/langdata/pull/54 > > > > On Thursday, May 24, 2018 at 5:05:55 PM UTC+5:30, Thomas Güttler wrote: >> >> I use tesseract 4.0 via docker (tesseractshadow/tesseract4re) >> >> Very often tesseract detects "StraBe" instead of "Straße". >> >> Yes, I use -l=deu >> >> The word "Straße" is very common in german. It means "street". >> >> Since "StraBe" makes no sense I would like to improve this. >> >> What do you suggest? >> >> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/494dba60-4142-4bfc-8b14-2cae4f8e71ed%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/494dba60-4142-4bfc-8b14-2cae4f8e71ed%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CA%2BOX7tofqPsY5RTNBCBWYBPa0dbYra5UwkCExgCtG%3D%2BjciOpAA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

