In Tesseract terminology this is called the "confidence" value. The hOCR output format contains confidence values at the word level, I believe.
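As a rough sketch of how those values could be read: in hOCR, each word is a span with class `ocrx_word`, and its confidence appears as `x_wconf` in the `title` attribute. The example below parses an hOCR string with the Python standard library and averages the word confidences. The names `WordConfidenceParser`, `mean_confidence`, and `SAMPLE_HOCR` are made up for illustration, the sample markup is hand-written rather than real Tesseract output, and the parser assumes word spans contain no nested spans.

```python
import re
from html.parser import HTMLParser
from statistics import mean


class WordConfidenceParser(HTMLParser):
    """Collect (word, confidence) pairs from hOCR 'ocrx_word' spans."""

    def __init__(self):
        super().__init__()
        self.words = []      # list of (text, confidence) tuples
        self._conf = None    # confidence of the word span currently open
        self._text = []      # text fragments of the word span currently open

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "span" and "ocrx_word" in classes:
            # The title looks like: "bbox 10 10 90 40; x_wconf 96"
            match = re.search(r"x_wconf\s+(\d+(?:\.\d+)?)", attrs.get("title") or "")
            if match:
                self._conf = float(match.group(1))
                self._text = []

    def handle_data(self, data):
        if self._conf is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        # Assumption: no nested <span> inside a word span.
        if tag == "span" and self._conf is not None:
            self.words.append(("".join(self._text).strip(), self._conf))
            self._conf = None


def mean_confidence(hocr_text):
    """Return the average word confidence (0-100), or 0.0 if no words were found."""
    parser = WordConfidenceParser()
    parser.feed(hocr_text)
    confidences = [conf for _, conf in parser.words]
    return mean(confidences) if confidences else 0.0


# Hand-written illustrative hOCR fragment, not real Tesseract output.
SAMPLE_HOCR = """
<span class='ocr_line' title='bbox 10 10 200 40'>
  <span class='ocrx_word' id='word_1_1' title='bbox 10 10 90 40; x_wconf 96'>Hello</span>
  <span class='ocrx_word' id='word_1_2' title='bbox 100 10 200 40; x_wconf 88'>world</span>
</span>
"""

if __name__ == "__main__":
    print(mean_confidence(SAMPLE_HOCR))
```

To compare two languages, one option is to run Tesseract once per language with the `hocr` config (for example `tesseract page.png out_lang1 -l lang1 hocr`), feed each resulting `.hocr` file to `mean_confidence`, and keep the language with the higher average. Note that the confidence is the engine's own estimate, not a measured accuracy percentage, so this is only a heuristic.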
On Saturday, June 29, 2019 at 8:53:05 AM UTC-5, Mox Betex wrote:
> Is it possible to get a percentage of accuracy for the recognized text?
>
> I need to recognize multiple languages (2 languages), and Tesseract doesn't
> know exactly which language the text is in when I pass the parameter -l lang1+lang2.
> What I want to do is scan with each language separately, but I would
> need some percentage of accuracy to determine the probability of each language.