[tesseract-ocr] Re: Percentage of accuracy

2019-06-29 Thread Quan Nguyen
Yes, its values range from 0 to 100.

On Saturday, June 29, 2019 at 12:00:45 PM UTC-5, Mox Betex wrote:
>
> I have found w_conf attribute in .hocr file.
> How should I interpret that value? Does high w_conf value means high 
> accuracy?
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/6343064a-1934-4fc3-ab1c-c69dbb37a1ae%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Percentage of accuracy

2019-06-29 Thread Mox Betex
I have found w_conf attribute in .hocr file.
How should I interpret that value? Does high w_conf value means high 
accuracy?

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/dc17a169-f81f-4c96-9f8f-5d6e253e8647%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Percentage of accuracy

2019-06-29 Thread Quan Nguyen
It's called "confidence" value in Tesseract terminology. hocr format output 
contains confidency values, at word level, I believe.

On Saturday, June 29, 2019 at 8:53:05 AM UTC-5, Mox Betex wrote:
>
> Is it possible to get percentage of accuracy of recognized text?
>
> I need to recognize multiple languages (2 languages) and tesseract doesn't 
> know exactly what language is when I put parametar -l lang1+lang2.
> What I want to do is to scan with both languages separately, but I would 
> need some percentage of accuracy to determine probability of language.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/85f970be-8bea-4c43-b3bc-0eb09534e9d7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Percentage of accuracy

2019-06-29 Thread Mox Betex
Is it possible to get percentage of accuracy of recognized text?

I need to recognize multiple languages (2 languages) and tesseract doesn't 
know exactly what language is when I put parametar -l lang1+lang2.
What I want to do is to scan with both languages separately, but I would 
need some percentage of accuracy to determine probability of language.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/431b5e21-18c6-4f4f-a78b-5b9e8374b249%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Invalid resolution 0, using 70dpi instead

2019-06-29 Thread Mox Betex
I tried to solve this problem using exiftool to write Units:PixelsPerInch 
in png file because it was Units:Undefined, but I had no luck.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/363fc182-2c07-48f3-aebf-504cbfb7a49d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.