Search the archives, use Google. Here's a link from someone's solution
http://code.google.com/p/tesseract-ocr/issues/detail?id=714#c1

On Tue, Jun 19, 2012 at 4:52 AM, eva charles <[email protected]> wrote:
> Hi...i executed the following code to generate character-wise confidence
> values:
>
> int main(int argc, char **argv) {
>
>         const char *lang="eng";
>         const PIX   *pixs;
>          if ((pixs = pixRead(argv[1])) == NULL) {
>            cout <<"Unsupported image type"<<endl;
>             exit(3);
>           }
>         TessBaseAPI  api;
>         api.SetVariable("save_blob_choices", "T");
>     api.SetPageSegMode(tesseract::PSM_SINGLE_WORD  );
>     api.SetImage(pixs);
>         int rc = api.Init(argv[0], lang);
>         api.Recognize(NULL);
>         ResultIterator* ri = api.GetIterator();
>         if(ri != 0)
>         {
>             do
>             {
>                 const char* symbol = ri->GetUTF8Text(RIL_SYMBOL);
>                 if(symbol != 0)
>                 {
>                     float conf = ri->Confidence(RIL_SYMBOL);
>                     cout<<"\nnext symbol: "<< symbol << " confidence: " <<
> conf <<"\n" <<endl;
>
>                  }
>
>
>                 delete[] symbol;
>                     }    while((ri->Next(RIL_SYMBOL)));
>         }
>         return 0;
> }
>
> the output obtained for the attached image was:
> next symbol: N confidence: 72.3563
> next symbol: B confidence: 72.3563
>
> next symbol: E confidence: 69.9937
> next symbol: T confidence: 69.9937
> next symbol: R confidence: 69.9937
> next symbol: A confidence: 69.9937
> next symbol: N confidence: 69.9937
> next symbol: G confidence: 69.9937
> next symbol: - confidence: 69.9937
> next symbol: I confidence: 69.9937
>
> As is evident, the confidence values for characters belonging to the same
> word is the same.
> Is this the expected output? Shouldn't the confidence values be different
> for each character?
> I tried executing the code for a word in which each character was in
> different font style..and yet, the confidence value was the same for
> characters belonging to the same word.
> PLease help me out..
>
> Thanks in Advance
> Uni.
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to