Thank you, I will try it.

-----Original Message-----
From: [email protected] [mailto:[email protected]]
On Behalf Of TP
Sent: Tuesday, April 24, 2012 12:00 AM
To: [email protected]
Subject: Re: How to get the word quality information

On Mon, Apr 23, 2012 at 5:20 AM, Binhua Liu <[email protected]>
wrote:
> Hi all,
>
> I have tried to use
> api.SetVariable("tessedit_reject_bad_qual_wds","TRUE"); to set "reject
> bad quality word", but I still get many badly matched words. My
> questions are:
>
> 1. How can I set a minimum word quality, so that all words below it
> are rejected?
>
> 2. Can I get the quality information for each word from the output?
> Or can you tell me which class and which property store the word
> quality information?

Dunno about #1, but for #2, assuming

  tesseract::TessBaseAPI *apiP;

then do:

  // Get bounding box coords of all words
  BOXA *boxesP = apiP->GetWords(NULL);

  // Get recognition confidence of all words (values 0 to 100);
  // the returned array is terminated by -1
  int *wordConfidencesP = apiP->AllWordConfidences();

...

  // Cleanup
  boxaDestroy(&boxesP);
  delete [] wordConfidencesP;


See baseapi.h for details [1]

[1]
http://code.google.com/p/tesseract-ocr/source/browse/trunk/api/baseapi.h#539
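
For #1, one possible workaround is to filter on the caller's side, using
the array returned by AllWordConfidences(). The sketch below is only an
illustration under stated assumptions: WordsAtOrAbove is a hypothetical
helper (it is not part of Tesseract), and the cutoff value is something
you would have to choose yourself. It relies only on the documented
format of the array: values from 0 to 100, terminated by -1.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of the Tesseract API.
// Returns the indices of the words whose confidence is at least
// minConfidence. `confidences` must be in the format returned by
// TessBaseAPI::AllWordConfidences(): values 0..100, terminated by -1.
std::vector<std::size_t> WordsAtOrAbove(const int *confidences,
                                        int minConfidence) {
  std::vector<std::size_t> kept;
  if (confidences == nullptr) {
    return kept;  // nothing recognized, nothing to keep
  }
  // Walk the array until the -1 terminator is reached.
  for (std::size_t i = 0; confidences[i] != -1; ++i) {
    if (confidences[i] >= minConfidence) {
      kept.push_back(i);
    }
  }
  return kept;
}
```

With the variables from above, a call could look like
WordsAtOrAbove(wordConfidencesP, 70), where 70 is an arbitrary example
cutoff. Whether index i in this array lines up with box i from
GetWords() is an assumption that should be checked against your own
output before relying on it.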

--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
