Thank you, I will try it.

-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of TP
Sent: Tuesday, April 24, 2012 12:00 AM
To: [email protected]
Subject: Re: How to get the word quality information

On Mon, Apr 23, 2012 at 5:20 AM, Binhua Liu <[email protected]> wrote:
> Hi all,
>
> I have tried to use
> api.SetVariable("tessedit_reject_bad_qual_wds","TRUE"); to set "reject
> bad quality word", but I still get many badly matched words. My questions are:
>
> 1. How can I set a lower bound on word quality, and then reject all
> words below that bound?
>
> 2. Can I get each word's quality information from the output? Or can you
> tell me which class and which property store the word quality
> information?

Dunno about #1, but for #2, assuming

    tesseract::TessBaseAPI *apiP;

then do:

    // Get bounding box coords of all words
    BOXA *boxesP = apiP->GetWords(NULL);

    // Get recognition confidence of all words
    int *wordConfidencesP = apiP->AllWordConfidences();

    ...

    // Cleanup
    boxaDestroy(&boxesP);
    delete [] wordConfidencesP;

See baseapi.h for details [1]

[1] http://code.google.com/p/tesseract-ocr/source/browse/trunk/api/baseapi.h#539

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [email protected]
For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en
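Question #1 went unanswered in the thread. One possible workaround, offered only as a sketch and not as something Tesseract does for you, is to apply your own cutoff to the array returned by AllWordConfidences(). The helper below assumes that array holds values from 0 to 100 and is terminated by -1, as the comment in baseapi.h describes. The name LowConfidenceWords and the minConfidence parameter are made up for illustration and are not part of the Tesseract API.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of Tesseract): returns the indices of
// words whose confidence is below minConfidence.
// `confidences` is assumed to be in the format returned by
// TessBaseAPI::AllWordConfidences(): values 0-100, terminated by -1.
std::vector<std::size_t> LowConfidenceWords(const int *confidences,
                                            int minConfidence) {
  std::vector<std::size_t> rejected;
  if (confidences == nullptr) {
    return rejected;  // nothing was recognized, so nothing to reject
  }
  for (std::size_t i = 0; confidences[i] != -1; ++i) {
    if (confidences[i] < minConfidence) {
      rejected.push_back(i);
    }
  }
  return rejected;
}
```

You would call it as LowConfidenceWords(wordConfidencesP, 60) before the delete [], and then skip or flag the words at the returned indices. The cutoff of 60 is an arbitrary example, so pick a value by checking it against your own images.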

