On Mon, Apr 23, 2012 at 5:20 AM, Binhua Liu <[email protected]> wrote:
> Hi all,
>
> I have try to use
> api.SetVariable("tessedit_reject_bad_qual_wds","TRUE"); to set "reject
> bad quality word", but still get many bad match words, my question is
>
> 1, how can I set the bottom line of word quality, then reject all
> words under the line?
>
> 2, Can I get each word quality information from output? or you can
> tell me which class and which property store the word quality
> information?
Dunno about #1, but for #2, assuming
tesseract::TessBaseAPI *apiP;
then do:
// Get bounding box coords of all words
BOXA *boxesP = apiP->GetWords(NULL);
// Get recognition confidence of all words
int *wordConfidencesP = apiP->AllWordConfidences();
...
// Cleanup
boxaDestroy(&boxesP);
delete [] wordConfidencesP;
See baseapi.h for details [1]
[1] http://code.google.com/p/tesseract-ocr/source/browse/trunk/api/baseapi.h#539
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en