forgot to mention that I am using tesseract C++ API:
tesseract::ResultIterator* res_it = api->GetIterator();
tesseract::PageIteratorLevel level = tesseract::RIL_SYMBOL;
tesseract::ChoiceIterator ci(*res_it);
do {
if (ci.Confidence() >= 0) {
Choice* c = new Choice();
const char* ch = ci.GetUTF8Text();
}
} while (ci.Next());
[email protected] schrieb am Donnerstag, 3. September 2020 um 08:10:53
UTC+2:
> Hi all,
> I am using the new choice iterator in tesseract 5 to get the confidences
> for all choices for each symbol of my text. But spaces (word bounderies)
> are not shown, so I have no way to know when a space is between symbols. Is
> there a way to for example combine the word iterator with the choice
> iterator or any other way to know when a new word starts?
>
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/6a8400a4-e57b-40ad-bdd6-4184a58d76cen%40googlegroups.com.